
Noah

noahml

https://researchpod.app

researchpodapp

AI & ML interests

None yet

Recent Activity

commented on a paper about 1 hour ago

Can Vision-Language Models Solve the Shell Game?

commented on a paper about 1 hour ago

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement

commented on a paper about 1 hour ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding


Organizations

None yet

commented on 5 papers about 1 hour ago

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published Mar 9 • 39

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement

Paper • 2603.06333 • Published Mar 6 • 1

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published Apr 9 • 20

Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

Paper • 2604.08537 • Published Apr 9 • 9

Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory

Paper • 2604.11544 • Published Apr 13 • 4

commented on a paper about 7 hours ago

Models That Know How Evaluations Are Designed Score Safer

Paper • 2605.28591 • Published 3 days ago • 4

commented on 5 papers about 8 hours ago

SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context

Paper • 2604.11716 • Published Apr 13 • 5

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 3 days ago • 78

Models That Know How Evaluations Are Designed Score Safer

Paper • 2605.28591 • Published 3 days ago • 4

Self-Improving Language Models with Bidirectional Evolutionary Search

Paper • 2605.28814 • Published 3 days ago • 50

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

Paper • 2605.25378 • Published 5 days ago • 48

commented on 6 papers 1 day ago

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

Paper • 2605.27354 • Published 4 days ago • 12

Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization

Paper • 2605.28109 • Published 3 days ago • 17

commented on 3 papers 3 months ago

Computer-Using World Model

Paper • 2602.17365 • Published Feb 19 • 18

Fast KV Compaction via Attention Matching

Paper • 2602.16284 • Published Feb 18 • 1

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 75
