QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks Paper • 2605.24218 • Published 11 days ago • 41
VibeSearchBench: Benchmarking Long-horizon Proactive Search in the Wild Paper • 2605.27882 • Published 6 days ago • 12
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 5 days ago • 123
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence Paper • 2605.26494 • Published 7 days ago • 37
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 12 days ago • 30
Toto 2.0: Time Series Forecasting Enters the Scaling Era Paper • 2605.20119 • Published 14 days ago • 38
Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution Paper • 2605.15301 • Published 19 days ago • 22
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published 29 days ago • 40
MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning Paper • 2605.13037 • Published 20 days ago • 8
It's TIME: Towards the Next Generation of Time Series Forecasting Benchmarks Paper • 2602.12147 • Published Mar 4 • 4
Retrieval from Within: An Intrinsic Capability of Attention-Based Models Paper • 2605.05806 • Published 25 days ago • 6
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 21 days ago • 125
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 25 days ago • 69
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI Paper • 2605.06651 • Published 26 days ago • 15
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems Paper • 2605.04018 • Published 28 days ago • 40
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published May 2 • 24