Same Architecture, Different Capacity: Optimizer-Induced Spectral Scaling Laws Paper • 2605.21803 • Published 3 days ago • 2
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 2 days ago • 16
Q-ARVD: Quantizing Autoregressive Video Diffusion Models Paper • 2605.21072 • Published 3 days ago • 17
WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 2 days ago • 29
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning Paper • 2605.22012 • Published 2 days ago • 35
Toto 2.0: Time Series Forecasting Enters the Scaling Era Paper • 2605.20119 • Published 4 days ago • 35
Mem-π: Adaptive Memory through Learning When and What to Generate Paper • 2605.21463 • Published 3 days ago • 4
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook Paper • 2605.20266 • Published 5 days ago • 52
OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond Paper • 2605.19660 • Published 4 days ago • 39
Stage-adaptive Token Selection for Efficient Omni-modal LLMs Paper • 2605.20035 • Published 4 days ago • 4
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 4 days ago • 112
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 11 days ago • 186
Agent Bazaar: Enabling Economic Alignment in Multi-Agent Marketplaces Paper • 2605.17698 • Published 6 days ago • 6
AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents Paper • 2605.17933 • Published 5 days ago • 6
From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements Paper • 2605.17242 • Published 6 days ago • 11