Persona Vectors: Monitoring and Controlling Character Traits in Language Models Paper • 2507.21509 • Published 8 days ago • 23
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models Paper • 2507.23682 • Published 6 days ago • 21
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published 5 days ago • 96
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published 7 days ago • 59
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Paper • 2507.14111 • Published 18 days ago • 22
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Paper • 2507.22058 • Published 7 days ago • 36
Diversity-Enhanced Reasoning for Subjective Questions Paper • 2507.20187 • Published 10 days ago • 22
Region-based Cluster Discrimination for Visual Representation Learning Paper • 2507.20025 • Published 10 days ago • 17
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning Paper • 2507.21049 • Published 8 days ago • 38
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset Paper • 2507.21033 • Published 8 days ago • 20
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper • 2507.20984 • Published 8 days ago • 51
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement Paper • 2507.18742 • Published 12 days ago • 5
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning Paper • 2507.19457 • Published 11 days ago • 20
∇NABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published 19 days ago • 118
Pixels, Patterns, but No Poetry: To See The World like Humans Paper • 2507.16863 • Published 15 days ago • 65