DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published 22 days ago • 95
The Art of Scaling Reinforcement Learning Compute for LLMs Paper • 2510.13786 • Published Oct 15 • 30
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published Oct 3 • 96
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 137
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published Sep 30 • 3 • 2
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published Sep 30 • 3
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published Sep 30 • 3
AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs Paper • 2509.08031 • Published Sep 9 • 21
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 188