Learning to Self-Verify Makes Language Models Better Reasoners Paper • 2602.07594 • Published Feb 7 • 3
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 23 days ago • 111
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published Feb 11 • 31
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published Jan 29 • 25
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper • 2509.22611 • Published Sep 26, 2025 • 119
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper • 2509.22611 • Published Sep 26, 2025 • 119
\texttt{R$^\textbf{2}$AI}: Towards Resistant and Resilient AI in an Evolving World Paper • 2509.06786 • Published Sep 8, 2025 • 3
R^textbf{2AI}: Towards Resistant and Resilient AI in an Evolving World Paper • 2509.06786 • Published Sep 8, 2025 • 3
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation Paper • 2502.12638 • Published Feb 18, 2025 • 9
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published Dec 12, 2024 • 21
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 38
Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction Paper • 2310.18770 • Published Oct 28, 2023
Discovering Spatio-Temporal Rationales for Video Question Answering Paper • 2307.12058 • Published Jul 22, 2023
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models Paper • 2410.07133 • Published Oct 9, 2024 • 19
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 38
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning Paper • 2407.04078 • Published Jul 4, 2024 • 21