Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts Paper • 2509.23188 • Published Sep 27 • 3
Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning Paper • 2509.11420 • Published Sep 14 • 2
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 137
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 137
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification Paper • 2502.07299 • Published Feb 11 • 2
SemiReward: A General Reward Model for Semi-supervised Learning Paper • 2310.03013 • Published Oct 4, 2023 • 2
A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN Paper • 2205.13943 • Published May 27, 2022 • 1
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper • 2503.07459 • Published Mar 10 • 16
PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking Paper • 2505.01700 • Published May 3 • 1
AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity Paper • 2505.23520 • Published May 29
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 75
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery Paper • 2406.18151 • Published Jun 26, 2024 • 1