Agentic RL - a rohjain Collection

rohjain 's Collections

Agentic RL

updated Oct 3

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs

Paper • 2509.25779 • Published Sep 30 • 16