xlalex's Collections
encoder
data
svg
video
interleaved
ocr
3d
world model
omni
infra
synthesis
perception
survey
RL
critic
speech full duplex
agent
self-play

agent

updated 11 days ago

  • Agent Learning via Early Experience

    Paper • 2510.08558 • Published Oct 9 • 262

  • The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

    Paper • 2509.02547 • Published Sep 2 • 224

  • Scaling Agents via Continual Pre-training

    Paper • 2509.13310 • Published Sep 16 • 115

  • Agent Lightning: Train ANY AI Agents with Reinforcement Learning

    Paper • 2508.03680 • Published Aug 5 • 119

  • PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold

    Paper • 2510.15862 • Published 29 days ago • 9

  • Interleaved Reasoning for Large Language Models via Reinforcement Learning

    Paper • 2505.19640 • Published May 26 • 14