Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ljupco
's Collections
RL - Reinforcement Learning
agents
context, prompt
speed efficiency gains
thinking, CoT
thinking, CoT
updated
Sep 9
Upvote
-
Test-Time Scaling with Reflective Generative Model
Paper
•
2507.01951
•
Published
Jul 2
•
106
Reverse-Engineered Reasoning for Open-Ended Generation
Paper
•
2509.06160
•
Published
Sep 7
•
147
Upvote
-
Share collection
View history
Collection guide
Browse collections