thinking, CoT - a ljupco Collection

ljupco 's Collections

RL - Reinforcement Learning

agents

context, prompt

speed efficiency gains

thinking, CoT

updated Sep 9

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 106
Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 147