view article Article Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • 6 days ago • 11
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Paper • 2506.15211 • Published 8 days ago • 31
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published 9 days ago • 35
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques Paper • 2506.08060 • Published 17 days ago • 7
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model Paper • 2505.14135 • Published May 20 • 15
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 53
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published 28 days ago • 23
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 24 days ago • 161
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published 27 days ago • 93
view article Article System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience By codelion • 24 days ago • 13
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 78
view article Article AutoThink: Adaptive Reasoning for Large Language Models By codelion • 30 days ago • 4
view article Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • May 20 • 26
view article Article Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training By codelion • May 17 • 5
Pivotal Token Search Collection Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated May 14 • 3