Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper β’ 2506.09250 β’ Published 15 days ago β’ 27
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper β’ 2506.05209 β’ Published 21 days ago β’ 41
One-RL-to-See-Them-All Collection One RL to See Them All: Visual Triple Unified Reinforcement Learning. GitHub: https://github.com/MiniMax-AI/One-RL-to-See-Them-All β’ 5 items β’ Updated 16 days ago β’ 27
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper β’ 2505.17612 β’ Published May 23 β’ 78
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper β’ 2505.22954 β’ Published 28 days ago β’ 11
view article Article π Introducing **Moon**: Storytelling Generator Model By kulia-moon and 1 other β’ 27 days ago β’ 6
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published 30 days ago β’ 61
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper β’ 2505.19297 β’ Published May 25 β’ 77
view article Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks π±π§πΌβπ» By sasha β’ 29 days ago β’ 21
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ May 23 β’ 135
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ about 1 month ago β’ 44
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 86