Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper β’ 2506.01939 β’ Published Jun 2 β’ 176
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ May 15 β’ 116
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ May 12 β’ 495
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. β’ 43 items β’ Updated 6 days ago β’ 168
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 324
view article Article Rearchitecting Hugging Face Uploads and Downloads By jsulz and 2 others β’ Nov 26, 2024 β’ 48
view article Article From Files to Chunks: Improving Hugging Face Storage Efficiency By jsulz and 1 other β’ Nov 20, 2024 β’ 63
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr β’ Feb 7 β’ 196
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper β’ 2501.09686 β’ Published Jan 16 β’ 41
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen β’ Jan 15 β’ 199
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk β’ Oct 7, 2024 β’ 45
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated Dec 19, 2024 β’ 146
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy By medmekk and 5 others β’ Sep 18, 2024 β’ 264