Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published 17 days ago • 37
MinT: Managed Infrastructure for Training and Serving Millions of LLMs Paper • 2605.13779 • Published 8 days ago • 216
HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text • 27B • Updated 27 days ago • 606k • 339
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 7 days ago • 49
Jiunsong/supergemma4-26b-abliterated-multimodal-mlx-4bit Image-Text-to-Text • 5B • Updated Apr 18 • 6.23k • 52
MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference Paper • 2605.07363 • Published 13 days ago • 12
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key Paper • 2605.06638 • Published 14 days ago • 14