Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 7 days ago • 96 • 3
Article Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 28 days ago • 11
Moonlight-A3B Collection Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer • 3 items • Updated 4 days ago • 7
Kimi-Linear-A3B Collection Moonshot's experimental MoE model with Kimi Delta Attention • 3 items • Updated 6 days ago • 10