Kimi-Linear-A3B Collection Moonshot's experimental MoE model with Kimi Delta Attention • 3 items • Updated 3 days ago • 8
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 13 items • Updated 3 days ago • 32
ibm-granite/granite-guardian-hap-38m Text Classification • 38.5M • Updated Dec 19, 2024 • 6.39k • 42