nielsr HF Staff committed on
Commit 19dd4f4 · verified · 1 Parent(s): 4d08491

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -9,6 +9,7 @@ library_name: transformers
 
 - Try out the model on [![Featherless](https://img.shields.io/badge/featherless--ai%2FQwerky--72B-Dummy?style=flat&label=Featherless&color=facc15)](https://featherless.ai/models/featherless-ai/Qwerky-72B)
 - Model details from our blog post here! [![Substack](https://img.shields.io/badge/Substack-Dummy?style=flat&color=facc15)](https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large)
+- This model was presented in [RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale](https://huggingface.co/papers/2505.03005).
 
 Benchmarks is as follows for both Qwerky-QwQ-32B and Qwerky-72B models: