Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,7 @@ library_name: transformers
|
|
9 |
|
10 |
- Try out the model on [](https://featherless.ai/models/featherless-ai/Qwerky-72B)
|
11 |
- Model details from our blog post here! [](https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large)
|
|
|
12 |
|
13 |
Benchmarks is as follows for both Qwerky-QwQ-32B and Qwerky-72B models:
|
14 |
|
|
|
9 |
|
10 |
- Try out the model on [](https://featherless.ai/models/featherless-ai/Qwerky-72B)
|
11 |
- Model details from our blog post here! [](https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large)
|
12 |
+
- This model was presented in [RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale](https://huggingface.co/papers/2505.03005).
|
13 |
|
14 |
Benchmarks is as follows for both Qwerky-QwQ-32B and Qwerky-72B models:
|
15 |
|