Active filters: sparsity
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic • Text Generation • 8B • Updated • 12
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic • Text Generation • 8B • Updated • 15 • 1
RedHatAI/Sparse-Llama-3.1-8B-2of4 • Text Generation • 8B • Updated • 198 • 62
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 • Text Generation • 2B • Updated • 20 • 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4 • Text Generation • 8B • Updated • 19 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4 • Text Generation • 8B • Updated • 12 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16 • Text Generation • 2B • Updated • 21
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 • Text Generation • 2B • Updated • 17
bartowski/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 716 • 3
QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 275 • 4
tensorblock/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 219
nintwentydo/pixtral-12b-2409-2of4-sparse • Image-Text-to-Text • 13B • Updated • 17 • 1
HangGuo/Llama2-70B-QuaRot-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 2 • 1
HangGuo/Llama2-70B-QuaRot-OBR-RTN-W4A4KV4S50 • Text Generation • Updated • 3
HangGuo/Llama2-70B-SpinQuant-OBR-RTN-W4A4KV4S50 • Text Generation • Updated • 3
HangGuo/Llama2-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 3
HangGuo/Llama3-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 2
HangGuo/Llama3-70B-SpinQuant-OBR-RTN-W4A4KV4S50 • Text Generation • Updated • 6
HangGuo/Llama3-70B-QuaRot-OBR-RTN-W4A4KV16S50 • Text Generation • Updated • 3
HangGuo/Llama3-70B-QuaRot-OBR-GPTQ-W4A4KV16S50 • Text Generation • Updated • 3
HangGuo/QWen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 6
HangGuo/QWen2.5-32B-FlatQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 3
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A8KV16S50 • Text Generation • Updated • 4
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A8KV16S50 • Text Generation • Updated • 4
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 2
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A4KV4S50 • Text Generation • Updated • 3