Active filters: vLLM
mistralai/Mistral-Medium-3.5-128B • 128B • Updated • 333k • 344
mistralai/Mistral-Small-4-119B-2603 • 119B • Updated • 52.6k • 380
mistralai/Mistral-Small-4-119B-2603-NVFP4 • Updated • 1.15k • 90
QuantTrio/Qwen3.6-35B-A3B-AWQ • Image-Text-to-Text • 36B • Updated • 842k • 26
Text Generation • 754B • Updated • 1.53k • 8
QuantTrio/Qwen3.6-27B-AWQ • Image-Text-to-Text • 28B • Updated • 907k • 13
RecViking/Mistral-Medium-3.5-128B-NVFP4 • 74B • Updated • 13.8k • 7
unsloth/Mistral-Small-4-119B-2603-GGUF • 119B • Updated • 10.9k • 70
selode-ai/Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ-21.2GB • Image-Text-to-Text • 29B • Updated • 13.2k • 15
mistralai/Mistral-Medium-3.5-128B-EAGLE • Updated • 518 • 40
bartowski/mistralai_Mistral-Medium-3.5-128B-GGUF • Image-Text-to-Text • 125B • Updated • 12.3k • 8
cyankiwi/Mistral-Medium-3.5-128B-AWQ-INT4 • 25B • Updated • 18.7k • 3
inferencerlabs/Mistral-Medium-3.5-MLX-9bit • Image-Text-to-Text • Updated • 1.16k • 1
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • 9B • Updated • 66 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • 9B • Updated • 7 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • 73B • Updated • 61 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • 69B • Updated • 61
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • 15B • Updated • 5 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • 15B • Updated • 182 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • 15B • Updated • 513 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int4 • Text Generation • 0.6B • Updated • 210 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int8 • Text Generation • 0.6B • Updated • 9
JunHowie/Qwen3-1.7B-GPTQ-Int4 • Text Generation • 2B • Updated • 2.6k • 1
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 16
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 26.3k • 4
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 389 • 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 21 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 91 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 87.5k • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 697