Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Model Tree
Reset
mlx-community/Qwen3-32B-8bit
Quantizations
Apps
LM Studio
MLX LM
Apps with no match
llama.cpp
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
Inference Providers
Inference Providers with no match
Fireworks
Novita
Nebius AI
Together AI
Featherless AI
fal
Cerebras
Nscale
SambaNova
Hyperbolic
Groq
Replicate
Cohere
HF Inference API
Misc
4-bit precision
Misc with no match
Inference Endpoints
text-generation-inference
Eval Results
Merge
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
1
Full-text search
Edit filters
Sort: Trending
Active filters:
mlx-community/Qwen3-32B-8bit
Clear all
mlx-community/Qwen3-32B-4bit-DWQ
Text Generation
•
Updated
May 17
•
1.86k
•
1