Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers
Inference Providers with no match
Fireworks
Novita
Nebius AI
Together AI
Featherless AI
fal
Cerebras
Nscale
SambaNova
Hyperbolic
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
llama.cpp
Inference Endpoints
4-bit precision
text-generation-inference
Merge
Eval Results
Misc with no match
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
183
Full-text search
Edit filters
Sort: Trending
Active filters:
llama.cpp
Clear all
tifin-india/sarvam-m-24b-q3-k-s-gguf
Text Generation
•
Updated
May 24
•
88
tifin-india/sarvam-m-24b-q3-k-gguf
Text Generation
•
Updated
May 24
•
91
tifin-india/sarvam-m-24b-q4-k-m-gguf
Text Generation
•
Updated
May 24
•
125
•
1
tifin-india/sarvam-m-24b-q3-k-m-gguf
Text Generation
•
Updated
May 24
•
40
tifin-india/sarvam-m-24b-q4-k-s-gguf
Text Generation
•
Updated
May 24
•
80
tifin-india/sarvam-m-24b-q5-k-m-gguf
Text Generation
•
Updated
May 24
•
247
•
2
ykarout/MiMo-VL-7B-SFT-GGUF
Image-Text-to-Text
•
Updated
23 days ago
•
175
XythicK/Qwen.Qwen2.5-Math-1.5B-GGUF
Updated
21 days ago
•
46
Govind222/Koyna-V2-1b-instruct-GGUF
Updated
20 days ago
•
2
agentlans/SmolLM2-135M-Instruct-GGUF
Updated
19 days ago
•
66
ReallyFloppyPenguin/Holo1-3B-GGUF
Updated
15 days ago
•
172
•
2
mgonzs13/SpaceOm-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
300
•
1
Darkhn/L3.3-70B-Animus-V1-GGUF
Updated
9 days ago
•
429
allura-quants/allura-org_Q3-8B-Kintsugi-GGUF
Updated
11 days ago
ReallyFloppyPenguin/sarvam-m-GGUF
Updated
11 days ago
•
177
•
1
ReallyFloppyPenguin/DeepSeek-R1-0528-Qwen3-8B-GGUF
Updated
11 days ago
•
110
ReallyFloppyPenguin/MiniCPM4-8B-GGUF
Updated
11 days ago
•
51
ReallyFloppyPenguin/Nemotron-Research-Reasoning-Qwen-1.5B-GGUF
Updated
11 days ago
•
154
•
1
ReallyFloppyPenguin/OpenCodeReasoning-Nemotron-14B-GGUF
Updated
9 days ago
•
94
•
1
ReallyFloppyPenguin/Jan-nano-GGUF
Updated
9 days ago
•
86
ReallyFloppyPenguin/Qwen2.5-Math-7B-GGUF
Updated
9 days ago
ReallyFloppyPenguin/Qwen3-0.6B-GGUF
Updated
9 days ago
•
82
ReallyFloppyPenguin/Holo1-7B-GGUF
Updated
9 days ago
•
92
ReallyFloppyPenguin/DeepSeek-R1-Distill-Qwen-32B-GGUF
Updated
8 days ago
•
35
ReallyFloppyPenguin/Gemma-3-Gaia-PT-BR-4b-it-GGUF
Updated
8 days ago
•
91
ReallyFloppyPenguin/Qwen3-30B-A3B-GGUF
Updated
7 days ago
•
10
ReallyFloppyPenguin/II-Medical-8B-1706-GGUF
Updated
5 days ago
•
41
Darkhn/L3.3-70B-Animus-V4-Final-GGUF
Updated
about 1 hour ago
ReallyFloppyPenguin/Polaris-4B-Preview-GGUF
Updated
2 days ago
ReallyFloppyPenguin/Arch-Agent-7B-GGUF
Updated
2 days ago
Previous
1
...
4
5
6
7
Next