Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

5,015

Base only

Active filters: llama.cpp

pramodlohra/Qween3_4B_thinking_finetune

4B • Updated Oct 21, 2025 • 14.6k • 2

bigatuna/Qwen3.5-9b-Sushi-Coder-RL-GGUF

Text Generation • 9B • Updated Apr 20 • 12.5k • 83

AtomicChat/gemma-4-E2B-it-assistant-GGUF

Text Generation • 78M • Updated 26 days ago • 6.24k • 7

11-47/GPT2.5.2-NSFW-Codex-0.4B-GGUF

Text Generation • 0.4B • Updated May 2 • 1.27k • 11

Mozzipa/ko_vicuna_7b_ggml_q4

Updated Apr 20, 2023 • 4

ManthanKulakarni/JQL_LLaMa_GGML

Text Generation • Updated Jun 26, 2023 • 2

mys/ggml_llava-v1.5-7b

7B • Updated Oct 9, 2023 • 2.95k • 113

mys/ggml_llava-v1.5-13b

13B • Updated Oct 10, 2023 • 417 • 55

mys/ggml_bakllava-1

7B • Updated Oct 19, 2023 • 489 • 75

wasmedge/llama2

Text Generation • 13B • Updated Nov 11, 2023 • 163 • 8

alvarobartt/lince-zero-7b-GGUF

Text Generation • 7B • Updated Nov 1, 2023 • 52

vietgpt/dama-2-7b-chat-gguf

Text Generation • 7B • Updated Nov 17, 2023 • 19 • 1

FlexingD/yarn-mistral-7B-64k-instruct-alpaca-cleaned-GGUF

7B • Updated Dec 4, 2023 • 63 • 5

jdluzen/Mistral-7B-Instruct-v0.2-GGUF

7B • Updated Dec 24, 2023 • 8

jacobhoffmann/CodeLlama-13B-TestGen-Dart_v0.2-GGUF

Text Generation • 13B • Updated Dec 11, 2024 • 49

mostafaamiri/persian-llama-7b-GGUF-Q4

7B • Updated Jan 13, 2024 • 104 • 9

ayoubkirouane/Mistral-Depth-UP-Scaled-9B-AlpacaInstruct-gguf

Text Generation • 9B • Updated Jan 24, 2024 • 21

ehristoforu/LLMs

Text Generation • 7B • Updated Apr 16, 2024 • 5 • 4

osanseviero/DareVox-7B-AWQ

7B • Updated Feb 7, 2024 • 7

ahmetkca/trendyol-7B-v1.0-f16-gguf

7B • Updated Feb 15, 2024 • 5

ahmetkca/trendyol-7B-v1.0-f32-gguf

7B • Updated Feb 15, 2024 • 7

google/gemma-7b-it-GGUF

9B • Updated Aug 14, 2024 • 43 • 45

google/gemma-7b-GGUF

9B • Updated Jun 27, 2024 • 33 • 22

google/gemma-2b-GGUF

3B • Updated Jun 27, 2024 • 83 • 20

iAkashPaul/Indic-gemma-2b-finetuned-sft-Navarasa-GGUF

3B • Updated Mar 8, 2024 • 35 • 3

MrOvkill/gemma-2-inference-endpoint-GGUF

Text Generation • Updated Mar 11, 2024 • 8

google/gemma-1.1-7b-it-GGUF

9B • Updated Jun 27, 2024 • 22

google/gemma-1.1-2b-it-GGUF

3B • Updated Jun 27, 2024 • 13 • 21

webbigdata/C3TR-Adapter_gguf

Translation • 9B • Updated Aug 14, 2024 • 175 • 26

google/codegemma-2b-GGUF

Text Generation • 3B • Updated Jun 27, 2024 • 84 • 35