ggml-org

Team

company

ggml_org

ggml-org

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

allozaur updated a bucket 9 minutes ago

ggml-org/llama-ui

ggerganov updated a model about 11 hours ago

ggml-org/Qwen3.6-35B-A3B-GGUF

ggerganov updated a model about 11 hours ago

ggml-org/Qwen3.6-27B-GGUF

View all activity

Articles

ggml-org 's collections 24

NVIDIA-Nemotron-3-Nano-Omni

ggml-org/NVIDIA-Nemotron-3-Nano-Omni

32B • Updated 24 days ago • 4.25k • 6

Gemma 4

ggml-org/gemma-4-E2B-it-GGUF

5B • Updated Apr 12 • 68.2k • 70
ggml-org/gemma-4-E4B-it-GGUF

8B • Updated Apr 12 • 102k • 54
ggml-org/gemma-4-26B-A4B-it-GGUF

25B • Updated Apr 12 • 442k • 63
ggml-org/gemma-4-31B-it-GGUF

31B • Updated Apr 12 • 49.8k • 40

Devstral 2

Collection for Devstral-Small-2-24B-Instruct-2512 models

ggml-org/Devstral-Small-2-24B-Instruct-2512-GGUF

24B • Updated Dec 18, 2025 • 302 • 6
ggml-org/Devstral-2-123B-Instruct-2512-GGUF

125B • Updated Dec 19, 2025 • 45 • 2

Multimodal GGUFs

Vision and audio models compatible with llama-server and llama-mtmd-cli

GLM-V

Collection

4 items • Updated Dec 17, 2025 • 14
Ministral 3

Collection

6 items • Updated Dec 16, 2025 • 4
Gemma 3

Collection

10 items • Updated Dec 16, 2025 • 24
Kimi-VL

Collection

1 item • Updated Mar 2 • 2

Ministral 3

ggml-org/Ministral-3-3B-Reasoning-2512-GGUF

Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 214 • 2
ggml-org/Ministral-3-8B-Reasoning-2512-GGUF

Image-Text-to-Text • 8B • Updated Dec 2, 2025 • 159 • 1
ggml-org/Ministral-3-14B-Reasoning-2512-GGUF

Image-Text-to-Text • 14B • Updated Dec 2, 2025 • 269 • 3
ggml-org/Ministral-3-3B-Instruct-2512-GGUF

Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 240 • 4

Gemma 3-270m

Collection of models for Gemma 3-270m

ggml-org/gemma-3-270m-GGUF

0.3B • Updated Aug 14, 2025 • 1.63k • 20
ggml-org/gemma-3-270m-it-GGUF

0.3B • Updated Aug 15, 2025 • 2.28k • 22
ggml-org/gemma-3-270m-qat-GGUF

0.3B • Updated Aug 14, 2025 • 14.2k • 10
ggml-org/gemma-3-270m-it-qat-GGUF

0.3B • Updated Aug 15, 2025 • 6.42k • 12

Kimi-VL

ggml-org/Kimi-VL-A3B-Thinking-2506-GGUF

16B • Updated Aug 20, 2025 • 5.56k • 29

VAD

Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad

Updated Nov 17, 2025 • 17

Qwen 2 VL and Qwen 2.5 VL

ggml-org/Qwen2.5-VL-3B-Instruct-GGUF

3B • Updated Apr 30, 2025 • 7.2k • 6
ggml-org/Qwen2.5-VL-7B-Instruct-GGUF

8B • Updated Apr 30, 2025 • 8.21k • 10
ggml-org/Qwen2.5-VL-32B-Instruct-GGUF

33B • Updated May 15, 2025 • 306 • 5
ggml-org/Qwen2-VL-2B-Instruct-GGUF

2B • Updated Apr 30, 2025 • 2.5k • 2

SmolVLM GGUF

ggml-org/SmolVLM2-2.2B-Instruct-GGUF

2B • Updated Apr 30, 2025 • 24.9k • 33
ggml-org/SmolVLM2-500M-Video-Instruct-GGUF

0.4B • Updated Apr 30, 2025 • 21.3k • 16
ggml-org/SmolVLM2-256M-Video-Instruct-GGUF

0.2B • Updated Apr 30, 2025 • 9.45k • 9
ggml-org/SmolVLM-Instruct-GGUF

2B • Updated Apr 30, 2025 • 22.5k • 9

llama.cpp presets

Models that are used for presets in llama.cpp.

ggml-org/gte-small-Q8_0-GGUF

Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 99 • 2
ggml-org/bge-small-en-v1.5-Q8_0-GGUF

Feature Extraction • 33.2M • Updated Feb 6, 2025 • 2.54k • 6
ggml-org/e5-small-v2-Q8_0-GGUF

Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 59 • 1

llama.vim

ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF

Text Generation • 0.5B • Updated Jan 31, 2025 • 792 • 10
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF

Text Generation • 2B • Updated Oct 28, 2024 • 3.36k • 17
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF

Text Generation • 3B • Updated Nov 26, 2024 • 2.17k • 9
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF

Text Generation • 8B • Updated Oct 28, 2024 • 2.07k • 9

OCR models

ggml-org/GLM-OCR-GGUF

0.9B • Updated Mar 10 • 26.8k • 62
ggml-org/DeepSeek-OCR-GGUF

3B • Updated Mar 25 • 13.1k • 10
ggml-org/dots.ocr-GGUF

2B • Updated Apr 5 • 1.17k • 4
ggml-org/Qianfan-OCR-GGUF

4B • Updated Apr 10 • 741 • 3

NVIDIA Nemotron 3 Super

Collection for Nemotron-3-Super-120B models

ggml-org/Nemotron-3-Super-120B-GGUF

121B • Updated Mar 16 • 375 • 11

NVIDIA Nemotron 3

Collection for Nemotron-Nano-3-30B-A3B models

ggml-org/Nemotron-Nano-3-30B-A3B-GGUF

32B • Updated Dec 16, 2025 • 6.39k • 15

GLM-V

ggml-org/GLM-4.6V-Flash-GGUF

9B • Updated Jan 15 • 997 • 22
ggml-org/GLM-4.6V-GGUF

107B • Updated Jan 15 • 9.25k • 7
ggml-org/AutoGLM-Phone-9B-GGUF

9B • Updated Dec 17, 2025 • 2.72k • 3
ggml-org/GLM-4.5V-GGUF

107B • Updated Feb 17 • 178 • 5

EmbeddingGemma 300M

ggml-org/embeddinggemma-300M-GGUF

0.3B • Updated 24 days ago • 444k • 31
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF

Feature Extraction • 0.3B • Updated Sep 15, 2025 • 1.05k • 5
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF

Feature Extraction • 0.3B • Updated Sep 15, 2025 • 28k • 16

GPT OSS

ggml-org/gpt-oss-120b-GGUF

117B • Updated Oct 30, 2025 • 376k • 72
ggml-org/gpt-oss-20b-GGUF

21B • Updated Oct 30, 2025 • 90k • 152

Gemma 3n

ggml-org/gemma-3n-E2B-it-GGUF

4B • Updated Aug 22, 2025 • 4.15k • 25
ggml-org/gemma-3n-E4B-it-GGUF

7B • Updated Jun 26, 2025 • 5.86k • 20

InternVL 3 and InternVL 2.5

ggml-org/InternVL3-1B-Instruct-GGUF

0.6B • Updated May 10, 2025 • 210 • 4
ggml-org/InternVL3-2B-Instruct-GGUF

2B • Updated May 10, 2025 • 230 • 5
ggml-org/InternVL3-8B-Instruct-GGUF

8B • Updated May 10, 2025 • 284 • 6
ggml-org/InternVL3-14B-Instruct-GGUF

15B • Updated May 10, 2025 • 201 • 4

Qwen 3

ggml-org/Qwen3-0.6B-GGUF

0.8B • Updated Sep 28, 2025 • 69.5k • 14
ggml-org/Qwen3-1.7B-GGUF

2B • Updated Apr 28, 2025 • 5.42k • 7
ggml-org/Qwen3-4B-GGUF

4B • Updated Apr 28, 2025 • 1.56k • 6
ggml-org/Qwen3-8B-GGUF

8B • Updated Apr 28, 2025 • 1.66k • 6

Gemma 3

ggml-org/gemma-3-270m-it-GGUF

0.3B • Updated Aug 15, 2025 • 2.28k • 22
ggml-org/gemma-3-1b-it-GGUF

1.0B • Updated Mar 12, 2025 • 20.9k • 29
ggml-org/gemma-3-4b-it-GGUF

Image-Text-to-Text • 4B • Updated May 21, 2025 • 51.1k • 52
ggml-org/gemma-3-12b-it-GGUF

Image-Text-to-Text • 12B • Updated May 21, 2025 • 4.9k • 31

GGUF LoRA adapters

Adapters extracted from fine tuned models, using mergekit-extract-lora

ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF

88.1M • Updated Nov 1, 2024 • 9
ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF

93.6M • Updated Jan 23, 2025 • 56 • 4
ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 9, 2025 • 23 • 1
ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF

90.9M • Updated Jan 8, 2025 • 26 • 3

Gemma 1.1 GGUFs

ggml-org/gemma-1.1-2b-it-Q8_0-GGUF

3B • Updated Apr 5, 2024 • 49 • 1
ggml-org/gemma-1.1-7b-it-Q8_0-GGUF

9B • Updated Apr 5, 2024 • 16
ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF

9B • Updated Apr 5, 2024 • 310 • 4

NVIDIA-Nemotron-3-Nano-Omni

ggml-org/NVIDIA-Nemotron-3-Nano-Omni

32B • Updated 24 days ago • 4.25k • 6

OCR models

ggml-org/GLM-OCR-GGUF

0.9B • Updated Mar 10 • 26.8k • 62
ggml-org/DeepSeek-OCR-GGUF

3B • Updated Mar 25 • 13.1k • 10
ggml-org/dots.ocr-GGUF

2B • Updated Apr 5 • 1.17k • 4
ggml-org/Qianfan-OCR-GGUF

4B • Updated Apr 10 • 741 • 3

Gemma 4

ggml-org/gemma-4-E2B-it-GGUF

5B • Updated Apr 12 • 68.2k • 70
ggml-org/gemma-4-E4B-it-GGUF

8B • Updated Apr 12 • 102k • 54
ggml-org/gemma-4-26B-A4B-it-GGUF

25B • Updated Apr 12 • 442k • 63
ggml-org/gemma-4-31B-it-GGUF

31B • Updated Apr 12 • 49.8k • 40

NVIDIA Nemotron 3 Super

Collection for Nemotron-3-Super-120B models

ggml-org/Nemotron-3-Super-120B-GGUF

121B • Updated Mar 16 • 375 • 11

Devstral 2

Collection for Devstral-Small-2-24B-Instruct-2512 models

ggml-org/Devstral-Small-2-24B-Instruct-2512-GGUF

24B • Updated Dec 18, 2025 • 302 • 6
ggml-org/Devstral-2-123B-Instruct-2512-GGUF

125B • Updated Dec 19, 2025 • 45 • 2

NVIDIA Nemotron 3

Collection for Nemotron-Nano-3-30B-A3B models

ggml-org/Nemotron-Nano-3-30B-A3B-GGUF

32B • Updated Dec 16, 2025 • 6.39k • 15

Multimodal GGUFs

Vision and audio models compatible with llama-server and llama-mtmd-cli

GLM-V

Collection

4 items • Updated Dec 17, 2025 • 14
Ministral 3

Collection

6 items • Updated Dec 16, 2025 • 4
Gemma 3

Collection

10 items • Updated Dec 16, 2025 • 24
Kimi-VL

Collection

1 item • Updated Mar 2 • 2

GLM-V

ggml-org/GLM-4.6V-Flash-GGUF

9B • Updated Jan 15 • 997 • 22
ggml-org/GLM-4.6V-GGUF

107B • Updated Jan 15 • 9.25k • 7
ggml-org/AutoGLM-Phone-9B-GGUF

9B • Updated Dec 17, 2025 • 2.72k • 3
ggml-org/GLM-4.5V-GGUF

107B • Updated Feb 17 • 178 • 5

Ministral 3

ggml-org/Ministral-3-3B-Reasoning-2512-GGUF

Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 214 • 2
ggml-org/Ministral-3-8B-Reasoning-2512-GGUF

Image-Text-to-Text • 8B • Updated Dec 2, 2025 • 159 • 1
ggml-org/Ministral-3-14B-Reasoning-2512-GGUF

Image-Text-to-Text • 14B • Updated Dec 2, 2025 • 269 • 3
ggml-org/Ministral-3-3B-Instruct-2512-GGUF

Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 240 • 4

EmbeddingGemma 300M

ggml-org/embeddinggemma-300M-GGUF

0.3B • Updated 24 days ago • 444k • 31
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF

Feature Extraction • 0.3B • Updated Sep 15, 2025 • 1.05k • 5
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF

Feature Extraction • 0.3B • Updated Sep 15, 2025 • 28k • 16

Gemma 3-270m

Collection of models for Gemma 3-270m

ggml-org/gemma-3-270m-GGUF

0.3B • Updated Aug 14, 2025 • 1.63k • 20
ggml-org/gemma-3-270m-it-GGUF

0.3B • Updated Aug 15, 2025 • 2.28k • 22
ggml-org/gemma-3-270m-qat-GGUF

0.3B • Updated Aug 14, 2025 • 14.2k • 10
ggml-org/gemma-3-270m-it-qat-GGUF

0.3B • Updated Aug 15, 2025 • 6.42k • 12

GPT OSS

ggml-org/gpt-oss-120b-GGUF

117B • Updated Oct 30, 2025 • 376k • 72
ggml-org/gpt-oss-20b-GGUF

21B • Updated Oct 30, 2025 • 90k • 152

Kimi-VL

ggml-org/Kimi-VL-A3B-Thinking-2506-GGUF

16B • Updated Aug 20, 2025 • 5.56k • 29

Gemma 3n

ggml-org/gemma-3n-E2B-it-GGUF

4B • Updated Aug 22, 2025 • 4.15k • 25
ggml-org/gemma-3n-E4B-it-GGUF

7B • Updated Jun 26, 2025 • 5.86k • 20

VAD

Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad

Updated Nov 17, 2025 • 17

InternVL 3 and InternVL 2.5

ggml-org/InternVL3-1B-Instruct-GGUF

0.6B • Updated May 10, 2025 • 210 • 4
ggml-org/InternVL3-2B-Instruct-GGUF

2B • Updated May 10, 2025 • 230 • 5
ggml-org/InternVL3-8B-Instruct-GGUF

8B • Updated May 10, 2025 • 284 • 6
ggml-org/InternVL3-14B-Instruct-GGUF

15B • Updated May 10, 2025 • 201 • 4

Qwen 2 VL and Qwen 2.5 VL

ggml-org/Qwen2.5-VL-3B-Instruct-GGUF

3B • Updated Apr 30, 2025 • 7.2k • 6
ggml-org/Qwen2.5-VL-7B-Instruct-GGUF

8B • Updated Apr 30, 2025 • 8.21k • 10
ggml-org/Qwen2.5-VL-32B-Instruct-GGUF

33B • Updated May 15, 2025 • 306 • 5
ggml-org/Qwen2-VL-2B-Instruct-GGUF

2B • Updated Apr 30, 2025 • 2.5k • 2

Qwen 3

ggml-org/Qwen3-0.6B-GGUF

0.8B • Updated Sep 28, 2025 • 69.5k • 14
ggml-org/Qwen3-1.7B-GGUF

2B • Updated Apr 28, 2025 • 5.42k • 7
ggml-org/Qwen3-4B-GGUF

4B • Updated Apr 28, 2025 • 1.56k • 6
ggml-org/Qwen3-8B-GGUF

8B • Updated Apr 28, 2025 • 1.66k • 6

SmolVLM GGUF

ggml-org/SmolVLM2-2.2B-Instruct-GGUF

2B • Updated Apr 30, 2025 • 24.9k • 33
ggml-org/SmolVLM2-500M-Video-Instruct-GGUF

0.4B • Updated Apr 30, 2025 • 21.3k • 16
ggml-org/SmolVLM2-256M-Video-Instruct-GGUF

0.2B • Updated Apr 30, 2025 • 9.45k • 9
ggml-org/SmolVLM-Instruct-GGUF

2B • Updated Apr 30, 2025 • 22.5k • 9

Gemma 3

ggml-org/gemma-3-270m-it-GGUF

0.3B • Updated Aug 15, 2025 • 2.28k • 22
ggml-org/gemma-3-1b-it-GGUF

1.0B • Updated Mar 12, 2025 • 20.9k • 29
ggml-org/gemma-3-4b-it-GGUF

Image-Text-to-Text • 4B • Updated May 21, 2025 • 51.1k • 52
ggml-org/gemma-3-12b-it-GGUF

Image-Text-to-Text • 12B • Updated May 21, 2025 • 4.9k • 31

llama.cpp presets

Models that are used for presets in llama.cpp.

ggml-org/gte-small-Q8_0-GGUF

Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 99 • 2
ggml-org/bge-small-en-v1.5-Q8_0-GGUF

Feature Extraction • 33.2M • Updated Feb 6, 2025 • 2.54k • 6
ggml-org/e5-small-v2-Q8_0-GGUF

Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 59 • 1

GGUF LoRA adapters

Adapters extracted from fine tuned models, using mergekit-extract-lora

ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF

88.1M • Updated Nov 1, 2024 • 9
ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF

93.6M • Updated Jan 23, 2025 • 56 • 4
ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 9, 2025 • 23 • 1
ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF

90.9M • Updated Jan 8, 2025 • 26 • 3

llama.vim

ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF

Text Generation • 0.5B • Updated Jan 31, 2025 • 792 • 10
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF

Text Generation • 2B • Updated Oct 28, 2024 • 3.36k • 17
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF

Text Generation • 3B • Updated Nov 26, 2024 • 2.17k • 9
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF

Text Generation • 8B • Updated Oct 28, 2024 • 2.07k • 9

Gemma 1.1 GGUFs

ggml-org/gemma-1.1-2b-it-Q8_0-GGUF

3B • Updated Apr 5, 2024 • 49 • 1
ggml-org/gemma-1.1-7b-it-Q8_0-GGUF

9B • Updated Apr 5, 2024 • 16
ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF

9B • Updated Apr 5, 2024 • 310 • 4

AI & ML interests

Recent Activity

Articles

Using OCR models with llama.cpp

New in llama.cpp: Anthropic Messages API

New in llama.cpp: Model Management

Team members 13

ggml-org 's collections 24