Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ggml-org 's Collections
Multimodal GGUFs
VAD
InternVL 3 and InternVL 2.5
Qwen 2 VL and Qwen 2.5 VL
Qwen 3
SmolVLM GGUF
Gemma 3
llama.cpp presets
GGUF LoRA adapters
llama.vim
Gemma 1.1 GGUFs

Multimodal GGUFs

updated about 1 month ago

Vision and audio models compatible with llama-server and llama-mtmd-cli

Upvote
4

  • Gemma 3

    Collection
    4 items • Updated May 14 • 15

  • ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF

    Image-Text-to-Text • Updated May 1 • 484 • 4

  • InternVL 3 and InternVL 2.5

    Collection
    10 items • Updated May 14

  • Qwen 2 VL and Qwen 2.5 VL

    Collection
    4 items • Updated May 14

  • SmolVLM GGUF

    Collection
    6 items • Updated May 14 • 3

  • ggml-org/moondream2-20250414-GGUF

    Updated May 25 • 2.78k • 3

  • ggml-org/pixtral-12b-GGUF

    Updated Apr 30 • 643 • 4

    Note . ------------ Below are audio models


  • ggml-org/ultravox-v0_5-llama-3_2-1b-GGUF

    Audio-Text-to-Text • Updated May 25 • 1.12k • 2

  • ggml-org/ultravox-v0_5-llama-3_1-8b-GGUF

    Updated May 22 • 647 • 3

    Note . ------------ Below are vision+audio models


  • ggml-org/Qwen2.5-Omni-3B-GGUF

    Any-to-Any • Updated about 1 month ago • 1.79k • 2

  • ggml-org/Qwen2.5-Omni-7B-GGUF

    Any-to-Any • Updated about 1 month ago • 2.75k • 8
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs