Edit Models filters

Apps

Apps with no match

Inference Providers

HF Inference API

Inference Providers with no match

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

1,239

Full-text search

Active filters: multimodal

lmstudio-community/Qwen2.5-VL-72B-Instruct-GGUF

Image-Text-to-Text • Updated May 10 • 668 • 1

unsloth/Qwen2.5-VL-32B-Instruct-GGUF

Image-Text-to-Text • Updated May 12 • 4.53k • 4

osunlp/WebJudge-7B

Image-Text-to-Text • Updated May 12 • 77 • 5

ggml-org/Qwen2.5-Omni-7B-GGUF

Any-to-Any • Updated about 1 month ago • 2.75k • 8

imageomics/bioclip-2

Zero-Shot Image Classification • Updated 20 days ago • 4.1k • 10

davidelobba/TEMU-VTOFF

Image-to-Image • Updated 27 days ago • 3

OpenGVLab/ZeroGUI-AndroidLab-7B

Image-Text-to-Text • Updated 27 days ago • 87 • 4

lingshu-medical-mllm/Lingshu-32B

Image-Text-to-Text • Updated about 15 hours ago • 1.58k • 37

Sungyeon/GENIUS

Visual Document Retrieval • Updated 19 days ago • 1

humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom

Image-to-Text • Updated 18 days ago • 56 • 1

mehmetkuzucu/Waffle-v1.0

Visual Question Answering • Updated 16 days ago • 91 • 4

adriabama06/UI-TARS-1.5-7B-exl2

Image-Text-to-Text • Updated 5 days ago • 2 • 1

adriabama06/UI-TARS-1.5-7B-Q4_K_M-GGUF

Image-Text-to-Text • Updated 5 days ago • 16 • 1

adriabama06/UI-TARS-1.5-7B-GGUF

Image-Text-to-Text • Updated 5 days ago • 68 • 1

avin-255/nanoVLM

Image-Text-to-Text • Updated 5 days ago • 4 • 1

thesby/Qwen2.5-VL-7B-NSFW-Caption-V3

Image-Text-to-Text • Updated 9 days ago • 107 • 7

sujitpal/clip-imageclef

Zero-Shot Image Classification • Updated Oct 31, 2023 • 59 • 3

waybarrios/guidance-based-video-grounding

Updated Apr 1, 2023

MonoHime/mosei-senti-intermodal

Feature Extraction • Updated May 18, 2023 • 51

MonoHime/mosei-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 39

MonoHime/iemocap-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 24

MonoHime/mosi-senti-intermodal

Feature Extraction • Updated May 18, 2023 • 44

MonoHime/meld-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 17

imageomics/bioclip

Zero-Shot Image Classification • Updated May 17, 2024 • 31.2k • 49

HuggingFaceM4/idefics-80b

Text Generation • Updated Oct 12, 2023 • 43 • 70

HuggingFaceM4/idefics-9b

Text Generation • Updated Oct 12, 2023 • 1.37k • 46

HuggingFaceM4/idefics-80b-instruct

Text Generation • Updated Oct 12, 2023 • 1.91k • 189

typeof/idefics-9b

Text Generation • Updated Oct 13, 2023 • 21

sshh12/Mistral-7B-LoRA-VisionCLIP-LLAVA

Text Generation • Updated Oct 28, 2023 • 70 • 10

sshh12/Mistral-7B-LoRA-ImageBind-LLAVA

Text Generation • Updated Nov 2, 2023 • 70 • 12