Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Together AI
Novita
Nebius AI
Groq
Hyperbolic
Nscale
+ 6
Apply filters
Models
5,419
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
zai-org/GLM-4.5V
Image-Text-to-Text
•
108B
•
Updated
1 day ago
•
9.5k
•
•
561
AIDC-AI/Ovis2.5-9B
Image-Text-to-Text
•
9B
•
Updated
about 15 hours ago
•
56
•
200
rednote-hilab/dots.ocr
Image-Text-to-Text
•
3B
•
Updated
1 day ago
•
24.1k
•
797
AIDC-AI/Ovis2.5-2B
Image-Text-to-Text
•
3B
•
Updated
about 15 hours ago
•
70
•
155
LiquidAI/LFM2-VL-1.6B
Image-Text-to-Text
•
2B
•
Updated
6 days ago
•
1k
•
131
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
7 days ago
•
5.76k
•
447
LiquidAI/LFM2-VL-450M
Image-Text-to-Text
•
0.5B
•
Updated
6 days ago
•
2.36k
•
87
inference-net/ClipTagger-12b
Image-Text-to-Text
•
12B
•
Updated
6 days ago
•
73
•
26
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Apr 6
•
4.19M
•
•
1.15k
google/medgemma-4b-it
Image-Text-to-Text
•
5B
•
Updated
Jul 9
•
109k
•
607
google/gemma-3-27b-it
Image-Text-to-Text
•
27B
•
Updated
Mar 21
•
397k
•
•
1.57k
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
8B
•
Updated
Apr 18
•
98.1k
•
359
xlangai/OpenCUA-32B
Image-Text-to-Text
•
33B
•
Updated
1 day ago
•
55
•
17
google/gemma-3n-E4B-it
Image-Text-to-Text
•
8B
•
Updated
Jul 14
•
109k
•
727
fancyfeast/llama-joycaption-beta-one-hf-llava
Image-Text-to-Text
•
8B
•
Updated
May 16
•
42.4k
•
192
XiaomiMiMo/MiMo-VL-7B-RL-2508
Image-Text-to-Text
•
8B
•
Updated
10 days ago
•
2.88k
•
57
LiquidAI/LFM2-VL-1.6B-GGUF
Image-Text-to-Text
•
1B
•
Updated
1 day ago
•
15
google/gemma-3-4b-it
Image-Text-to-Text
•
4B
•
Updated
Mar 21
•
1.11M
•
794
inclusionAI/UI-Venus-Ground-7B
Image-Text-to-Text
•
8B
•
Updated
about 16 hours ago
•
14
LiquidAI/LFM2-VL-450M-GGUF
Image-Text-to-Text
•
0.4B
•
Updated
1 day ago
•
14
ds4sd/SmolDocling-256M-preview
Image-Text-to-Text
•
0.3B
•
Updated
May 16
•
36.2k
•
1.54k
google/gemma-3n-E4B-it-litert-preview
Image-Text-to-Text
•
Updated
May 26
•
1.45k
google/gemma-3-12b-it
Image-Text-to-Text
•
12B
•
Updated
Mar 21
•
315k
•
•
494
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text
•
27B
•
Updated
Apr 1
•
23.2k
•
147
QuantTrio/GLM-4.5V-AWQ
Image-Text-to-Text
•
17B
•
Updated
7 days ago
•
544
•
13
microsoft/Florence-2-large
Image-Text-to-Text
•
0.8B
•
Updated
15 days ago
•
1.01M
•
1.64k
nvidia/Cosmos-Reason1-7B
Image-Text-to-Text
•
8B
•
Updated
5 days ago
•
214k
•
135
google/gemma-3n-E2B-it-litert-preview
Image-Text-to-Text
•
Updated
May 20
•
545
moonshotai/Kimi-VL-A3B-Thinking-2506
Image-Text-to-Text
•
16B
•
Updated
1 day ago
•
23.9k
•
269
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
•
10B
•
Updated
Jul 8
•
174k
•
•
715
Previous
1
2
3
...
100
Next