Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 39
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 11
Inference Providers
Fireworks
Novita
Nebius AI
Together AI
Featherless AI
fal
Cerebras
Nscale
+ 6
Apply filters
Models
10,810
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
7 days ago
•
6.58k
•
160
mPLUG/DocOwl2
Image-Text-to-Text
•
Updated
Sep 27, 2024
•
349
•
105
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.97k
•
1.67k
latent-action-pretraining/LAPA-7B-openx
Image-Text-to-Text
•
Updated
Nov 22, 2024
•
11
OpenGVLab/InternVL2-8B-MPO
Image-Text-to-Text
•
Updated
Dec 20, 2024
•
124
•
35
unsloth/Pixtral-12B-2409-bnb-4bit
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
2.31k
•
4
google/paligemma2-28b-pt-896
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
1.62k
•
49
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
•
Updated
Jan 31
•
37k
•
210
onnx-community/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Mar 6
•
97
•
9
Qwen/Qwen2-VL-72B
Image-Text-to-Text
•
Updated
Dec 6, 2024
•
1.98k
•
79
OpenGVLab/Mini-InternVL2-1B-DA-Medical
Image-Text-to-Text
•
Updated
Dec 9, 2024
•
17
•
1
OpenGVLab/PVC-InternVL2-8B
Image-Text-to-Text
•
Updated
Dec 17, 2024
•
33
•
9
Stanford-ILIAD/minivla-libero90-prismatic
Image-Text-to-Text
•
Updated
Dec 12, 2024
•
101
•
2
Bllossom/llama-3.2-Korean-Bllossom-AICA-5B
Image-Text-to-Text
•
Updated
Mar 14
•
5.01k
•
87
TianHuiLab/Falcon-Single-Instruction-Large
Image-Text-to-Text
•
Updated
Mar 21
•
7
5CD-AI/Vintern-1B-v3_5
Image-Text-to-Text
•
Updated
Feb 12
•
310k
•
72
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
Jan 25
•
110k
•
217
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text
•
Updated
Jan 25
•
7.45k
•
133
lmstudio-community/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
143
•
2
prithivMLmods/QvQ-Step-Tiny
Image-Text-to-Text
•
Updated
Jan 24
•
26
•
2
IPEC-COMMUNITY/spatialvla-4b-224-pt
Image-Text-to-Text
•
Updated
Mar 16
•
12.6k
•
6
ibm-granite/granite-vision-3.1-2b-preview
Image-Text-to-Text
•
Updated
14 days ago
•
8.77k
•
102
unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
May 12
•
59.3k
•
34
RedHatAI/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
Image-Text-to-Text
•
Updated
Apr 3
•
41.7k
•
3
AIDC-AI/Ovis2-2B
Image-Text-to-Text
•
Updated
Feb 27
•
1.16k
•
55
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
26.9k
•
65
HuggingFaceTB/SmolVLM2-500M-Video-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
131k
•
72
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
20.1k
•
41
ibm-granite/granite-vision-3.2-2b
Image-Text-to-Text
•
Updated
14 days ago
•
6.95k
•
98
JKCHSTR/llama-joycaption-alpha-two-hf-llava-FP8-Dynamic
Image-Text-to-Text
•
Updated
Feb 18
•
65
•
4
Previous
1
...
4
5
6
7
8
...
100
Next