Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 39
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 11
Inference Providers
Fireworks
Novita
Nebius AI
Together AI
Featherless AI
fal
Cerebras
Nscale
+ 6
Apply filters
Models
10,811
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
01-ai/Yi-VL-34B
Image-Text-to-Text
•
Updated
Jun 26, 2024
•
89
•
263
liuhaotian/llava-v1.6-vicuna-7b
Image-Text-to-Text
•
Updated
May 9, 2024
•
28.5k
•
128
cjpais/llava-1.6-mistral-7b-gguf
Image-Text-to-Text
•
Updated
Mar 6, 2024
•
4.63k
•
107
llava-hf/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
May 1
•
215k
•
267
facebook/chameleon-7b
Image-Text-to-Text
•
Updated
Jul 23, 2024
•
46.8k
•
185
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
Oct 14, 2024
•
551k
•
608
google/paligemma-3b-pt-224
Image-Text-to-Text
•
Updated
Sep 21, 2024
•
45.2k
•
333
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
4 days ago
•
1.79k
•
119
microsoft/llava-med-v1.5-mistral-7b
Image-Text-to-Text
•
Updated
May 14, 2024
•
35.6k
•
93
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
Updated
Jan 15
•
36.8k
•
1.4k
AIML-TUDA/LlavaGuard-v1.0-7B
Image-Text-to-Text
•
Updated
Apr 22
•
11
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
Updated
Jul 20, 2024
•
61.2k
•
355
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
Updated
Jul 20, 2024
•
58.4k
•
117
SpursgoZmy/table-llava-v1.5-7b
Image-Text-to-Text
•
Updated
Feb 7
•
248
•
13
onnx-community/Florence-2-base-ft
Image-Text-to-Text
•
Updated
May 8
•
24.9k
•
33
mlx-community/llava-v1.6-mistral-7b-4bit
Image-Text-to-Text
•
Updated
Jan 11
•
96
•
5
gokaygokay/Florence-2-SD3-Captioner
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
5.41k
•
37
John6666/gokaygokay-Florence-2-SD3-Captioner-8bit
Image-Text-to-Text
•
Updated
Jun 27, 2024
•
29
•
2
OpenGVLab/InternVL2-8B
Image-Text-to-Text
•
Updated
Mar 25
•
347k
•
172
mPLUG/mPLUG-Owl3-7B-240728
Image-Text-to-Text
•
Updated
Sep 29, 2024
•
792
•
40
llava-hf/llava-onevision-qwen2-7b-ov-hf
Image-Text-to-Text
•
Updated
8 days ago
•
40.2k
•
31
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Jan 12
•
999k
•
428
multimodalart/Florence-2-large-no-flash-attn
Image-Text-to-Text
•
Updated
Aug 29, 2024
•
67k
•
19
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
1.98k
•
3
Qwen/Qwen2-VL-7B
Image-Text-to-Text
•
Updated
Jan 12
•
8.44k
•
55
NVEagle/Eagle-X4-8B-Plus
Image-Text-to-Text
•
Updated
Sep 16, 2024
•
1.7k
•
4
mistralai/Pixtral-12B-2409
Image-Text-to-Text
•
Updated
Dec 26, 2024
•
•
649
AIML-TUDA/LlavaGuard-v1.0-7B-hf
Image-Text-to-Text
•
Updated
Apr 22
•
16
•
4
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Apr 4
•
145k
•
532
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
7 days ago
•
6.58k
•
160
Previous
1
...
3
4
5
6
7
...
100
Next