Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
DiffusionBee
Draw Things
Invoke
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
MLX LM
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
JoyFusion
Inference Providers
Select all
Fireworks
HF Inference API
Hyperbolic
Nebius AI
Inference Providers with no match
Novita
Together AI
Featherless AI
fal
Cerebras
Nscale
SambaNova
Groq
Replicate
Cohere
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
1,239
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
lmms-lab/LLaVA-Video-7B-Qwen2
Video-Text-to-Text
•
Updated
Oct 25, 2024
•
258k
•
101
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
1.98k
•
3
Qwen/Qwen2-VL-7B
Image-Text-to-Text
•
Updated
Jan 12
•
8.44k
•
55
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Apr 4
•
145k
•
532
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
7 days ago
•
6.58k
•
160
NexaAIDev/OmniVLM-968M
Updated
Dec 17, 2024
•
822
•
519
unsloth/Pixtral-12B-2409-bnb-4bit
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
2.31k
•
4
CogACT/CogACT-Base
Robotics
•
Updated
Dec 4, 2024
•
1.75k
•
14
CogACT/CogACT-Large
Robotics
•
Updated
Dec 4, 2024
•
204
•
4
CogACT/CogACT-Small
Robotics
•
Updated
Dec 4, 2024
•
248
•
5
Qwen/Qwen2-VL-72B
Image-Text-to-Text
•
Updated
Dec 6, 2024
•
1.98k
•
79
unsloth/Pixtral-12B-2409-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
4.24k
•
12
Stanford-ILIAD/minivla-libero90-prismatic
Image-Text-to-Text
•
Updated
Dec 12, 2024
•
101
•
2
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448
Video-Text-to-Text
•
Updated
Mar 16
•
1.84k
•
20
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
Jan 25
•
110k
•
217
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text
•
Updated
Jan 25
•
7.45k
•
133
lmstudio-community/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
143
•
2
unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
May 12
•
59.3k
•
34
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
20.1k
•
41
Ertugrul/Qwen2.5-VL-7B-Captioner-Relaxed
Image-Text-to-Text
•
Updated
Mar 22
•
1.27k
•
22
turing-motors/Heron-NVILA-Lite-1B
Image-Text-to-Text
•
Updated
May 1
•
1.25k
•
3
Qwen/Qwen2.5-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
53k
•
49
osunlp/Dreamer-72B
Image-Text-to-Text
•
Updated
Apr 9
•
18
•
2
OpenGVLab/VideoChat-R1_7B
Video-Text-to-Text
•
Updated
Apr 22
•
2.17k
•
8
remyxai/SpaceThinker-Qwen2.5VL-3B
Image-Text-to-Text
•
Updated
4 days ago
•
3.98k
•
19
remyxai/SpaceOm
Image-Text-to-Text
•
Updated
4 days ago
•
334
•
5
TheDenk/Qwen2.5-VL-3B-TrackAnyObject-LoRa-v1
Image-Text-to-Text
•
Updated
Apr 26
•
5
lusxvr/nanoVLM-222M
Image-Text-to-Text
•
Updated
May 8
•
2.71k
•
88
openbmb/AgentCPM-GUI
Image-Text-to-Text
•
Updated
12 days ago
•
735
•
120
bartowski/Qwen_Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
May 8
•
3.53k
•
3
Previous
1
2
3
4
...
42
Next