Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers
Inference Providers with no match
Fireworks
Novita
Nebius AI
Together AI
Featherless AI
fal
Cerebras
Nscale
SambaNova
Hyperbolic
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
vision-language-model
Inference Endpoints
Eval Results
custom_code
text-generation-inference
4-bit precision
8-bit precision
Misc with no match
Merge
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
23
Full-text search
Edit filters
Sort: Trending
Active filters:
vision-language-model
Clear all
ByteDance/Dolphin
Image-Text-to-Text
•
Updated
30 days ago
•
19.1k
•
399
xiaorui638/flair
Updated
Mar 6
•
56
•
3
humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom
Image-to-Text
•
Updated
17 days ago
•
56
•
1
remyxai/SpaceLLaVA
Image-Text-to-Text
•
Updated
Apr 20
•
345
•
24
deadzzz/qwen_VLM_finetuning
Updated
Oct 24, 2024
SVECTOR-CORPORATION/Spec-Vision-V1
Image-Text-to-Text
•
Updated
Feb 11
•
98
•
4
Duino/Duino-Lidar
Depth Estimation
•
Updated
Feb 18
•
7
sankim2/cosmos
Image-Text-to-Text
•
Updated
Mar 27
•
7
•
1
yjj23/minivlm
Updated
Apr 20
•
17
samihalawa/APOLO-medical-multimodal-instruct
Image-Text-to-Text
•
Updated
May 8
•
1
daniel3303/QwenStoryteller
Image-to-Text
•
Updated
May 16
•
5.66k
•
8
mradermacher/QwenStoryteller-GGUF
Image-to-Text
•
Updated
May 13
•
148
mradermacher/QwenStoryteller-i1-GGUF
Image-to-Text
•
Updated
May 13
•
659
•
1
lordChipotle/nutrition-label-detector
Image-Text-to-Text
•
Updated
May 19
•
11
truworthai/DynamicVisualLearning-v2-mlx
Updated
23 days ago
truworthai/FixedDynamicLearning-v3-mlx
Updated
23 days ago
truworthai/FinalVisualLearning-v4-mlx
Updated
23 days ago
truworthai/verynew
Updated
23 days ago
truworthai/testhellow
Updated
22 days ago
truworthai/Combined-mlx
Updated
22 days ago
•
10
phronetic-ai/owlet-har-1
Video Classification
•
Updated
2 days ago
•
9
convaiinnovations/ECG-Instruct-Llama-3.2-11B-Vision
Text Generation
•
Updated
6 days ago
•
31
gribok201/smolvla
Robotics
•
Updated
6 days ago
•
11