Edit Models filters

Apps

Apps with no match

Inference Providers

HF Inference API

Inference Providers with no match

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

1,239

Full-text search

Active filters: multimodal

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 258k • 101

openvla/openvla-7b-finetuned-libero-10

Image-Text-to-Text • Updated Oct 9, 2024 • 1.98k • 3

Qwen/Qwen2-VL-7B

Image-Text-to-Text • Updated Jan 12 • 8.44k • 55

allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated Apr 4 • 145k • 532

allenai/Molmo-7B-O-0924

Image-Text-to-Text • Updated 7 days ago • 6.58k • 160

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 822 • 519

unsloth/Pixtral-12B-2409-bnb-4bit

Image-Text-to-Text • Updated Nov 21, 2024 • 2.31k • 4

CogACT/CogACT-Base

Robotics • Updated Dec 4, 2024 • 1.75k • 14

CogACT/CogACT-Large

Robotics • Updated Dec 4, 2024 • 204 • 4

CogACT/CogACT-Small

Robotics • Updated Dec 4, 2024 • 248 • 5

Qwen/Qwen2-VL-72B

Image-Text-to-Text • Updated Dec 6, 2024 • 1.98k • 79

unsloth/Pixtral-12B-2409-unsloth-bnb-4bit

Image-Text-to-Text • Updated Dec 4, 2024 • 4.24k • 12

Stanford-ILIAD/minivla-libero90-prismatic

Image-Text-to-Text • Updated Dec 12, 2024 • 101 • 2

OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448

Video-Text-to-Text • Updated Mar 16 • 1.84k • 20

ByteDance-Seed/UI-TARS-7B-DPO

Image-Text-to-Text • Updated Jan 25 • 110k • 217

ByteDance-Seed/UI-TARS-72B-DPO

Image-Text-to-Text • Updated Jan 25 • 7.45k • 133

lmstudio-community/UI-TARS-72B-DPO-GGUF

Image-Text-to-Text • Updated Jan 23 • 143 • 2

unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • Updated May 12 • 59.3k • 34

Qwen/Qwen2.5-VL-3B-Instruct-AWQ

Image-Text-to-Text • Updated Apr 6 • 20.1k • 41

Ertugrul/Qwen2.5-VL-7B-Captioner-Relaxed

Image-Text-to-Text • Updated Mar 22 • 1.27k • 22

turing-motors/Heron-NVILA-Lite-1B

Image-Text-to-Text • Updated May 1 • 1.25k • 3

Qwen/Qwen2.5-VL-32B-Instruct-AWQ

Image-Text-to-Text • Updated Apr 6 • 53k • 49

osunlp/Dreamer-72B

Image-Text-to-Text • Updated Apr 9 • 18 • 2

OpenGVLab/VideoChat-R1_7B

Video-Text-to-Text • Updated Apr 22 • 2.17k • 8

remyxai/SpaceThinker-Qwen2.5VL-3B

Image-Text-to-Text • Updated 4 days ago • 3.98k • 19

remyxai/SpaceOm

Image-Text-to-Text • Updated 4 days ago • 334 • 5

TheDenk/Qwen2.5-VL-3B-TrackAnyObject-LoRa-v1

Image-Text-to-Text • Updated Apr 26 • 5

lusxvr/nanoVLM-222M

Image-Text-to-Text • Updated May 8 • 2.71k • 88

openbmb/AgentCPM-GUI

Image-Text-to-Text • Updated 12 days ago • 735 • 120

bartowski/Qwen_Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • Updated May 8 • 3.53k • 3