AXERA-TECH's Collections

Multimodal Models

updated 6 days ago

  • AXERA-TECH/lcm-lora-sdv1-5

    Updated Jun 23 • 9 • 1

  • AXERA-TECH/InternVL3-2B

    Visual Question Answering • Updated 2 days ago • 15 • 2

  • AXERA-TECH/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • Updated about 3 hours ago • 18

  • AXERA-TECH/InternVL3-1B

    Image-Text-to-Text • Updated Jun 28 • 9

  • AXERA-TECH/SmolVLM2-500M-Video-Instruct

    Visual Question Answering • Updated 23 days ago • 9 • 2

  • AXERA-TECH/InternVL2_5-1B-MPO

    Image-Text-to-Text • Updated Jun 27 • 6

  • AXERA-TECH/InternVL2_5-1B

    Image-Text-to-Text • Updated Apr 4 • 8 • 1

  • AXERA-TECH/Janus-Pro-1B

    Visual Question Answering • Updated Apr 14 • 8 • 2

  • AXERA-TECH/SmolVLM-256M-Instruct

    Updated Apr 4 • 16 • 2

  • AXERA-TECH/YOLO-World-V2

    Object Detection • Updated Mar 23 • 7

  • AXERA-TECH/LivePortrait

    Image-to-Video • Updated Jun 21 • 7 • 4

  • AXERA-TECH/cnclip

    Updated 2 days ago • 9

  • AXERA-TECH/clip

    Updated 2 days ago • 8

  • AXERA-TECH/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated 2 days ago • 3