Qwen
/

Qwen3-VL-4B-Instruct-FP8

Image-Text-to-Text

Model card Files Files and versions

Resources

View closed (1)

vllm version for inference of Qwen/Qwen3-VL-4B-Instruct-FP8 and Qwen/Qwen3-VL-4B-Instruct

#3 opened 7 days ago by

VRAM usage not making sense

#2 opened 20 days ago by