Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

ChaoHuangCS
/
DRIFT-VL-7B

Image-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
image-to-text
vision-language-model
multimodal
reasoning
fine-tuned
qwen
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
1
DRIFT-VL-7B
61.6 kB
  • 2 contributors
History: 5 commits
ChaoHuangCS's picture
ChaoHuangCS
Upload fine-tuned Qwen2.5-VL reasoning model - batch 2
ac5e43d verified about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • added_tokens.json
    648 Bytes
    Upload fine-tuned Qwen2.5-VL reasoning model - batch 2 about 1 month ago
  • chat_template.json
    1.05 kB
    Upload fine-tuned Qwen2.5-VL reasoning model about 1 month ago
  • model.safetensors.index.json
    57.6 kB
    Upload fine-tuned Qwen2.5-VL reasoning model about 1 month ago
  • preprocessor_config.json
    763 Bytes
    Upload fine-tuned Qwen2.5-VL reasoning model about 1 month ago