Official models and datasets for the paper μ²Tokenizer (https://arxiv.org/abs/2507.00316)
Papers
ViewSAM: Learning View-aware Cross-modal Semantics for Weakly Supervised Cross-view Referring Multi-Object Tracking
Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension
Official models and datasets for the paper QTSplus (https://arxiv.org/abs/2511.11910)
- AlpachinoNLP/QTSplus-7B
  Image-Text-to-Text • 9B • Updated • 16 • 1
- AlpachinoNLP/QTSplus-3B
  Image-Text-to-Text • 4B • Updated • 28 • 1
- AlpachinoNLP/QTSplus-3B-FT
  Image-Text-to-Text • 4B • Updated • 20 • 1
- AlpachinoNLP/QTSplus-LLaVA-Video-7B-Qwen2
  Image-Text-to-Text • 9B • Updated • 20