OpenGVLab/InternVL3-14B-Instruct
Image-Text-to-Text • 15B • Updated • 1.64k • 9
Computer Vision
ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning