markendo/Visual-Extraction-Tuning-382K
Viewer
•
Updated
•
382k
•
112
Data and Models for Extract+Think as part of Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models