Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT Paper • 2511.17405 • Published 16 days ago • 10
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics Paper • 2510.07181 • Published Oct 8 • 1
MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval Paper • 2509.26378 • Published Sep 30
Uniform Discrete Diffusion with Metric Path for Video Generation Paper • 2510.24717 • Published Oct 28 • 39
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench Paper • 2510.26865 • Published Oct 30 • 11
🐻 URSA Collection URSA: Uniform Discrete Diffusion with Metric Path for Video Generation • 6 items • Updated Nov 2 • 6