Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework Paper • 2506.02454 • Published 23 days ago • 5
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Paper • 2505.24120 • Published 27 days ago • 48
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published about 1 month ago • 17
🌸 April 2025 - Open releases from the Chinese community Collection 42 items • Updated 8 days ago • 13
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published May 12 • 29
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Paper • 2503.21979 • Published Mar 27 • 3
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published Apr 23 • 57
Skywork-R1V2 Collection Multimodal Hybrid Reinforcement Learning for Reasoning • 7 items • Updated 20 days ago • 10
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published Apr 8 • 83
Skywork-R1V Collection Pioneering multimodal reasoning with CoT • 5 items • Updated 13 days ago • 8