Article 📄 PDF Support in the Hugging Face Dataset Viewer By asoria • about 17 hours ago • 2
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Paper • 2506.19851 • Published 1 day ago • 47
Doc VL Collection drex [doc], virex [video (image ++) exp] • 3 items • Updated about 18 hours ago • 2
Article 🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other • 5 days ago • 50
VisionScope OCR Experimentals Collection Based on Qwen2.5 VL, Qwen2 VL • 5 items • Updated 3 days ago • 1
Improved Iterative Refinement for Chart-to-Code Generation via Structured Instruction Paper • 2506.14837 • Published 11 days ago • 10
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published 8 days ago • 41
EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models Paper • 2506.10100 • Published 15 days ago • 10
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Paper • 2506.14234 • Published 9 days ago • 38
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Paper • 2506.14429 • Published 9 days ago • 43
Article Testing VisionOCR-3B-061125 and Qwen2-VL-OCR-2B-Instruct for precise recognition of [messy] handwriting. By prithivMLmods • 9 days ago • 3
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated 13 days ago • 128