AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Paper • 2506.19851 • Published 1 day ago • 40
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published 2 days ago • 43
3D Arena: An Open Platform for Generative 3D Evaluation Paper • 2506.18787 • Published 2 days ago • 9
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 2 days ago • 60
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs Paper • 2506.18896 • Published 2 days ago • 25
Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset Paper • 2506.18851 • Published 2 days ago • 25
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published 2 days ago • 78
Article 🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other • 4 days ago • 49
Changelog Organization and User profiles now include repository listing pages • 5 days ago • 40
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model Paper • 2506.13642 • Published 9 days ago • 26
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Paper • 2506.15211 • Published 8 days ago • 31
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 5 items • Updated 3 days ago • 5
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 9 days ago • 235
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 4 items • Updated 6 days ago • 96
LeVo: High-Quality Song Generation with Multi-Preference Alignment Paper • 2506.07520 • Published 17 days ago • 4