Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models Paper • 2504.08809 • Published Apr 9 • 1
Seedream 4.0: Toward Next-generation Multimodal Image Generation Paper • 2509.20427 • Published Sep 24 • 76
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11 • 79
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Paper • 2505.15612 • Published May 21 • 34
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published Apr 2 • 37
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation Paper • 2412.01316 • Published Dec 2, 2024 • 9
Chat-with-OpenAI-o1 🚀 Generate conversational responses using OpenAI's language model • Running • 450
Centroid-centered Modeling for Efficient Vision Transformer Pre-training Paper • 2303.04664 • Published Mar 8, 2023
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos Paper • 2402.06119 • Published Feb 9, 2024 • 1
3D-VLA: A 3D Vision-Language-Action Generative World Model Paper • 2403.09631 • Published Mar 14, 2024 • 11