Boosting Omni-Modal Language Models: Staged Post-Training with Visually Debiased Evaluation Paper • 2605.12034 • Published 9 days ago • 5
LLM-based Detection of Manipulative Political Narratives Paper • 2605.14354 • Published 8 days ago • 5
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 28 days ago • 63
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published 29 days ago • 19
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published Mar 27 • 66
Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection Paper • 2603.12916 • Published Mar 13 • 3
HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios Paper • 2603.11975 • Published Mar 12 • 11
HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios Paper • 2603.11975 • Published Mar 12 • 11 • 4
Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection Paper • 2601.19375 • Published Jan 27 • 5
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published Jan 4 • 46
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 268
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout Paper • 2511.20649 • Published Nov 25, 2025 • 51
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published Nov 29, 2025 • 53