UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist • Paper • 2511.08521 • Published 11 days ago • 36
Black-Box On-Policy Distillation of Large Language Models • Paper • 2511.10643 • Published 9 days ago • 40
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 8 days ago • 78
Music Flamingo: Scaling Music Understanding in Audio Language Models • Paper • 2511.10289 • Published 9 days ago • 9