Abidoye Aanuoluwapo

Aanuoluwapo65

https://anuoluwapo65.github.io/

AI & ML interests

Computer vision and multimodal learning

Recent Activity

liked a dataset about 2 hours ago

allenai/paloma

Viewer • Updated Jun 6, 2024 • 309k • 2.8k • 44

upvoted a paper 1 day ago

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

Paper • 2605.30161 • Published 5 days ago • 52

upvoted an article 2 days ago

What is test-time compute and how to scale it?

Article • Kseniase • Feb 6, 2025 • 122

upvoted a paper 3 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 6 days ago • 68

upvoted 5 papers 4 days ago

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

Paper • 2605.28691 • Published 6 days ago • 21

upvoted 8 papers 13 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 18 days ago • 53

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

Paper • 2605.15980 • Published 18 days ago • 36

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Paper • 2605.02290 • Published 29 days ago • 40

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published 15 days ago • 67

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published 15 days ago • 112

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published 19 days ago • 118

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 20 days ago • 269

Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis

Paper • 2605.18451 • Published 15 days ago • 41

upvoted 2 papers 18 days ago

VoiceGRPO: Modern MoE Transformers with Group Relative Policy Optimization GRPO for AI Voice Health Care Applications on Voice Pathology Detection

Paper • 2503.03797 • Published Mar 5, 2025 • 1

A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks

Paper • 2501.15724 • Published Jan 27, 2025 • 1

liked a model 18 days ago

InfoBayAI/resnet18-ct-pathology-classifier

Image Classification • Updated 15 days ago • 9 • 3
