Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

Towards Robust Mathematical Reasoning

upvoted a paper about 5 hours ago

Deep Self-Evolving Reasoning

upvoted a paper 3 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Organizations

None yet

upvoted 2 papers about 5 hours ago

Towards Robust Mathematical Reasoning

Paper • 2511.01846 • Published 4 days ago • 7

Deep Self-Evolving Reasoning

Paper • 2510.17498 • Published 18 days ago • 11

upvoted a paper 3 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 8 days ago • 40

upvoted 13 papers 5 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published 17 days ago • 82

Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs

Paper • 2510.18279 • Published 17 days ago • 4

Prompt-MII: Meta-Learning Instruction Induction for LLMs

Paper • 2510.16932 • Published 19 days ago • 6

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published 18 days ago • 64

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 24 days ago • 102

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published 24 days ago • 13

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Paper • 2510.12635 • Published 24 days ago • 15

Base Models Know How to Reason, Thinking Models Learn When

Paper • 2510.07364 • Published 30 days ago • 1

Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models

Paper • 2510.08492 • Published 29 days ago • 8

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published 24 days ago • 46

upvoted 2 papers 6 days ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 11 days ago • 56

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published 30 days ago • 48

upvoted 2 papers 8 days ago

VISTA: A Test-Time Self-Improving Video Generation Agent

Paper • 2510.15831 • Published 21 days ago • 20

Robust Layerwise Scaling Rules by Proper Weight Decay Tuning

Paper • 2510.15262 • Published 21 days ago • 5
