Ziheng Zhou's picture

2 2 5

Ziheng Zhou

josephziheng

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

upvoted a paper 3 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

liked a model about 2 years ago

baichuan-inc/Baichuan2-13B-Base

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

Paper • 2509.23866 • Published Sep 28 • 12

upvoted a paper 3 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 178