Haizhong

haizhongzheng

http://zhenghaizhong.com/

AI & ML interests

Efficient machine learning

Recent Activity

upvoted a paper 19 days ago

When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?

Paper • 2510.17862 • Published 26 days ago • 6

commented on a paper 19 days ago

When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?

Paper • 2510.17862 • Published 26 days ago • 6

upvoted a paper about 1 month ago

Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

Paper • 2510.01161 • Published Oct 1 • 13

commented on a paper about 1 month ago

Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

Paper • 2510.01161 • Published Oct 1 • 13

upvoted 2 papers 5 months ago

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published Jun 11 • 55

Kinetics: Rethinking Test-Time Scaling Laws

Paper • 2506.05333 • Published Jun 5 • 6

updated a dataset 8 months ago

haizhongzheng/DAPO-Math-17K-cleaned

Viewer • Updated Mar 26 • 17.9k • 48 • 1

published a dataset 8 months ago

haizhongzheng/DAPO-Math-17K-cleaned

Viewer • Updated Mar 26 • 17.9k • 48 • 1

upvoted a paper 8 months ago

Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale

Paper • 2502.01798 • Published Feb 3 • 1

published a model 9 months ago

haizhongzheng/Qwen2.5-1.5B-Open-R1-GRPO

Updated Feb 10

updated a model 12 months ago

haizhongzheng/Llama-3.2-1B-dpo-lora

Text Generation • Updated Nov 26, 2024 • 1

upvoted a paper over 1 year ago

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14, 2024 • 55
