Jiang's picture

4 8

Jiang

Louieworth

·

AI & ML interests

None yet

Recent Activity

liked a dataset 20 days ago

KbsdJames/Omni-MATH

liked a model 20 days ago

agentica-org/DeepScaleR-1.5B-Preview

upvoted a paper 30 days ago

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

View all activity

Organizations

None yet

upvoted a paper 30 days ago

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

Paper • 2507.17746 • Published Jul 23 • 1

upvoted a paper about 1 month ago

Which Heads Matter for Reasoning? RL-Guided KV Cache Compression

Paper • 2510.08525 • Published Oct 9 • 22

upvoted 2 papers about 2 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2 • 52

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

Paper • 2509.21320 • Published Sep 25 • 99