One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient • Paper • 2509.26313 • Published Sep 30, 2025
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks • Paper • 2401.02731 • Published Jan 5, 2024
GroveMoE • Collection • GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute. • 4 items • Updated Oct 13
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts • Paper • 2508.07785 • Published Aug 11, 2025
Tulu 3 Datasets • Collection • All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items