
Li Yunshui

Wa2erGo

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
authored a paper 7 days ago
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
authored a paper 7 days ago
Model Merging in Pre-training of Large Language Models

Organizations

SIAT-NLP, Qwen, CodeScience

commented a paper 14 days ago

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Paper • 2506.09003 • Published 16 days ago • 18 • 3
commented 2 papers about 1 month ago

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17 • 36 • 6

New activity in dandelin/vilt-b32-mlm almost 3 years ago

Update README.md

#1 opened almost 3 years ago by Wa2erGo