Stella Li's picture

2 3

Stella Li PRO

stellalisy

·

https://stellalisy.com/

AI & ML interests

None yet

Recent Activity

published a dataset 15 days ago

stellalisy/PrefPalette

updated a dataset 15 days ago

stellalisy/PrefPalette

updated a dataset 29 days ago

stellalisy/HorizonPref_natural_0827

View all activity

Organizations

Collections 2

Papers 10

arXiv:2510.00177

arXiv:2507.13541

arXiv:2506.10947

arXiv:2505.03054

models 30

stellalisy/system_select_dpo-3b-lr1e-5-b0.1

Text Generation • 3B • Updated Aug 6 • 3

stellalisy/system_select_dpo-3b-lr1e-6-b0.1

Text Generation • 3B • Updated Aug 6 • 5

stellalisy/system_select_dpo-3b-lr1e-5-b0.0

Text Generation • 3B • Updated Aug 6 • 3

stellalisy/system_select_dpo-1b-lr1e-6-b0.1

Text Generation • 1B • Updated Aug 6 • 1

stellalisy/system_select_dpo-1b-lr1e-5-b0.1

Text Generation • 1B • Updated Aug 6 • 1

stellalisy/system_select_dpo-1b-lr1e-6-b0.0

Text Generation • 1B • Updated Aug 6 • 1

stellalisy/system_select_dpo-1b-lr1e-5-b0.0

Text Generation • 1B • Updated Aug 6 • 2

stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step150

Text Generation • 8B • Updated Jun 13 • 8

stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step100

Text Generation • 8B • Updated Jun 13 • 8

stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step50

Text Generation • 8B • Updated Jun 13 • 6

datasets 21

stellalisy/PrefPalette

Viewer • Updated 15 days ago • 2.01M • 5

stellalisy/HorizonPref_natural_0827

Viewer • Updated 29 days ago • 1.75k • 66

stellalisy/DAPO-Math-14k-Processed-RLVR_random

Viewer • Updated Sep 14 • 14.1k • 249

stellalisy/rlvr_orz_math_57k_collected_random

Viewer • Updated Aug 26 • 56.9k • 88

stellalisy/personalized_simpleqa

Preview • Updated Aug 26 • 14

stellalisy/personalized_socialiqa

Preview • Updated Aug 26 • 10

stellalisy/personalized_scienceqa

Preview • Updated Aug 26 • 11

stellalisy/personalized_mmlu

Preview • Updated Aug 26 • 10

stellalisy/personalized_medqa

Preview • Updated Aug 26 • 14

stellalisy/personalized_commonsenseqa

Preview • Updated Aug 26 • 10

View 21 datasets