Hugging Face
Alexander Bukharin
alexwb
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
liked a model, 9 days ago: nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
liked a model, 4 months ago: nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
upvoted a paper, 10 months ago: HelpSteer2-Preference: Complementing Ratings with Preferences
Organizations
Papers (2)
arxiv:2410.01257
arxiv:2306.03109
Models (8), sorted by recently updated
alexwb/reward_modeling_anthropic_hh_rm1e-3 • 0.3B • Updated Aug 7, 2024 • 3
alexwb/reward_modeling_anthropic_hh_rm1e-4 • 0.3B • Updated Aug 7, 2024 • 2
alexwb/reward_modeling_anthropic_hh_rm1.4e-5 • 0.3B • Updated Aug 4, 2024 • 2
alexwb/reward_modeling_anthropic_hh_rm1e-6 • 0.3B • Updated Aug 3, 2024 • 4
alexwb/reward_modeling_anthropic_hh_rm0.99 • 0.3B • Updated Aug 2, 2024 • 2
alexwb/reward_modeling_anthropic_hh_rm0.9_lr5e-5 • 0.3B • Updated Aug 2, 2024 • 5
alexwb/reward_modeling_anthropic_hh • Text Classification • 0.3B • Updated Aug 1, 2024 • 11
alexwb/sft_trl_test • Updated May 15, 2024 • 2
Datasets (0)
None public yet