Spaces: ivangabriele/trl-sandbox
main: trl-sandbox/examples/datasets
1 contributor. History: 1 commit.
Latest commit: ivangabriele, "feat: initialize project" (2f5127c, verified), 12 days ago
File                                     Scan  Size     Last commit               Updated
hh-rlhf-helpful-base.py                  Safe  5.41 kB  feat: initialize project  12 days ago
lm-human-preferences-descriptiveness.py  Safe  4.94 kB  feat: initialize project  12 days ago
lm-human-preferences-sentiment.py        Safe  4.62 kB  feat: initialize project  12 days ago
math_shepherd.py                         Safe  6.56 kB  feat: initialize project  12 days ago
prm800k.py                               Safe  6.08 kB  feat: initialize project  12 days ago
rlaif-v.py                               Safe  4.64 kB  feat: initialize project  12 days ago
tldr.py                                  Safe  4.29 kB  feat: initialize project  12 days ago
tldr_preference.py                       Safe  4.46 kB  feat: initialize project  12 days ago
ultrafeedback-prompt.py                  Safe  3.53 kB  feat: initialize project  12 days ago
ultrafeedback.py                         Safe  5.53 kB  feat: initialize project  12 days ago