Spaces: ivangabriele/trl-sandbox
Branch: main
Path: trl-sandbox/examples/scripts
1 contributor
History: 1 commit
Latest commit: ivangabriele, feat: initialize project (2f5127c, verified), 12 days ago
Name                  Scan   Size        Last commit                Updated
evals/                -      -           feat: initialize project   12 days ago
ppo/                  -      -           feat: initialize project   12 days ago
rloo/                 -      -           feat: initialize project   12 days ago
alignprop.py          Safe   5.14 kB     feat: initialize project   12 days ago
bco.py                Safe   5.92 kB     feat: initialize project   12 days ago
cpo.py                Safe   3.52 kB     feat: initialize project   12 days ago
ddpo.py               Safe   7.6 kB      feat: initialize project   12 days ago
dpo.py                Safe   900 Bytes   feat: initialize project   12 days ago
dpo_online.py         Safe   5.36 kB     feat: initialize project   12 days ago
dpo_vlm.py            Safe   5.73 kB     feat: initialize project   12 days ago
gkd.py                Safe   4.63 kB     feat: initialize project   12 days ago
kto.py                Safe   3.71 kB     feat: initialize project   12 days ago
nash_md.py            Safe   5.22 kB     feat: initialize project   12 days ago
orpo.py               Safe   3.61 kB     feat: initialize project   12 days ago
prm.py                Safe   4.41 kB     feat: initialize project   12 days ago
reward_modeling.py    Safe   4.76 kB     feat: initialize project   12 days ago
sft.py                Safe   900 Bytes   feat: initialize project   12 days ago
sft_gemma3.py         Safe   1.93 kB     feat: initialize project   12 days ago
sft_video_llm.py      Safe   8.32 kB     feat: initialize project   12 days ago
sft_vlm.py            Safe   4.97 kB     feat: initialize project   12 days ago
sft_vlm_gemma3.py     Safe   8.4 kB      feat: initialize project   12 days ago
sft_vlm_smol_vlm.py   Safe   5.38 kB     feat: initialize project   12 days ago
xpo.py                Safe   4.65 kB     feat: initialize project   12 days ago