Spaces: ivangabriele/trl-sandbox (Paused)

trl-sandbox / examples / scripts
  • 1 contributor
History: 1 commit
ivangabriele
feat: initialize project
2f5127c verified 12 days ago
  • evals
    feat: initialize project 12 days ago
  • ppo
    feat: initialize project 12 days ago
  • rloo
    feat: initialize project 12 days ago
  • alignprop.py
    5.14 kB
    feat: initialize project 12 days ago
  • bco.py
    5.92 kB
    feat: initialize project 12 days ago
  • cpo.py
    3.52 kB
    feat: initialize project 12 days ago
  • ddpo.py
    7.6 kB
    feat: initialize project 12 days ago
  • dpo.py
    900 Bytes
    feat: initialize project 12 days ago
  • dpo_online.py
    5.36 kB
    feat: initialize project 12 days ago
  • dpo_vlm.py
    5.73 kB
    feat: initialize project 12 days ago
  • gkd.py
    4.63 kB
    feat: initialize project 12 days ago
  • kto.py
    3.71 kB
    feat: initialize project 12 days ago
  • nash_md.py
    5.22 kB
    feat: initialize project 12 days ago
  • orpo.py
    3.61 kB
    feat: initialize project 12 days ago
  • prm.py
    4.41 kB
    feat: initialize project 12 days ago
  • reward_modeling.py
    4.76 kB
    feat: initialize project 12 days ago
  • sft.py
    900 Bytes
    feat: initialize project 12 days ago
  • sft_gemma3.py
    1.93 kB
    feat: initialize project 12 days ago
  • sft_video_llm.py
    8.32 kB
    feat: initialize project 12 days ago
  • sft_vlm.py
    4.97 kB
    feat: initialize project 12 days ago
  • sft_vlm_gemma3.py
    8.4 kB
    feat: initialize project 12 days ago
  • sft_vlm_smol_vlm.py
    5.38 kB
    feat: initialize project 12 days ago
  • xpo.py
    4.65 kB
    feat: initialize project 12 days ago