psinghal/content

Tags: PEFT, Safetensors, trl, dpo, Generated from Trainer
content / dpo_model
  • 1 contributor
History: 1 commit
psinghal
psinghal/dpo_model
0489183 verified 10 months ago
  • README.md (5.18 kB): psinghal/dpo_model, 10 months ago
  • config.json (1.41 kB): psinghal/dpo_model, 10 months ago
  • generation_config.json (184 Bytes): psinghal/dpo_model, 10 months ago
  • model.safetensors (1.56 GB, LFS): psinghal/dpo_model, 10 months ago