Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dbest-isi
/
searchless-chess-9M-dpo
like
1
Reinforcement Learning
JAX
English
chess
dpo
direct-preference-optimization
haiku
self-play
arXiv:
2402.04494
arXiv:
2305.18290
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
searchless-chess-9M-dpo
/
model_info.json
Commit History
Upload searchless chess model
0839651
verified
dbest-isi
commited on
9 days ago