THU-KEG
/

LLaDA-8B-BGPO-sudoku

Reinforcement Learning

Model card Files Files and versions

LLaDA-8B-BGPO-sudoku

16 GB

2 contributors

History: 5 commits

nielsr's picture

nielsr HF Staff

Improve model card: Add pipeline_tag and library_name

7420387 verified 29 days ago

.gitattributes

1.52 kB

initial commit about 1 month ago
README.md

2.46 kB

Improve model card: Add pipeline_tag and library_name 29 days ago
config.json

1.44 kB

Upload folder using huggingface_hub about 1 month ago
configuration_llada.py

12.4 kB

Upload folder using huggingface_hub about 1 month ago
generation_config.json

143 Bytes

Upload folder using huggingface_hub about 1 month ago
model-00001-of-00004.safetensors

5 GB
xet

Upload folder using huggingface_hub about 1 month ago
model-00002-of-00004.safetensors

4.93 GB
xet

Upload folder using huggingface_hub about 1 month ago
model-00003-of-00004.safetensors

4.99 GB
xet

Upload folder using huggingface_hub about 1 month ago
model-00004-of-00004.safetensors

1.11 GB
xet

Upload folder using huggingface_hub about 1 month ago
model.safetensors.index.json

24.9 kB

Upload folder using huggingface_hub about 1 month ago
modeling_llada.py

68.9 kB

Upload folder using huggingface_hub about 1 month ago
tokenizer.json

6.1 MB

Upload folder using huggingface_hub about 1 month ago
tokenizer_config.json

51.7 kB

Upload folder using huggingface_hub about 1 month ago