launch/ThinkPRM-14B
Tags: Text Generation, Transformers, Safetensors, qwen2, reward-model, prm, generative reward model, process supervision, chain-of-thought, verification, math reasoning, code verification, conversational, text-generation-inference
arXiv: 2504.16828
License: apache-2.0
Add link to code and library name
#2 by nielsr (HF Staff), opened about 16 hours ago
base: refs/heads/main ← from: refs/pr/2
Files changed: +2 -3
nielsr, about 16 hours ago:
This PR improves the model card by:
- Adding a link to the GitHub repository.
- Adding the library_name metadata.
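For readers unfamiliar with the second change: on the Hub, `library_name` is a key in the YAML front matter at the top of the model card (README.md). The sketch below is a minimal, offline illustration of adding that key to a card's text. It is not the code used in this PR. The helper name `add_library_name` is hypothetical, and the value `transformers` is an assumption based on the Transformers tag shown for this model; the PR's diff is not visible here.

```python
def add_library_name(card_text: str, library_name: str = "transformers") -> str:
    """Return card_text with `library_name` set in its YAML front matter.

    Hypothetical helper for illustration; the default value is an assumption.
    """
    new_line = f"library_name: {library_name}"
    lines = card_text.split("\n")

    # Front matter is the block between a leading `---` line and the next `---` line.
    if lines and lines[0].strip() == "---":
        end = next(
            (i for i in range(1, len(lines)) if lines[i].strip() == "---"),
            None,
        )
        if end is not None:
            # Overwrite an existing top-level key instead of duplicating it.
            for i in range(1, end):
                if lines[i].startswith("library_name:"):
                    lines[i] = new_line
                    return "\n".join(lines)
            # Otherwise add the key as the last entry of the block.
            lines.insert(end, new_line)
            return "\n".join(lines)

    # No (closed) front matter found: prepend a new block.
    return "\n".join(["---", new_line, "---", card_text])


if __name__ == "__main__":
    example_card = "---\nlicense: apache-2.0\n---\n# ThinkPRM-14B\n"
    print(add_library_name(example_card))
```

The function only edits text in memory, so it can be tried without touching the repository; applying the result to the Hub would still go through a pull request like this one.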
Commit c18d3d95: Add link to code and library name
Ready to merge
This branch is ready to get merged automatically.