starriver030515's picture
Upload folder using huggingface_hub
7f6224b verified
|
raw
history blame
439 Bytes
metadata
license: mit
library_name: transformers
pipeline_tag: text-generation

The base Qwen2.5-Math-1.5B model used by ReLIFT. We change to rope_theta from 10000 to 40000 and extend the context window to 16k. Also, we modify the chat_template for the system prompt and add .

Github: https://github.com/TheRoadQaQ/ReLIFT

Citation

If you find our model, data, or evaluation code useful, please kindly cite our paper: