Update README.md
Browse files
README.md
CHANGED
|
@@ -11,13 +11,13 @@ license: mit
|
|
| 11 |
**Chuxin-1.6B-1M**是基于Chuxin-1.6B-base模型在1M窗口下训练后的结果,大海捞针实验显示其具有非常强的上下文检索能力。
|
| 12 |
|
| 13 |
|
| 14 |
-
如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://
|
| 15 |
|
| 16 |
**Chuxin-1.6B-Base** is a model with 1.6 billion parameters. Chuxin-1.6B is built entirely on open-source data. After being trained with large-scale data, Chuxin has very competitive capabilities in various downstream tasks.
|
| 17 |
|
| 18 |
**Chuxin-1.6B-1M** is the result of training the Chuxin-1.6B-base model with a 1M windows. Experiments such as searching for a needle in a haystack demonstrate its strong contextual retrieval abilities.
|
| 19 |
|
| 20 |
-
If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://
|
| 21 |
<br>
|
| 22 |
|
| 23 |
## 快速使用(Quickstart)
|
|
@@ -96,7 +96,7 @@ If you find our work helpful, feel free to give us a cite.
|
|
| 96 |
@article{chuxin,
|
| 97 |
title={CHUXIN: 1.6B TECHNICAL REPORT},
|
| 98 |
author={Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu},
|
| 99 |
-
journal={arXiv preprint arXiv:
|
| 100 |
year={2024}
|
| 101 |
}
|
| 102 |
```
|
|
|
|
| 11 |
**Chuxin-1.6B-1M**是基于Chuxin-1.6B-base模型在1M窗口下训练后的结果,大海捞针实验显示其具有非常强的上下文检索能力。
|
| 12 |
|
| 13 |
|
| 14 |
+
如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://arxiv.org/pdf/2405.04828)
|
| 15 |
|
| 16 |
**Chuxin-1.6B-Base** is a model with 1.6 billion parameters. Chuxin-1.6B is built entirely on open-source data. After being trained with large-scale data, Chuxin has very competitive capabilities in various downstream tasks.
|
| 17 |
|
| 18 |
**Chuxin-1.6B-1M** is the result of training the Chuxin-1.6B-base model with a 1M windows. Experiments such as searching for a needle in a haystack demonstrate its strong contextual retrieval abilities.
|
| 19 |
|
| 20 |
+
If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://arxiv.org/pdf/2405.04828).
|
| 21 |
<br>
|
| 22 |
|
| 23 |
## 快速使用(Quickstart)
|
|
|
|
| 96 |
@article{chuxin,
|
| 97 |
title={CHUXIN: 1.6B TECHNICAL REPORT},
|
| 98 |
author={Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu},
|
| 99 |
+
journal={arXiv preprint arXiv:2405.04828},
|
| 100 |
year={2024}
|
| 101 |
}
|
| 102 |
```
|