bys0318 committed on
Commit d2a101b · verified · 1 Parent(s): 4ea7d1a

Update README.md

Files changed (1)
  1. README.md +12 -2
README.md CHANGED
@@ -21,7 +21,7 @@ library_name: transformers
 
 
  <p align="center">
- 📃 <a href="https://arxiv.org" target="_blank">Paper</a> • 📝 <a href="https://arxiv.org" target="_blank">Wandb</a>
+ 📃 <a href="https://arxiv.org/abs/2509.25176" target="_blank">Paper</a> • 📝 <a href="https://api.wandb.ai/links/teamsiri/isge4elx" target="_blank">Wandb</a>
  </p>
 
  ---
@@ -61,4 +61,14 @@ SIRI solves this trade-off by **iteratively alternating between compression and
 
  ## 📝 Citation
 
- ```bibtex
+ ```bibtex
+ @misc{wen2025siriscalingiterativereinforcement,
+ title={SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression},
+ author={Haoming Wen and Yushi Bai and Juanzi Li and Jie Tang},
+ year={2025},
+ eprint={2509.25176},
+ archivePrefix={arXiv},
+ primaryClass={cs.LG},
+ url={https://arxiv.org/abs/2509.25176},
+ }
+ ```