Mohammed Mohammed Ali PRO
MohammedEltoum
AI & ML interests
None yet
Recent Activity
reacted to sergiopaniego's post 1 day ago
Just included example scripts for aligning models using GSPO (including VLM example)
GSPO is the latest RL alignment algo by @Alibaba_Qwen and it's already supported in the latest TRL v0.20 release.
Super-easy-to-get-started example scripts below, GO run them!
Script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo.py
VLM script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo_vlm.py
More TRL examples: https://huggingface.co/docs/trl/main/en/example_overview
GSPO paper: https://huggingface.co/papers/2507.18071
liked a Space 7 days ago
google/appoint-ready