SSFT - a shengjia-toronto Collection

shengjia-toronto 's Collections

SSFT

SSFT

updated 2 days ago

Training Large Language Models To Reason In Parallel With Global Forking Tokens

Paper • 2510.05132 • Published Oct 1 • 1
shengjia-toronto/ssft-32B-N6

Text Generation • 4B • Updated 2 days ago • 6.03k
shengjia-toronto/grpo-test-ssft-32B

Text Generation • 33B • Updated 2 days ago • 29