LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Paper • 2501.07124 • Published Jan 13
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published Mar 26 • 57