SpecExit: Accelerating Large Reasoning Model via Speculative Exit Paper • 2509.24248 • Published Sep 29 • 1
Tequila: Trapping-free Ternary Quantization for Large Language Models Paper • 2509.23809 • Published Sep 28 • 2