arxiv:2511.15248
Kai Yang
yangkaiSIGS
AI & ML interests
None yet
Recent Activity
authored
a paper
9 days ago
Thinking-Free Policy Initialization Makes Distilled Reasoning Models
More Effective and Efficient Reasoners
authored
a paper
9 days ago
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
updated
a Space
9 days ago
yangkaiSIGS/entropic