arxiv:2509.22611
Kexin Huang
737443h
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
RePO: ReLU-based Preference Optimization
authored
a paper
about 2 months ago
SPRec: Self-Play to Debias LLM-based Recommendation
authored
a paper
about 2 months ago
Quantile Advantage Estimation for Entropy-Safe Reasoning
Organizations
None yet