Chong Ruan
Chester111
AI & ML interests
AGI & LLM
Recent Activity
authored a paper about 1 month ago
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
authored a paper 4 months ago
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
authored a paper 5 months ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning