colourful-tree
AI & ML interests
None yet
Recent Activity
New activity 15 days ago in Qwen/Qwen3-14B-Base: max_lr and lr_scheduler
Updated a model 9 months ago: chuxin-llm/Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints
Upvoted a paper almost 2 years ago: RecycleGPT: An Autoregressive Language Model with Recyclable Module