
haoxintong

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Virtual Width Networks

liked a dataset 4 days ago

PleIAs/SYNTH

liked a model 4 days ago

PleIAs/Baguettotron

Organizations

upvoted a paper 1 day ago

Virtual Width Networks

Paper • 2511.11238 • Published 4 days ago • 24

upvoted a paper 5 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 6 days ago • 157

upvoted a paper 12 days ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 13 days ago • 114

upvoted a paper 28 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published 30 days ago • 98

upvoted a paper about 1 month ago

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

Paper • 2510.10681 • Published Oct 12 • 5

upvoted 2 papers about 2 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 77

Synthetic bootstrapped pretraining

Paper • 2509.15248 • Published Sep 17 • 8

upvoted a paper 3 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

upvoted a collection 3 months ago

Seed-OSS

Collection • Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

upvoted a paper 3 months ago

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 19

upvoted 3 papers 4 months ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 113

GR-3 Technical Report

Paper • 2507.15493 • Published Jul 21 • 47

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted an article 4 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8 • 725

upvoted 6 papers 5 months ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published Jun 26 • 28

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 271

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 102

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published Jun 6 • 6

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
