2 38 14

haoxintong

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Diffusion Language Models are Super Data Learners

upvoted a paper 19 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

upvoted a paper 23 days ago

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

View all activity

Organizations

upvoted a paper 3 days ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 4 days ago • 94

upvoted a paper 19 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published 21 days ago • 93

upvoted a paper 23 days ago

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

Paper • 2510.10681 • Published 28 days ago • 5

upvoted a paper about 1 month ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 76

upvoted a paper about 2 months ago

Synthetic bootstrapped pretraining

Paper • 2509.15248 • Published Sep 17 • 8

upvoted a paper 2 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

liked 2 models 3 months ago

ByteDance-Seed/Seed-OSS-36B-Base-woSyn

Text Generation • 36B • Updated Aug 26 • 335 • 51

ByteDance-Seed/Seed-OSS-36B-Base

Text Generation • 36B • Updated Aug 26 • 2.8k • 54

upvoted a collection 3 months ago

Seed-OSS

Collection

Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

liked a dataset 3 months ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Sep 18 • 228k • 447 • 75

upvoted 2 papers 3 months ago

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 19

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 113

upvoted 2 papers 4 months ago

GR-3 Technical Report

Paper • 2507.15493 • Published Jul 21 • 47

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 717

upvoted 5 papers 5 months ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published Jun 26 • 28

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 270

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 102

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published Jun 6 • 6

haoxintong

AI & ML interests

Recent Activity

Organizations

haoxintong's activity

SmolLM3: smol, multilingual, long-context reasoner