Lee Junbum PRO

beomi

https://junbuml.ee

AI & ML interests

AI/ML GDE. Advancing Low-Resource Language Open Access LLM

Recent Activity

liked a Space 7 days ago

1.92k

The Smol Training Playbook: The Secrets to Building World-Class LLMs

Explore loss curves for training LLMs

liked a dataset 19 days ago

HuggingFaceFW/finewiki

Viewer • Updated 20 days ago • 61.6M • 19.1k • 241

upvoted a paper 20 days ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 25 days ago • 145

liked a model 24 days ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 6 days ago • 36.4k • 1.24k

liked 2 models 28 days ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9 • 4.01k • 344

zai-org/GLM-4.6

Text Generation • 357B • Updated Sep 30 • 62.3k • 1.01k

upvoted 2 papers 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published Aug 28 • 63

liked a model 2 months ago

SamilPwC-AXNode-GenAI/PwC-Embedding_expr

liked 4 models 3 months ago

upvoted a paper 3 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 178

liked 3 models 3 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 193k • 2.19k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.79M • 4.13k

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 21.8k • 1.38k

liked a dataset 4 months ago

NousResearch/Hermes-3-Dataset

Viewer • Updated Jul 11 • 959k • 1.36k • 291

liked 2 models 4 months ago

skt/A.X-3.1

Text Generation • 35B • Updated Jul 23 • 2.78k • 46

Qwen/Qwen3-Coder-480B-A35B-Instruct

Text Generation • 480B • Updated Aug 21 • 48.1k • 1.24k
