seung hwan jung
digit82
AI & ML interests
None yet

Organizations
None yet
Collections
Korean Pretraining Dataset
Reasoning Dataset
SFT Dataset
Pretraining Dataset
Paper
llm
- Training Language Models to Self-Correct via Reinforcement Learning
  Paper • 2409.12917 • Published • 140
- MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
  Paper • 2409.20566 • Published • 56
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
  Paper • 2411.04996 • Published • 51
- EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
  Paper • 2410.21271 • Published • 7