tokyotech-llm

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

s-mizuki-nlp updated a Space 5 days ago

tokyotech-llm/README

s-mizuki-nlp published a model 5 days ago

tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1-MXFP4

s-mizuki-nlp published a model 5 days ago

tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1-MXFP4

View all activity

Organization Card

Community About org cards

Swallow LLM

Research and development of large language models conducted by the members mainly in Okazaki Laboratory and Yokota Laboratory at Institute of Science Tokyo (formerly known as Tokyo Institute of Technology)

From Okazaki Laboratory, Institute of Science Tokyo, the following members:
- Naoaki Okazaki
- Sakae Mizuki
- Youmi Ma
- Koki Maeda
- Masanari Ohi
- Koshiro Saito
- Tatsuya Ichinose
- Naoya Matsushita
- Sora Miyamoto
- Nguyen Tien Dung
- Yuta Katayama
- Takaya Hiratsuka
From YOKOTA Laboratory, Institute of Science Tokyo, the following members:
- Rio Yokota
- Kazuki Fujii
- Taishi Nakamura
- Shigeki Ishida
- Masaki Kawamura
- Yukito Tajima
- Daisuke Nohara
From Artificial Intelligence Research Center, AIST, Japan, the following members:
- Hiroya Takamura

Collections 16

View 16 collections

models 134

tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1-MXFP4

Text Generation • 22B • Updated 6 days ago • 345

tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1-MXFP4

Text Generation • 120B • Updated 6 days ago • 374 • 1

tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2

Text Generation • 8B • Updated Feb 23 • 1.16k • • 5

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

Text Generation • 33B • Updated Feb 23 • 263 • • 2

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

Text Generation • 31B • Updated Feb 23 • 574

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

Text Generation • 8B • Updated Feb 23 • 224 • • 1

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4

Text Generation • 33B • Updated Feb 23 • 1.24k • 2

tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2-AWQ-INT4

Text Generation • 31B • Updated Feb 23 • 685

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2-AWQ-INT4

Text Generation • 8B • Updated Feb 23 • 1.08k • 1

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2

Text Generation • 33B • Updated Feb 23 • 3.95k • • 1

View 134 models

datasets 19

tokyotech-llm/swallow-math

Viewer • Updated Mar 1 • 4.33M • 1.15k • 47

tokyotech-llm/swallow-code

Viewer • Updated Mar 1 • 129M • 1.15k • 65

tokyotech-llm/Swallow-Nemotron-Post-Training-Dataset-v1

Viewer • Updated Feb 21 • 8.84M • 471 • 6

tokyotech-llm/lmsys-chat-1m-synth

Updated Feb 20 • 568 • 21

tokyotech-llm/s1-test-time-scaling-synth-public

Viewer • Updated Feb 19 • 59k • 35

tokyotech-llm/swallow-code-v2

Viewer • Updated Nov 8, 2025 • 147M • 72.4k • 38

tokyotech-llm/swallow-math-v2

Viewer • Updated Nov 6, 2025 • 17.4M • 19.9k • 31

tokyotech-llm/swallow_english_mt_bench

Viewer • Updated Aug 18, 2025 • 80 • 95

tokyotech-llm/MMLU-ProX-English

Updated Aug 18, 2025 • 1.12k

tokyotech-llm/MMLU-Pro-English

Updated Aug 18, 2025 • 239

View 19 datasets