GTAlign

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

zsqzz updated a model 1 day ago

GTAlign/Qwen2.5-3B-Medium-110step

zsqzz updated a model 1 day ago

GTAlign/Qwen2.5-3B-Full-160step

zsqzz updated a model 1 day ago

GTAlign/Qwen2.5-3B-Math-140step

View all activity

Organization Card

Community About org cards

GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

GTAlign applies game-theoretic principles to fine-tune reasoning LLMs, encouraging them to make decisions that are not only accurate but also rational, cooperative, and transparent in dialogue settings.

Models

We have released five model checkpoints, and we are preparing more thoroughly trained models.

Model Name	Size	Dataset	Hugging Face Link
`GTAlign/Qwen2.5-3B-Math-140step`	3B	Math	Model
`GTAlign/Qwen2.5-3B-Medium-110step`	3B	Medium	Model
`GTAlign/Qwen2.5-3B-AbgQA-140step`	3B	Ambig-QA	Model
`GTAlign/Qwen2.5-3B-WildGuard-140step`	3B	WildGuard	Model
`GTAlign/Qwen2.5-3B-Full-160step`	3B	Full	Model

Collections 1

models 5

datasets 0

None public yet

AI & ML interests

Recent Activity

Team members 1

GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

Models

Collections 1

models 5 Sort: Recently updated

datasets 0

models 5