GTAlign
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare
GTAlign applies game-theoretic principles to fine-tune reasoning LLMs, encouraging them to make decisions that are not only accurate but also rational, cooperative, and transparent in dialogue settings.
Models
We have released five model checkpoints, and we are preparing more thoroughly trained models.
models
5
GTAlign/Qwen2.5-3B-Medium-110step
Text Generation
•
3B
•
Updated
•
32
GTAlign/Qwen2.5-3B-Full-160step
Text Generation
•
3B
•
Updated
•
34
GTAlign/Qwen2.5-3B-Math-140step
Text Generation
•
3B
•
Updated
•
36
GTAlign/Qwen2.5-3B-AbgQA-140step
Text Generation
•
3B
•
Updated
•
38
GTAlign/Qwen2.5-3B-WildGuard-140step
Text Generation
•
3B
•
Updated
•
38
datasets
0
None public yet