Testish instruct checkpoint of Koto.
Upscale Nemo 12B to 22B
Do a continual pretrain for 1B tokens on creative data (big thanks to Allura Org for doing this)
Train on ~ 17K instruct samples.
Uses alpaca format.
Chat template
Files info
Base model