Safety classifiers fine-tuned on a bilingual dataset composed of the English QA pairs from BeaverTails and the Italian QA pairs from BeaverTails-IT.
Giuseppe Magazzù
saiteki-kai
AI & ML interests
My research focuses on the developement of safety mitigation strategies and benchmarks for large language models.
Recent Activity
liked a model about 11 hours ago
UofTCSSLab/SIREN-Qwen3-0.6B liked a model about 11 hours ago
liyang-ict/SCM-7B liked a model 1 day ago
zai-org/GLM-5.2