facebook/roberta-hate-speech-dynabench-r4-target Text Classification • 0.1B • Updated Mar 16, 2023 • 1.27M • • 94
NemoGuard Collection Essential datasets and models for content safety, topic-following, and security guardrails • 13 items • Updated 1 day ago • 14
Detecting Relevant Information in High-Volume Chat Logs: Keyphrase Extraction for Grooming and Drug Dealing Forensic Analysis Paper • 2311.04905 • Published Sep 15, 2023
Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation Paper • 2505.10588 • Published May 14 • 4
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation Paper • 2310.17389 • Published Oct 26, 2023