awesome safety resources - a roosttools Collection

updated 2 days ago

A directory of helpful datasets, tools, papers, and resources related to open source online safety.

worldbank/NaijaHate

Viewer • Updated Jan 17 • 36k • 26 • 5
tomekkorbak/pile-detoxify

Viewer • Updated Feb 7, 2023 • 1.95M • 228 • 1
chatcompanion/compAnIonv1

Updated Apr 14, 2024
CristinaMierla/PAN12_predatorTask_romanianTranslation

Preview • Updated May 17, 2024 • 15
facebook/roberta-hate-speech-dynabench-r4-target

Text Classification • 0.1B • Updated Mar 16, 2023 • 1.27M • 94
ucberkeley-dlab/measuring-hate-speech

Viewer • Updated Nov 15, 2022 • 136k • 2.01k • 41
DJK101/TransphobiaDetectionBluesky

Viewer • Updated Nov 4 • 12.4k • 13
somosnlp-hackathon-2023/suicide-comments-es

Viewer • Updated Apr 10, 2023 • 10.1k • 104 • 5
sivasothy-Tharsi/self-harm-detection

Viewer • Updated Oct 29 • 256k • 28
fmplaza/offendes

Updated Mar 22, 2024 • 45 • 11
lmsys/toxic-chat

Viewer • Updated May 14, 2024 • 20.3k • 3.62k • 173
mmathys/openai-moderation-api-evaluation

Viewer • Updated Aug 28, 2023 • 1.68k • 297 • 35
bigcode/bigcode-pii-dataset

Viewer • Updated May 15, 2023 • 12.1k • 34 • 52
bigcode/bigcode-pii-dataset-training

Viewer • Updated May 11, 2023 • 11.9k • 12 • 11
NemoGuard

Collection

Essential datasets and models for content safety, topic-following, and security guardrails • 13 items • Updated 1 day ago • 14
Detecting Relevant Information in High-Volume Chat Logs: Keyphrase Extraction for Grooming and Drug Dealing Forensic Analysis

Paper • 2311.04905 • Published Sep 15, 2023
Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation

Paper • 2505.10588 • Published May 14 • 4
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

Paper • 2310.17389 • Published Oct 26, 2023
