theainerd's Collections
Safety & Security

updated 25 days ago
  • Running
    67

    CyberSecEvalTest

    📈

    Evaluate LLM cybersecurity risks


  • meta-llama/Llama-Guard-3-8B

    Text Generation • Updated Oct 11, 2024 • 343k • 201

  • meta-llama/Prompt-Guard-86M

    Text Classification • Updated Jul 25, 2024 • 7.94k • 259

  • Running
    16

    Prompt Injection Detection Benchmark

    📝

    Detect prompt injection risks


  • protectai/deberta-v3-base-prompt-injection-v2

    Text Classification • Updated May 28, 2024 • 456k • 58

  • Running on CPU Upgrade
    92

    LLM Safety Leaderboard

    🥇

    View and submit machine learning model evaluations


  • fdtn-ai/Foundation-Sec-8B

    Text Generation • Updated 14 days ago • 26.6k • 207

    Note: Foundational base model released by Cisco for SOC operations and other cyber operations.


  • meta-llama/Llama-Guard-4-12B

    Image-Text-to-Text • Updated Apr 29 • 47k • 44

  • nvidia/llama-3.1-nemoguard-8b-content-safety

    Text Classification • Updated 16 days ago • 633 • 24