theainerd's Collections

Safety & Security

updated 3 days ago

  • Running
    70

    CyberSecEvalTest

    📈

    Evaluate LLMs' cybersecurity risks and capabilities


  • meta-llama/Llama-Guard-3-8B

    Text Generation • 8B • Updated Oct 11, 2024 • 291k • 240

  • meta-llama/Prompt-Guard-86M

    Text Classification • 0.3B • Updated Jul 25, 2024 • 57.9k • 279

  • Running
    17

    Prompt Injection Detection Benchmark

    📝

    Check for prompt injection in text


  • protectai/deberta-v3-base-prompt-injection-v2

    Text Classification • 0.2B • Updated May 28, 2024 • 257k • 77

  • Running on CPU Upgrade
    94

    LLM Safety Leaderboard

    🥇

    Explore and submit LLM benchmarks


  • fdtn-ai/Foundation-Sec-8B

    Text Generation • 8B • Updated Aug 26 • 8.91k • 265

    Note: Foundational base model released by Cisco for SOC operations and other cyber operations.


  • nvidia/llama-3.1-nemoguard-8b-content-safety

    Text Classification • Updated Jun 9 • 1.5k • 29

  • meta-llama/Llama-Guard-4-12B

    Image-Text-to-Text • 12B • Updated Apr 29 • 37.2k • 61

  • facebook/Meta-SecAlign-8B

    Updated Sep 30 • 1.61k • 10

  • Qwen/Qwen3Guard-Gen-8B

    Text Generation • 8B • Updated 6 days ago • 30.8k • 61

  • Qwen/Qwen3Guard-Stream-8B

    Feature Extraction • 8B • Updated 17 days ago • 442 • 27

  • openai/gpt-oss-safeguard-20b

    Text Generation • 22B • Updated 4 days ago • 4.41k • 103