Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Malikeh1375 's Collections
Safety-Aligned Models
AI Safety Benchmarks
Clustered Tulu
LLM-Alignment
LLM Interpretability
Medical Datasets

AI Safety Benchmarks

updated Feb 15
Upvote
1

  • JailbreakBench/JBB-Behaviors

    Viewer • Updated Sep 26, 2024 • 500 • 29.5k • 99

  • walledai/HarmBench

    Viewer • Updated Jul 31, 2024 • 400 • 13.4k • 43

  • allenai/real-toxicity-prompts

    Viewer • Updated Sep 30, 2022 • 99.4k • 10.3k • 117
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs