ttttonyhe's Collections
Red-Teaming Datasets
Updated 10 days ago
mlabonne/orca-agentinstruct-1M-v1-cleaned • Viewer • Updated Jan 25, 2025 • 1.05M • 148 • 67
walledai/AdvBench • Viewer • Updated Jul 4, 2024 • 520 • 10.3k • 97
jkazdan/HeX-PHI-usable • Viewer • Updated Dec 26, 2024 • 300 • 6
walledai/HarmBench • Viewer • Updated Jul 31, 2024 • 400 • 13.5k • 43
allenai/wildjailbreak • Viewer • Updated Aug 8, 2024 • 2.21k • 8.91k • 127
walledai/XSTest • Viewer • Updated Jul 4, 2024 • 450 • 9.22k • 22
walledai/StrongREJECT • Viewer • Updated Oct 18, 2024 • 313 • 5.74k • 22
LLM-Tuning-Safety/HEx-PHI • Preview • Updated Aug 19, 2024 • 953 • 62