Explore our dataset and see how different supervision systems perform against various types of prompts.
Harmful Content: 🛡️ Detected / ⚠️ Not detected
Benign Content: ✓ Allowed / ! Blocked
Borderline Content: ⚖️ Flagged / ➖ Not flagged
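The legend above pairs each content category with the two possible supervision outcomes. A minimal sketch of that mapping (the `Result` shape and field names here are illustrative assumptions, not the dataset's actual schema):

```python
from dataclasses import dataclass

@dataclass
class Result:
    category: str   # "harmful", "benign", or "borderline" (assumed labels)
    flagged: bool   # whether the supervision system flagged the prompt

def outcome(r: Result) -> str:
    """Map a (category, verdict) pair to its legend label."""
    labels = {
        ("harmful", True): "Detected",
        ("harmful", False): "Not detected",
        ("benign", True): "Blocked",
        ("benign", False): "Allowed",
        ("borderline", True): "Flagged",
        ("borderline", False): "Not flagged",
    }
    return labels[(r.category, r.flagged)]

# Example: a harmful prompt that was caught, and a benign prompt let through.
results = [Result("harmful", True), Result("benign", False)]
print([outcome(r) for r in results])  # ['Detected', 'Allowed']
```

Note that the desirable outcome depends on the category: flagging is a success for harmful prompts but a failure (over-blocking) for benign ones.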
⚠️ Note: These are carefully selected examples of adversarial prompts, shown for educational and research purposes only.
The complete dataset and the specific jailbreak techniques are not publicly shared, to prevent misuse.