The XSTest dataset is a test suite designed to identify exaggerated safety behaviors in large language models. It was introduced to systematically study the phenomenon where some models refuse even clearly safe prompts if they use similar language to unsafe prompts or mention sensitive topics.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages