SUDOER (System/User Dataset for Obedience Evaluation in Responses)

Introduced by Sanchez et al. in Stay on topic with Classifier-Free Guidance

The dataset aims to provide system prompts and user prompts for assistant. You should make random pairs and compute human preference for both system prompt obedience and user prompt relevance through A/B testing.

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Similar Datasets

SciQ

Usage

License

Unknown

Modalities

Texts

Languages

English

SUDOER (System/User Dataset for Obedience Evaluation in Responses)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit