Dialogue Safety Prediction
2 papers with code • 2 benchmarks • 2 datasets
Determine the safety of a given dialogue context.
Most implemented papers
ProsocialDialog: A Prosocial Backbone for Conversational Agents
With this dataset, we introduce a dialogue safety detection module, Canary, capable of generating RoTs given conversational context, and a socially-informed dialogue agent, Prost.
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
In this research, we used OpenAI GPT as point of comparison since it excels at all levels of safety.