16 May 2024 • Milan Bhan, Jean-Noel Vittaut, Nina Achache, Victor Legrand, Nicolas Chesneau, Annabelle Blangero, Juliette Murris, Marie-Jeanne Lesot
In this work, we propose to apply counterfactual generation methods from the eXplainable AI (XAI) field to target and mitigate textual toxicity.
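The idea of counterfactual generation for toxicity mitigation can be illustrated with a minimal toy sketch. This is not the authors' method: the lexicon, the substitution table, and the scoring function below are all invented for demonstration, standing in for a real toxicity classifier and a learned counterfactual generator.

```python
# Toy counterfactual editing: minimally substitute flagged tokens so that a
# simple lexicon-based "toxicity" scorer drops to zero. The lexicon and the
# neutral substitutes are illustrative assumptions, not real resources.

TOXIC_LEXICON = {"stupid": "questionable", "idiot": "person", "hate": "dislike"}

def toxicity_score(tokens):
    """Fraction of tokens found in the toy toxic lexicon."""
    return sum(t.lower() in TOXIC_LEXICON for t in tokens) / max(len(tokens), 1)

def counterfactual_edit(text):
    """Return a minimally edited text plus toxicity scores before and after.

    Each flagged token is replaced by its neutral substitute; all other
    tokens are left untouched, keeping the counterfactual close to the input.
    """
    tokens = text.split()
    edited = [TOXIC_LEXICON.get(t.lower(), t) for t in tokens]
    return " ".join(edited), toxicity_score(tokens), toxicity_score(edited)

new_text, before, after = counterfactual_edit("you are a stupid idiot")
```

In a realistic setting the lexicon lookup would be replaced by a trained toxicity classifier and the substitution step by a generative model searching for the smallest edit that flips the classifier's prediction.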
27 Mar 2023 • Milan Bhan, Nina Achache, Victor Legrand, Annabelle Blangero, Nicolas Chesneau
A human-grounded experiment is conducted to evaluate CLS-A and compare it with other interpretability methods.