Search Results for author: Shashwat Singh

Found 1 papers, 0 papers with code

MiMiC: Minimally Modified Counterfactuals in the Representation Space

no code implementations15 Feb 2024 Shashwat Singh, Shauli Ravfogel, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru

We demonstrate the effectiveness of the proposed approaches in mitigating bias in multiclass classification and in reducing the generation of toxic language, outperforming strong baselines.

Cannot find the paper you are looking for? You can Submit a new open access paper.