no code implementations • 12 Jan 2024 • Thibaud Leteno, Antoine Gourru, Charlotte Laclau, Christophe Gravier
In this paper, we propose an empirical exploration of this problem by formalizing two questions: (1) Can we identify the neural mechanism(s) responsible for gender bias in BERT (and by extension DistilBERT)?
1 code implementation • 21 Nov 2023 • Thibaud Leteno, Antoine Gourru, Charlotte Laclau, Rémi Emonet, Christophe Gravier
This is better suited to real-life scenarios than existing methods that require annotations of sensitive attributes at training time.