Search Results for author: Milan Bhan

Found 4 papers, 0 papers with code

Mitigating Text Toxicity with Counterfactual Generation

no code implementations16 May 2024 Milan Bhan, Jean-Noel Vittaut, Nina Achache, Victor Legrand, Nicolas Chesneau, Annabelle Blangero, Juliette Murris, Marie-Jeanne Lesot

In this work, we propose to apply counterfactual generation methods from the eXplainable AI (XAI) field to target and mitigate textual toxicity.

counterfactual Feature Importance

Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations

no code implementations19 Feb 2024 Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot

Incorporating natural language rationales in the prompt and In-Context Learning (ICL) has led to a significant improvement of Large Language Models (LLMs) performance.

In-Context Learning

TIGTEC : Token Importance Guided TExt Counterfactuals

no code implementations24 Apr 2023 Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot

Counterfactual examples explain a prediction by highlighting changes of instance that flip the outcome of a classifier.

counterfactual Feature Importance

Evaluating self-attention interpretability through human-grounded experimental protocol

no code implementations27 Mar 2023 Milan Bhan, Nina Achache, Victor Legrand, Annabelle Blangero, Nicolas Chesneau

A human-grounded experiment is conducted to evaluate and compare CLS-A to other interpretability methods.

Cannot find the paper you are looking for? You can Submit a new open access paper.