EMNLP 2021 • Xi Ye, Rohan Nair, Greg Durrett
When a model attribution technique highlights a particular part of the input, a user might understand this highlight as making a statement about counterfactuals (Miller, 2019): if that part of the input were to change, the model's prediction might change as well.