TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Paraphrase Identification	Quora Question Pairs	Random	Accuracy	80	# 19
Natural Language Inference	SNLI	RoBERTa-large + self-explaining layer	% Test Accuracy	92.3	# 3
Natural Language Inference	SNLI	RoBERTa-large + self-explaining layer	% Train Accuracy	?	# 74
Natural Language Inference	SNLI	RoBERTa-large + self-explaining layer	Parameters	355m+	# 4
Natural Language Inference	SNLI	RoBERTa-large+Self-Explaining	% Test Accuracy	92.3	# 3
Natural Language Inference	SNLI	RoBERTa-large+Self-Explaining	Parameters	340	# 2
Sentiment Analysis	SST-5 Fine-grained classification	RoBERTa-large+Self-Explaining	Accuracy	59.1	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/self-explaining-structures-improve-nlp-models/sentiment-analysis-on-sst-5-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-5-fine-grained?p=self-explaining-structures-improve-nlp-models)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/self-explaining-structures-improve-nlp-models/natural-language-inference-on-snli)](https://paperswithcode.com/sota/natural-language-inference-on-snli?p=self-explaining-structures-improve-nlp-models)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/self-explaining-structures-improve-nlp-models/paraphrase-identification-on-quora-question)](https://paperswithcode.com/sota/paraphrase-identification-on-quora-question?p=self-explaining-structures-improve-nlp-models)`

Self-Explaining Structures Improve NLP Models

3 Dec 2020 · Zijun Sun, Chun Fan, Qinghong Han, Xiaofei Sun, Yuxian Meng, Fei Wu, Jiwei Li

Existing approaches to explaining deep learning models in NLP usually suffer from two major drawbacks: (1) the main model and the explaining model are decoupled: an additional probing or surrogate model is used to interpret an existing model, so existing explanation tools are not self-explainable; (2) the probing model can only explain a model's predictions by operating on low-level features, computing saliency scores for individual words, and is clumsy at high-level text units such as phrases, sentences, or paragraphs. To address these two issues, we propose a simple yet general and effective self-explaining framework for deep learning models in NLP. The key idea is to put an additional layer, called the interpretation layer, on top of any existing NLP model. This layer aggregates the information for each text span, associates each span with a specific weight, and feeds the weighted combination of spans to the softmax function for the final prediction. The proposed model has the following merits: (1) the span weights make the model self-explainable, with no additional probing model required for interpretation; (2) the model is general and can be adapted to any existing deep learning structure in NLP; (3) the weight associated with each text span provides a direct importance score for higher-level text units such as phrases and sentences. We show for the first time that interpretability does not come at the cost of performance: a neural model with self-explaining features performs better than its counterpart without them, achieving a new SOTA performance of 59.1 on SST-5 and a new SOTA performance of 92.3 on SNLI.
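
The interpretation layer described in the abstract can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the authors' implementation: the mean pooling used to aggregate each span, the linear `span_scorer`, and the names `interpretation_layer` and `class_weights` are all hypothetical choices made for this example. The official code is in the repository listed under Code.

```python
import math
from typing import List, Sequence, Tuple

Vector = List[float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def interpretation_layer(
    token_states: List[Vector],
    span_scorer: Vector,
    class_weights: List[Vector],
) -> Tuple[List[Tuple[int, int]], List[float], List[float]]:
    """Return (spans, span_weights, class_probs) for one input sequence.

    token_states: one hidden vector per token from any underlying model.
    span_scorer:  hypothetical linear scorer mapping a span vector to a score.
    class_weights: hypothetical output projection, one row per class.
    """
    n, dim = len(token_states), len(token_states[0])

    # Enumerate every contiguous span (i, j) with i <= j.
    spans = [(i, j) for i in range(n) for j in range(i, n)]

    # Aggregate each span into one vector. Mean pooling is an assumption
    # made for this sketch; the abstract does not specify the aggregator.
    span_reps = []
    for i, j in spans:
        width = j - i + 1
        span_reps.append(
            [sum(token_states[t][d] for t in range(i, j + 1)) / width
             for d in range(dim)]
        )

    # One weight per span; the weights sum to 1 and act as importance scores.
    span_weights = softmax([dot(span_scorer, rep) for rep in span_reps])

    # The weighted combination of span vectors feeds the final softmax.
    combined = [
        sum(w * rep[d] for w, rep in zip(span_weights, span_reps))
        for d in range(dim)
    ]
    class_probs = softmax([dot(row, combined) for row in class_weights])
    return spans, span_weights, class_probs


# Toy input: three tokens with two-dimensional hidden states.
token_states = [[0.2, -0.1], [0.9, 0.4], [-0.3, 0.8]]
spans, span_weights, class_probs = interpretation_layer(
    token_states,
    span_scorer=[1.0, 0.5],
    class_weights=[[1.0, 0.0], [0.0, 1.0]],
)
# The highest-weighted span is the model's own explanation of its prediction.
top_span = max(zip(span_weights, spans))[1]
```

Because the prediction is computed directly from `span_weights`, reading off the largest weights gives span-level importance scores without training a separate probing model, which is the property the abstract calls self-explaining.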

Code

ShannonAI/Self_Explaining_Structure… official

Tasks

Natural Language Inference

Paraphrase Identification

Sentiment Analysis

Datasets

SST

IMDb Movie Reviews

SNLI

SST-5

Quora

Quora Question Pairs

Results from the Paper

Ranked #2 on Sentiment Analysis on SST-5 Fine-grained classification

Methods

Interpretability • Softmax
