DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding

14 Sep 2017 · Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Shirui Pan, Chengqi Zhang

Recurrent neural nets (RNN) and convolutional neural nets (CNN) are widely used in NLP tasks to capture long-term and local dependencies, respectively. Attention mechanisms have recently attracted enormous interest due to their highly parallelizable computation, significantly reduced training time, and flexibility in modeling dependencies. We propose a novel attention mechanism in which the attention between elements from input sequence(s) is directional and multi-dimensional (i.e., feature-wise). A light-weight neural net, the "Directional Self-Attention Network (DiSAN)", is then proposed to learn sentence embeddings based solely on the proposed attention, without any RNN/CNN structure. DiSAN is composed only of a directional self-attention with temporal order encoded, followed by a multi-dimensional attention that compresses the sequence into a vector representation. Despite its simple form, DiSAN outperforms complicated RNN models in both prediction quality and time efficiency. It achieves the best test accuracy among all sentence-encoding methods on the Stanford Natural Language Inference (SNLI) dataset, improving the previous best result by 1.02%, and shows state-of-the-art test accuracy on the Stanford Sentiment Treebank (SST), Multi-Genre Natural Language Inference (MultiNLI), Sentences Involving Compositional Knowledge (SICK), Customer Review, MPQA, TREC question-type classification, and Subjectivity (SUBJ) datasets.
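As a rough illustration of the two components named in the abstract, the sketch below implements a simplified directional (masked), feature-wise self-attention followed by a multi-dimensional "source-to-token" compression in NumPy. The parameter names (W1, W2, W, b) and the additive-tanh compatibility function are assumptions for illustration only; the paper's exact formulation (including its fusion gate and scaled activations) differs.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def directional_self_attention(H, W1, W2, b, direction="forward"):
    """Masked token-to-token attention with feature-wise (multi-dimensional) scores.

    H: (n, d) token embeddings. W1, W2: (d, d), b: (d,) -- hypothetical
    parameters, not the paper's exact parameterization.
    """
    n, d = H.shape
    # Feature-wise compatibility: one d-dimensional score vector per (i, j) pair.
    scores = np.tanh(H @ W1)[None, :, :] + np.tanh(H @ W2)[:, None, :] + b  # (n, n, d)
    # Directional mask encoding temporal order: position i attends only to
    # earlier positions (forward) or only to later positions (backward).
    if direction == "forward":
        mask = np.tril(np.ones((n, n)), k=-1)
    else:
        mask = np.triu(np.ones((n, n)), k=1)
    scores = scores + np.where(mask[:, :, None] > 0, 0.0, -1e9)
    A = softmax(scores, axis=1)             # normalize over source positions j
    return np.einsum("ijd,jd->id", A, H)    # (n, d) context vector per position

def source2token_attention(U, W, b):
    """Multi-dimensional attention that compresses a sequence into one vector."""
    A = softmax(np.tanh(U @ W) + b, axis=0)  # (n, d) feature-wise weights over tokens
    return (A * U).sum(axis=0)               # (d,) fixed-length sentence embedding

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, d = 5, 8
    H = rng.standard_normal((n, d))
    make = lambda k: (rng.standard_normal((k, k)) * 0.1,)
    fw = directional_self_attention(H, *make(d), *make(d), np.zeros(d), direction="forward")
    bw = directional_self_attention(H, *make(d), *make(d), np.zeros(d), direction="backward")
    U = np.concatenate([fw, bw], axis=1)      # concatenate both directions: (n, 2d)
    s = source2token_attention(U, *make(2 * d), np.zeros(2 * d))
    print(s.shape)                            # (16,) sentence embedding
```

The usage stub mirrors the pipeline described above: a forward and a backward directional pass are concatenated and then compressed by the source-to-token attention into a single sentence embedding.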


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Natural Language Inference | SNLI | 300D Directional self-attention network encoders | % Test Accuracy | 85.6 | #69 |
| Natural Language Inference | SNLI | 300D Directional self-attention network encoders | % Train Accuracy | 91.1 | #39 |
| Natural Language Inference | SNLI | 300D Directional self-attention network encoders | Parameters | 2.4m | #4 |

Methods


No methods listed for this paper.