TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Word Sense Disambiguation	SemEval 2007 Task 17	kNN-BERT + POS (training corpus: SemCor)	F1	63.17	# 7
Word Sense Disambiguation	SemEval 2007 Task 17	kNN-BERT	F1	60.94	# 8
Word Sense Disambiguation	SemEval 2007 Task 7	kNN-BERT	F1	81.20	# 9
Word Sense Disambiguation	SemEval 2007 Task 7	kNN-BERT + POS (training corpus: WNGT)	F1	85.32	# 3
Word Sense Disambiguation	SensEval 2 Lexical Sample	kNN-BERT	F1	76.52	# 1
Word Sense Disambiguation	SensEval 3 Lexical Sample	kNN-BERT	F1	80.12	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190910430/word-sense-disambiguation-on-senseval-2-1)](https://paperswithcode.com/sota/word-sense-disambiguation-on-senseval-2-1?p=190910430)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190910430/word-sense-disambiguation-on-senseval-3)](https://paperswithcode.com/sota/word-sense-disambiguation-on-senseval-3?p=190910430)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190910430/word-sense-disambiguation-on-semeval-2007-1)](https://paperswithcode.com/sota/word-sense-disambiguation-on-semeval-2007-1?p=190910430)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/190910430/word-sense-disambiguation-on-semeval-2007)](https://paperswithcode.com/sota/word-sense-disambiguation-on-semeval-2007?p=190910430)`

Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings

23 Sep 2019 · Gregor Wiedemann, Steffen Remus, Avi Chawla, Chris Biemann ·

Contextualized word embeddings (CWE) such as provided by ELMo (Peters et al., 2018), Flair NLP (Akbik et al., 2018), or BERT (Devlin et al., 2019) are a major recent innovation in NLP. CWEs provide semantic vector representations of words depending on their respective context. Their advantage over static word embeddings has been shown for a number of tasks, such as text classification, sequence tagging, or machine translation. Since vectors of the same word type can vary depending on the respective context, they implicitly provide a model for word sense disambiguation (WSD). We introduce a simple but effective approach to WSD using a nearest neighbor classification on CWEs. We compare the performance of different CWE models for the task and can report improvements above the current state of the art for two standard WSD benchmark datasets. We further show that the pre-trained BERT model is able to place polysemic words into distinct 'sense' regions of the embedding space, while ELMo and Flair NLP do not seem to possess this ability.

PDF Abstract

Code

Add Remove Mark official

uhh-lt/bert-sense official

Tasks

Add Remove

General Classification

text-classification

Translation

Word Sense Disambiguation

Datasets

Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison

Senseval-2

Results from the Paper

Edit

Ranked #1 on Word Sense Disambiguation on SensEval 3 Lexical Sample

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Word Sense Disambiguation	SemEval 2007 Task 17	kNN-BERT + POS (training corpus: SemCor)	F1	63.17	# 7	Compare
Word Sense Disambiguation	SemEval 2007 Task 17	kNN-BERT	F1	60.94	# 8	Compare
Word Sense Disambiguation	SemEval 2007 Task 7	kNN-BERT	F1	81.20	# 9	Compare
Word Sense Disambiguation	SemEval 2007 Task 7	kNN-BERT + POS (training corpus: WNGT)	F1	85.32	# 3	Compare
Word Sense Disambiguation	SensEval 2 Lexical Sample	kNN-BERT	F1	76.52	# 1	Compare
Word Sense Disambiguation	SensEval 3 Lexical Sample	kNN-BERT	F1	80.12	# 1	Compare

Methods

Add Remove

Adam • Attention Dropout • BERT • BiLSTM • Dense Connections • Dropout • ELMo • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • LSTM • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Sigmoid Activation • Softmax • Tanh Activation • Weight Decay • WordPiece

Edit Social Preview

Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove