Word Sense Disambiguation

143 papers with code • 15 benchmarks • 15 datasets

The task of Word Sense Disambiguation (WSD) consists of associating words in context with their most suitable entry in a pre-defined sense inventory. The de-facto sense inventory for English in WSD is WordNet. For example, given the word “mouse” and the following sentence:

“A mouse consists of an object held in one's hand, with one or more buttons.”

we would assign “mouse” with its electronic device sense (the 4th sense in the WordNet sense inventory).

Benchmarks

Add a Result

These leaderboards are used to track progress in Word Sense Disambiguation

Dataset	Best Model	Compare
Words in Context	COSINE + Transductive Learning	See all
Supervised:	ConSeC+WNGC	See all
SemEval 2013 Task 12	SemCor+WNGC, hypernyms	See all
SensEval 2	SemCor+WNGC, hypernyms	See all
SensEval 3 Task 1	SemCor+WNGC, hypernyms	See all
SemEval 2007 Task 7	SemCor+WNGC, hypernyms	See all
SemEval 2007 Task 17	SemCor+WNGC, hypernyms	See all
Knowledge-based:	KEF	See all
SemEval 2015 Task 13	SemCor+WNGC, hypernyms	See all
WiC-TSV	transformers	See all
BIG-bench (Anachronisms)	Chinchilla-70B (few-shot, k=5)	See all
RUSSE	Human Benchmark	See all
SensEval 3 Lexical Sample	kNN-BERT	See all
SensEval 2 Lexical Sample	kNN-BERT	See all
TS50	SPIN	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Word Sense Disambiguation models and implementations

danlou/LMMS

4 papers

huggingface/transformers

3 papers

125,334

allenai/dolma

2 papers

775

volcengine/vegiantmodel

2 papers

197

Datasets

Subtasks

Word Sense Induction

Most implemented papers

Most implemented Social Latest No code

Chinese Word Sense Embedding with SememeWSD and Synonym Set

2023-MindSpore-1/ms-code-196 • • 29 Jun 2022

To address this limitation, we propose SememeWSD Synonym (SWSDS) model to assign a different vector to every sense of polysemous words with the help of word sense disambiguation (WSD) and synonym set in OpenHowNet.

Paper
Code

N-Grammer: Augmenting Transformers with latent n-grams

tensorflow/lingvo • • 13 Jul 2022

Transformer models have recently emerged as one of the foundational models in natural language processing, and as a byproduct, there is significant recent interest and investment in scaling these models.

Paper
Code

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

joeljang/elm • • 7 Feb 2023

Recently, Language Models (LMs) instruction-tuned on multiple tasks, also known as multitask-prompted fine-tuning (MT), have shown the capability to generalize to unseen tasks.

Paper
Code

The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

kaistai/cot-collection • • 23 May 2023

Furthermore, we show that instruction tuning with CoT Collection allows LMs to possess stronger few-shot learning capabilities on 4 domain-specific tasks, resulting in an improvement of +2. 24% (Flan-T5 3B) and +2. 37% (Flan-T5 11B), even outperforming ChatGPT utilizing demonstrations until the max length by a +13. 98% margin.

Paper
Code

FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-gram Language Model

denizyuret/fastsubs-googlecode • 24 May 2012

Lexical substitutes have found use in areas such as paraphrasing, text simplification, machine translation, word sense disambiguation, and part of speech induction.

Paper
Code