Named Entity Recognition (NER)

886 papers with code • 76 benchmarks • 122 datasets

Named Entity Recognition (NER) is a Natural Language Processing (NLP) task that involves identifying named entities in text and classifying them into predefined categories such as person names, organizations, and locations. The goal of NER is to extract structured information from unstructured text and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities; O marks non-entity tokens.

Example:

Mark  Watney visited Mars
B-PER I-PER  O       B-LOC

(Image credit: Zalando)
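The BIO scheme above can be sketched in a few lines of plain Python. This is a minimal illustration (the function name and span format are hypothetical, not from any particular library): entity spans over tokens are converted to one tag per token.

```python
def spans_to_bio(tokens, spans):
    """Convert (start, end, label) entity spans to BIO tags.

    `spans` use inclusive token indices; overlapping spans are not handled.
    """
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        tags[start] = f"B-{label}"          # first token of the entity
        for i in range(start + 1, end + 1):  # remaining tokens, if any
            tags[i] = f"I-{label}"
    return tags

tokens = ["Mark", "Watney", "visited", "Mars"]
spans = [(0, 1, "PER"), (3, 3, "LOC")]
print(spans_to_bio(tokens, spans))  # ['B-PER', 'I-PER', 'O', 'B-LOC']
```

The B-/I- distinction matters when two entities of the same type are adjacent: a fresh B- tag signals a new entity rather than a continuation.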

Libraries

Use these libraries to find Named Entity Recognition (NER) models and implementations

Most implemented papers

A Unified MRC Framework for Named Entity Recognition

ShannonAI/mrc-for-flat-nested-ner ACL 2020

Instead of treating the task of NER as a sequence labeling problem, we propose to formulate it as a machine reading comprehension (MRC) task.
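The reformulation can be sketched as follows: each entity type is rendered as a natural-language query, and NER reduces to answer-span extraction over (query, context) pairs. The query wordings below are hypothetical placeholders, not the ones used in the paper.

```python
# Hypothetical query phrasings; the MRC framework attaches one query per entity type.
ENTITY_QUERIES = {
    "PER": "Which person names are mentioned in the text?",
    "LOC": "Which locations are mentioned in the text?",
}

def build_mrc_inputs(context):
    """Pair every entity-type query with the context for a span-extraction model."""
    return [(label, query, context) for label, query in ENTITY_QUERIES.items()]

pairs = build_mrc_inputs("Mark Watney visited Mars")
# one (label, query, context) triple per entity type
```

Because the query itself encodes the entity type, this setup naturally handles nested entities: each type is extracted in its own reading-comprehension pass.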

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

studio-ousia/luke EMNLP 2020

In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer.

TENER: Adapting Transformer Encoder for Named Entity Recognition

fastnlp/TENER 10 Nov 2019

Bidirectional long short-term memory networks (BiLSTMs) have been widely used as encoders in models for the named entity recognition (NER) task.

Few-NERD: A Few-Shot Named Entity Recognition Dataset

thunlp/Few-NERD ACL 2021

In this paper, we present Few-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types.

Optimal Hyperparameters for Deep LSTM-Networks for Sequence Labeling Tasks

UKPLab/emnlp2017-bilstm-cnn-crf 21 Jul 2017

Selecting optimal parameters for a neural network architecture can often make the difference between mediocre and state-of-the-art performance.

CamemBERT: a Tasty French Language Model

huggingface/transformers ACL 2020

We show that the use of web crawled data is preferable to the use of Wikipedia data.

Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging

UKPLab/emnlp2017-bilstm-cnn-crf EMNLP 2017

In this paper we show that reporting a single performance score is insufficient to compare non-deterministic approaches.

The Natural Language Decathlon: Multitask Learning as Question Answering

salesforce/decaNLP ICLR 2019

Though designed for decaNLP, MQAN also achieves state-of-the-art results on the WikiSQL semantic parsing task in the single-task setting.

Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction

luanyi/DyGIE EMNLP 2018

We introduce a multi-task setup of identifying and classifying entities, relations, and coreference clusters in scientific articles.

Semantic Relation Classification via Bidirectional LSTM Networks with Entity-aware Attention using Latent Entity Typing

roomylee/entity-aware-relation-classification 23 Jan 2019

Our model not only uses entities and their latent types as effective features but is also more interpretable, as shown by visualizing its attention mechanisms and the results of latent entity typing (LET).