Towards Lingua Franca Named Entity Recognition with BERT

19 Nov 2019 · Taesun Moon, Parul Awasthy, Jian Ni, Radu Florian

Information extraction is an important task in NLP, enabling the automatic extraction of data for populating relational databases. Historically, research and data were produced for English text, followed in subsequent years by datasets in Arabic, Chinese (ACE/OntoNotes), Dutch, Spanish, German (CoNLL evaluations), and many other languages. The natural tendency has been to treat each language as a different dataset and to build optimized models for each one. In this paper we investigate a single Named Entity Recognition model, based on multilingual BERT, that is trained jointly on many languages simultaneously and is able to decode those languages with better accuracy than models trained on one language alone. To improve the initial model, we study regularization strategies such as multitask learning and partial gradient updates. In addition to being a single model that can tackle multiple languages (including code-switched text), the model can make zero-shot predictions on a new language out of the box, even one for which no training data is available. The results show that this model not only performs competitively with monolingual models, but also achieves state-of-the-art results on the CoNLL02 Dutch and Spanish datasets and the OntoNotes Arabic and Chinese datasets. Moreover, it performs reasonably well on unseen languages, achieving state-of-the-art zero-shot results on three CoNLL languages.
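
The core recipe the abstract describes — one multilingual BERT encoder with a single token-classification head, fine-tuned on the pooled training data of several languages and then applied unchanged to an unseen language — can be sketched as follows. This is a minimal illustration, not the authors' released code: the three-sentence inline corpus, the label set, and the hyperparameters are placeholders, the paper's multitask and partial-gradient-update regularizers are omitted, and the Hugging Face transformers checkpoint bert-base-multilingual-cased stands in for the paper's mBERT model.

```python
# Minimal sketch of joint multilingual NER fine-tuning with mBERT.
# Not the authors' implementation; data and labels are toy placeholders.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

LABELS = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC", "B-ORG", "I-ORG"]
LABEL2ID = {label: i for i, label in enumerate(LABELS)}

# Pooled multilingual training data: (tokens, tags) pairs from any language,
# all sharing one tag set and one model.
TRAIN = [
    (["John", "lives", "in", "Boston"],  ["B-PER", "O", "O", "B-LOC"]),  # English
    (["Juan", "vive", "en", "Madrid"],   ["B-PER", "O", "O", "B-LOC"]),  # Spanish
    (["Jan", "woont", "in", "Utrecht"],  ["B-PER", "O", "O", "B-LOC"]),  # Dutch
]

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=len(LABELS)
)

def encode(tokens, tags):
    # Align word-level tags to wordpieces; only each word's first piece
    # gets a real label, the rest get -100 (ignored by the loss).
    enc = tokenizer(tokens, is_split_into_words=True,
                    return_tensors="pt", truncation=True)
    labels, prev = [], None
    for wid in enc.word_ids():
        labels.append(-100 if wid is None or wid == prev else LABEL2ID[tags[wid]])
        prev = wid
    enc["labels"] = torch.tensor([labels])
    return enc

optim = torch.optim.AdamW(model.parameters(), lr=3e-5)
model.train()
for epoch in range(3):
    for tokens, tags in TRAIN:
        batch = encode(tokens, tags)
        loss = model(**batch).loss   # cross-entropy over labeled wordpieces
        loss.backward()
        optim.step()
        optim.zero_grad()

# Zero-shot decoding: German never appeared in TRAIN, yet the same
# weights are applied with no adaptation whatsoever.
model.eval()
tokens = ["Angela", "wohnt", "in", "Berlin"]
enc = tokenizer(tokens, is_split_into_words=True, return_tensors="pt")
with torch.no_grad():
    pred = enc_pred = model(**enc).logits.argmax(-1)[0].tolist()
prev = None
for wid, p in zip(enc.word_ids(), pred):
    if wid is not None and wid != prev:
        print(tokens[wid], LABELS[p])
    prev = wid
```

Because the wordpiece vocabulary and the tag set are shared across languages, the zero-shot step at the end needs no change at all: the unseen language is decoded by the very same weights that were fine-tuned on the pooled data, which is the setting the zero-shot results below measure.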


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Cross-Lingual NER | CoNLL Dutch   | Zero-shot mBERT | F1 | 83.35 | #1 |
| Cross-Lingual NER | CoNLL German  | Zero-shot mBERT | F1 | 72.44 | #7 |
| Cross-Lingual NER | CoNLL Spanish | Zero-shot mBERT | F1 | 76.53 | #7 |
