Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings
Zero-resource cross-lingual transfer approaches aim to apply supervised models from a source language to unlabelled target languages. In this paper we perform an in-depth study of the two main techniques employed so far for cross-lingual zero-resource sequence labelling, based either on data or model transfer. Although previous research has proposed translation and annotation projection (data-based cross-lingual transfer) as an effective technique for cross-lingual sequence labelling, in this paper we experimentally demonstrate that high-capacity multilingual language models applied in a zero-shot setting (model-based cross-lingual transfer) consistently outperform data-based cross-lingual transfer approaches. A detailed analysis of our results suggests that this might be due to important differences in language use. More specifically, machine translation often generates a textual signal that differs from what the models are exposed to when using gold-standard data, which affects both the fine-tuning and evaluation processes. Our results also indicate that data-based cross-lingual transfer approaches remain a competitive option when high-capacity multilingual language models are not available.
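The data-transfer pipeline discussed above (machine-translate the annotated source data, then project the labels onto the translation) can be sketched as below. This is a minimal illustration, not the paper's implementation: the function names and the toy word alignment are hypothetical, and a real system would obtain the alignment from a word aligner such as fast_align or SimAlign.

```python
def project_labels(src_labels, alignment, tgt_len):
    """Project BIO labels from source tokens onto target tokens.

    `alignment` is a list of (source_index, target_index) pairs, as a
    word aligner would produce; unaligned target tokens are tagged "O".
    """
    tgt_labels = ["O"] * tgt_len
    for src_i, tgt_i in alignment:
        tgt_labels[tgt_i] = src_labels[src_i]
    return tgt_labels


def repair_bio(labels):
    """Fix BIO violations introduced by projection (an I- tag that does
    not continue an entity of the same type becomes a B- tag)."""
    fixed, prev_type = [], None
    for lab in labels:
        if lab.startswith("I-") and prev_type != lab[2:]:
            lab = "B-" + lab[2:]
        fixed.append(lab)
        prev_type = lab[2:] if lab != "O" else None
    return fixed


# Toy example: English source sentence with gold labels, projected onto
# a (hypothetical) Spanish translation via a one-to-one alignment.
src_labels = ["B-PER", "O", "O", "B-LOC", "I-LOC"]   # John lives in New York
alignment = [(0, 0), (1, 1), (2, 2), (3, 3), (4, 4)]  # John vive en Nueva York
projected = repair_bio(project_labels(src_labels, alignment, 5))
```

In practice, noisy many-to-one alignments are the main source of projection errors, which is one reason the translated training signal diverges from gold-standard target-language data.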
Task | Dataset | Model | Metric Name | Metric Value | Global Rank
---|---|---|---|---|---
Cross-Lingual NER | CoNLL 2003 | XLM-RoBERTa-large | Spanish F1 | 79.5 | # 1
Cross-Lingual NER | CoNLL 2003 | XLM-RoBERTa-large | German F1 | 74.5 | # 2
Cross-Lingual NER | CoNLL 2003 | XLM-RoBERTa-large | Dutch F1 | 82.3 | # 1
Cross-Lingual NER | CoNLL Dutch | XLM-R large | F1 | 79.7 | # 7
Cross-Lingual NER | CoNLL German | XLM-R large | F1 | 74.5 | # 4
Cross-Lingual NER | CoNLL Spanish | XLM-R large | F1 | 79.5 | # 1