WikiMulti: a Corpus for Cross-Lingual Summarization

23 Apr 2022  ·  Pavel Tikhonov, Valentin Malykh ·

Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti

PDF Abstract

Datasets


Introduced in the Paper:

WikiMulti

Used in the Paper:

XL-Sum MLSUM Global Voices

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods


No methods listed for this paper. Add relevant methods here