MLQE-PE (Multilingual Quality Estimation and Automatic Post-editing Dataset)

Introduced by Fomicheva et al. in MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset

The Multilingual Quality Estimation and Automatic Post-editing (MLQE-PE) Dataset is a dataset for Machine Translation (MT) Quality Estimation (QE) and Automatic Post-Editing (APE). The dataset contains seven language pairs, with human labels for 9,000 translations per language pair in the following formats: sentence-level direct assessments and post-editing effort, and word-level good/bad labels. It also contains the post-edited sentences, as well as titles of the articles where the sentences were extracted from, and the neural MT models used to translate the text.

Source: https://github.com/sheffieldnlp/mlqe-pe

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Similar Datasets