TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Grammatical Error Correction	BEA-2019 (test)	GRECO (voting+ESC)	F0.5	80.84	# 2
Grammatical Error Correction	CoNLL-2014 Shared Task	GRECO (voting+ESC)	F0.5	71.12	# 3
Grammatical Error Correction	CoNLL-2014 Shared Task	GRECO (voting+ESC)	Precision	79.6	# 4
Grammatical Error Correction	CoNLL-2014 Shared Task	GRECO (voting+ESC)	Recall	49.86	# 1
Grammatical Error Correction	CoNLL-2014 Shared Task (10 annotations)	GRECO (vote+ESC)	F0.5	85.21	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/system-combination-via-quality-estimation-for/grammatical-error-correction-on-conll-2014-1)](https://paperswithcode.com/sota/grammatical-error-correction-on-conll-2014-1?p=system-combination-via-quality-estimation-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/system-combination-via-quality-estimation-for/grammatical-error-correction-on-bea-2019-test)](https://paperswithcode.com/sota/grammatical-error-correction-on-bea-2019-test?p=system-combination-via-quality-estimation-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/system-combination-via-quality-estimation-for/grammatical-error-correction-on-conll-2014)](https://paperswithcode.com/sota/grammatical-error-correction-on-conll-2014?p=system-combination-via-quality-estimation-for)`

System Combination via Quality Estimation for Grammatical Error Correction

23 Oct 2023 · Muhammad Reza Qorib, Hwee Tou Ng ·

Quality estimation models have been developed to assess the corrections made by grammatical error correction (GEC) models when the reference or gold-standard corrections are not available. An ideal quality estimator can be utilized to combine the outputs of multiple GEC systems by choosing the best subset of edits from the union of all edits proposed by the GEC base systems. However, we found that existing GEC quality estimation models are not good enough in differentiating good corrections from bad ones, resulting in a low F0.5 score when used for system combination. In this paper, we propose GRECO, a new state-of-the-art quality estimation model that gives a better estimate of the quality of a corrected sentence, as indicated by having a higher correlation to the F0.5 score of a corrected sentence. It results in a combined GEC system with a higher F0.5 score. We also propose three methods for utilizing GEC quality estimation models for system combination with varying generality: model-agnostic, model-agnostic with voting bias, and model-dependent method. The combined GEC system outperforms the state of the art on the CoNLL-2014 test set and the BEA-2019 test set, achieving the highest F0.5 scores published to date.

PDF Abstract

Code

Add Remove Mark official

nusnlp/greco official

Tasks

Add Remove

Grammatical Error Correction

Sentence

Datasets

CoNLL FCE

CoNLL-2014 Shared Task: Grammatical Error Correction

WI-LOCNESS

Results from the Paper

Edit

Ranked #1 on Grammatical Error Correction on CoNLL-2014 Shared Task (10 annotations)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Grammatical Error Correction	BEA-2019 (test)	GRECO (voting+ESC)	F0.5	80.84	# 2	Compare
Grammatical Error Correction	CoNLL-2014 Shared Task	GRECO (voting+ESC)	F0.5	71.12	# 3	Compare
			Precision	79.6	# 4	Compare
			Recall	49.86	# 1	Compare
Grammatical Error Correction	CoNLL-2014 Shared Task (10 annotations)	GRECO (vote+ESC)	F0.5	85.21	# 1	Compare

Methods

Add Remove

BASE

Edit Social Preview

System Combination via Quality Estimation for Grammatical Error Correction

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove