TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	F0.5	61.15	# 14
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	Precision	71.57	# 7
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	Recall	38.65	# 8
Grammatical Error Correction	JFLEG	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	GLEU	61.0	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-grammatical-error-correction-via/grammatical-error-correction-on-jfleg)](https://paperswithcode.com/sota/grammatical-error-correction-on-jfleg?p=improving-grammatical-error-correction-via)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-grammatical-error-correction-via/grammatical-error-correction-on-conll-2014)](https://paperswithcode.com/sota/grammatical-error-correction-on-conll-2014?p=improving-grammatical-error-correction-via)`

Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data

NAACL 2019 · Wei Zhao, Liang Wang, Kewei Shen, Ruoyu Jia, Jingming Liu

Neural machine translation systems have become the state-of-the-art approach to the Grammatical Error Correction (GEC) task. In this paper, we propose a copy-augmented architecture for the GEC task that copies the unchanged words from the source sentence to the target sentence. Because GEC lacks enough labeled training data to achieve high accuracy, we pre-train the copy-augmented architecture with a denoising auto-encoder on the unlabeled One Billion Benchmark and compare the fully pre-trained model with a partially pre-trained model. This is the first time that copying words from the source context and fully pre-training a sequence-to-sequence model have been tried on the GEC task. Moreover, we add token-level and sentence-level multi-task learning for the GEC task. The evaluation results on the CoNLL-2014 test set show that our approach outperforms all recently published state-of-the-art results by a large margin. The code and pre-trained models are released at https://github.com/zhawe01/fairseq-gec.
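The copying idea in the abstract can be illustrated with a small sketch. It assumes the final word distribution is a weighted blend of the decoder's generation distribution and a copy distribution over source tokens; the function name `mix_copy_and_generate`, the parameter `alpha_copy`, and the toy probabilities are hypothetical and are not taken from the released code.

```python
from collections import defaultdict


def mix_copy_and_generate(gen_probs, copy_attention, source_tokens, alpha_copy):
    """Blend a generation distribution with a copy distribution.

    gen_probs:      dict mapping token -> probability from the vocabulary softmax.
    copy_attention: attention weights over source positions (should sum to 1).
    source_tokens:  source tokens aligned with copy_attention.
    alpha_copy:     balancing weight in [0, 1] (hypothetical name).
    """
    if len(copy_attention) != len(source_tokens):
        raise ValueError("copy_attention and source_tokens must have equal length")
    if not 0.0 <= alpha_copy <= 1.0:
        raise ValueError("alpha_copy must lie in [0, 1]")

    mixed = defaultdict(float)
    # Generation part, scaled by (1 - alpha_copy).
    for token, prob in gen_probs.items():
        mixed[token] += (1.0 - alpha_copy) * prob
    # Copy part: attention mass on each source position goes to that token.
    for token, weight in zip(source_tokens, copy_attention):
        mixed[token] += alpha_copy * weight
    return dict(mixed)


# Toy example (made-up numbers): correcting "She go to school".
source = ["She", "go", "to", "school"]
gen = {"She": 0.1, "goes": 0.6, "go": 0.1, "to": 0.1, "school": 0.1}
attn = [0.05, 0.85, 0.05, 0.05]

mixed = mix_copy_and_generate(gen, attn, source, alpha_copy=0.3)
best = max(mixed, key=mixed.get)
```

With a small `alpha_copy` the model can still generate a correction that is absent from the source, while a large value pushes it toward reproducing the source words unchanged.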

Code

zhawe01/fairseq-gec official

243

yuantiku/fairseq-gec

243

youichiro/transformer-copy

raghavmalawat/presentationmastery

soyoung97/fairseq-gec-korean

See all 6 implementations

Tasks

Denoising

Grammatical Error Correction

Machine Translation

Multi-Task Learning

Sentence

Translation

Datasets

CoNLL

FCE

Billion Word Benchmark

JFLEG

CoNLL-2014 Shared Task: Grammatical Error Correction

One Billion Word Benchmark

Results from the Paper

Ranked #4 on Grammatical Error Correction on JFLEG

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	F0.5	61.15	# 14
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	Precision	71.57	# 7
Grammatical Error Correction	CoNLL-2014 Shared Task	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	Recall	38.65	# 8
Grammatical Error Correction	JFLEG	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	GLEU	61.0	# 4
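The CoNLL-2014 rows in the table above can be checked against each other with the standard F-beta formula, since F0.5 weights precision twice as heavily as recall. The sketch below is only a consistency check on the reported numbers; the helper name `f_beta` is an assumption, and official shared-task scoring works from edit counts rather than from rounded percentages.

```python
def f_beta(precision, recall, beta=0.5):
    """Standard F-beta score: (1 + b^2) * P * R / (b^2 * P + R)."""
    if precision == 0 and recall == 0:
        return 0.0
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Precision and recall as reported for the CoNLL-2014 Shared Task.
reported_precision = 71.57
reported_recall = 38.65

f_half = f_beta(reported_precision, reported_recall, beta=0.5)
print(round(f_half, 2))  # 61.15, matching the reported F0.5
```

The recomputed value agrees with the reported F0.5 of 61.15, so the three CoNLL-2014 metrics are mutually consistent.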
