A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

26 Jan 2018  ยท  Shamil Chollampatt, Hwee Tou Ng ยท

We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural approaches on this task as well as strong statistical machine translation-based systems with neural and task-specific features trained on the same data. Our analysis shows the superiority of convolutional neural networks over recurrent neural networks such as long short-term memory (LSTM) networks in capturing the local context via attention, and thereby improving the coverage in correcting grammatical errors. By ensembling multiple models, and incorporating an N-gram language model and edit features via rescoring, our novel method becomes the first neural approach to outperform the current state-of-the-art statistical machine translation-based approach, both in terms of grammaticality and fluency.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Grammatical Error Correction CoNLL-2014 Shared Task CNN Seq2Seq F0.5 54.79 # 19
Grammatical Error Correction CoNLL-2014 Shared Task (10 annotations) CNN Seq2Seq F0.5 70.14 # 3
Grammatical Error Correction JFLEG CNN Seq2Seq GLEU 57.47 # 6
Grammatical Error Correction Restricted CNN Seq2Seq F0.5 70.14 (measured by Ge et al., 2018) # 1
Grammatical Error Correction _Restricted_ CNN Seq2Seq GLEU 57.47 # 2

Methods


No methods listed for this paper. Add relevant methods here