1 code implementation • Findings (EMNLP) 2021 • Allen Kim, Charuta Pethe, Naoya Inoue, Steve Skiena
We present methods to handle these errors, evaluated on a collection of 19, 347 texts from the Project Gutenberg dataset and 96, 635 texts from the HathiTrust Library.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • COLING 2018 • Vivek Kulkarni, Yingtao Tian, D, Parth iwala, Steve Skiena
We present domain independent models to date documents based only on neologism usage patterns.