no code implementations • Findings (NAACL) 2022 • Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick
We present a self-supervised pre-training approach for learning rich visual language representations for both handwritten and printed historical document transcription.
no code implementations • 9 Jul 2019 • Benjamin Kiessling, Daniel Stökl Ben Ezra, Matthew Thomas Miller
The application of handwritten text recognition to historical works is highly dependant on accurate text line retrieval.
no code implementations • 28 Mar 2017 • Maxim Romanov, Matthew Thomas Miller, Sarah Bowen Savant, Benjamin Kiessling
The OpenITI team has achieved Optical Character Recognition (OCR) accuracy rates for classical Arabic-script texts in the high nineties.
Optical Character Recognition Optical Character Recognition (OCR)