no code implementations • NAACL 2018 • Singh Mayank, Dogga Pradeep, Patro Sohan, Barnwal Dhiraj, Dutt Ritam, Haldar Rajarshi, Goyal Pawan, Mukherjee Animesh
In contrast to previous works, periodically crawling, indexing and processing of new incoming articles is completely automated in the current system.
no code implementations • COLING 2016 • Singh Mayank, Barua Barnopriyo, Palod Priyank, Garg Manvi, Satapathy Sidhartha, Bushi Samuel, Ayush Kumar, Rohith Krishna Sai, Gamidi Tulasi, Goyal Pawan, Mukherjee Animesh
This paper proposes OCR++, an open-source framework designed for a variety of information extraction tasks from scholarly articles including metadata (title, author names, affiliation and e-mail), structure (section headings and body text, table and figure headings, URLs and footnotes) and bibliography (citation instances and references).