no code implementations • 28 Nov 2023 • Soumya Banerjee, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, Partha Pratim Das
Digital libraries often face the challenge of processing a large volume of diverse document types.
Document Image Classification Optical Character Recognition (OCR)
1 code implementation • 14 Feb 2023 • Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, Partha Pratim Das
On the new MixSub dataset, where only the abstract is the input, our proposed model (when trained on the whole training corpus without distinguishing between the subject categories) achieves ROUGE-1, ROUGE-2 and ROUGE-L F1-scores of 31. 78, 9. 76 and 29. 3, respectively, METEOR score of 24. 00, and BERTScore F1 of 85. 25.
no code implementations • COLING 2020 • T.y.s.s Santosh, Debarshi Kumar Sanyal, Plaban Kumar Bhowmick, Partha Pratim Das
Keyphrases in a research paper succinctly capture the primary content of the paper and also assist in indexing the paper at a concept level.
1 code implementation • 11 May 2020 • Soumya Banerjee, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, Parthapratim Das
In the biomedical literature, it is customary to structure an abstract into discourse categories like BACKGROUND, OBJECTIVE, METHOD, RESULT, and CONCLUSION, but this segmentation is uncommon in other fields like computer science.