no code implementations • 9 Dec 2023 • Mohammad Mamun Or Rashid
In this study, a Bangla toxic language dataset has been analyzed which was inputted by the user in Bengali script & language.
1 code implementation • 6 Nov 2023 • Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed
Lemmatization holds significance in both natural language processing (NLP) and linguistics, as it effectively decreases data density and aids in comprehending contextual meaning.
no code implementations • 9 Jun 2023 • Md. Ekramul Islam, Labib Chowdhury, Faisal Ahamed Khan, Shazzad Hossain, Sourave Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed, Mohammad Ruhul Amin
This study introduces SentiGOLD, a Bangla multi-domain sentiment analysis dataset.
1 code implementation • 11 May 2023 • Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Sazia Mehnaz, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Mohammad Mamun Or Rashid, Farig Sadeque
This paper proposes two libraries to address common and uncommon issues with Unicode-based writing schemes for Indic languages.
1 code implementation • 7 Apr 2023 • Shadman Rohan, Mojammel Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed
While widely studied for English and other resource-rich languages, research on coreference resolution in Bengali largely remains unexplored due to the absence of relevant datasets.