1 code implementation • Findings (EMNLP) 2021 • Khondoker Ittehadul Islam, Sudipta Kar, Md Saiful Islam, Mohammad Ruhul Amin
In this paper, we propose an annotated sentiment analysis dataset made of informally written Bangla texts.
no code implementations • SemEval (NAACL) 2022 • Shervin Malmasi, Anjie Fang, Besnik Fetahu, Sudipta Kar, Oleg Rokhlenko
Divided into 13 tracks, the task focused on methods to identify complex named entities (like names of movies, products and groups) in 11 languages in both monolingual and multi-lingual scenarios.
no code implementations • RANLP 2021 • Henry Gorelick, Biddut Sarker Bijoy, Syeda Jannatus Saba, Sudipta Kar, Md Saiful Islam, Mohammad Ruhul Amin
By analyzing the model parameters, we extracted the successful semantic relationships from books of 12 different genres.
no code implementations • 18 Feb 2024 • Shirley Anugrah Hayati, Taehee Jung, Tristan Bodding-Long, Sudipta Kar, Abhinav Sethy, Joo-Kyung Kim, Dongyeop Kang
Fine-tuning large language models (LLMs) with a collection of large and diverse instructions has improved the model's generalization to different tasks, even for unseen tasks.
no code implementations • 30 Oct 2023 • Chris Richardson, Yao Zhang, Kellen Gillespie, Sudipta Kar, Arshdeep Singh, Zeynab Raeesy, Omar Zia Khan, Abhinav Sethy
To overcome these limitations, we propose a novel summary-augmented approach by extending retrieval-augmented personalization with task-aware user summaries generated by LLMs.
no code implementations • 20 Oct 2023 • Besnik Fetahu, Zhiyu Chen, Sudipta Kar, Oleg Rokhlenko, Shervin Malmasi
We present MULTICONER V2, a dataset for fine-grained Named Entity Recognition covering 33 entity classes across 12 languages, in both monolingual and multilingual settings.
no code implementations • 11 May 2023 • Besnik Fetahu, Sudipta Kar, Zhiyu Chen, Oleg Rokhlenko, Shervin Malmasi
The task highlights the need for future research on improving NER robustness on noisy data containing complex entities.
Multilingual Named Entity Recognition named-entity-recognition +3
no code implementations • 22 Feb 2023 • Sudipta Kar, Giuseppe Castellucci, Simone Filice, Shervin Malmasi, Oleg Rokhlenko
In this paper, we approach the problem of incrementally expanding MTL models' capability to solve new tasks over time by distilling the knowledge of an already trained model on n tasks into a new one for solving n+1 tasks.
1 code implementation • 21 Feb 2023 • Christopher Richardson, Sudipta Kar, Anjishnu Kumar, Anand Ramachandran, Omar Zia Khan, Zeynab Raeesy, Abhinav Sethy
The retrieval system is trained on a dataset which contains ~14K multi-turn information-seeking conversations with a valid follow-up question and a set of invalid candidates.
no code implementations • COLING 2022 • Shervin Malmasi, Anjie Fang, Besnik Fetahu, Sudipta Kar, Oleg Rokhlenko
We present MultiCoNER, a large multilingual dataset for Named Entity Recognition that covers 3 domains (Wiki sentences, questions, and search queries) across 11 languages, as well as multilingual and code-mixing subsets.
no code implementations • SEMEVAL 2020 • Parth Patwa, Gustavo Aguilar, Sudipta Kar, Suraj Pandey, Srinivas PYKL, Björn Gambäck, Tanmoy Chakraborty, Thamar Solorio, Amitava Das
In this paper, we present the results of the SemEval-2020 Task 9 on Sentiment Analysis of Code-Mixed Tweets (SentiMix 2020).
no code implementations • LREC 2020 • Gustavo Aguilar, Sudipta Kar, Thamar Solorio
To facilitate research in this direction, we propose a centralized benchmark for Linguistic Code-switching Evaluation (LinCE) that combines ten corpora covering four different code-switched language pairs (i. e., Spanish-English, Nepali-English, Hindi-English, and Modern Standard Arabic-Egyptian Arabic) and four tasks (i. e., language identification, named entity recognition, part-of-speech tagging, and sentiment analysis).
no code implementations • LREC 2020 • Mahsa Shafaei, Niloofar Safi Samghabadi, Sudipta Kar, Thamar Solorio
Movies help us learn and inspire societal change.
1 code implementation • LREC 2020 • Md Zobaer Hossain, Md Ashraful Rahman, Md. Saiful Islam, Sudipta Kar
In this work, we propose an annotated dataset of ~50K news that can be used for building automated fake news detection systems for a low resource language like Bangla.
no code implementations • EMNLP (ALW) 2020 • Niloofar Safi Samghabadi, Afsheen Hatami, Mahsa Shafaei, Sudipta Kar, Thamar Solorio
We experiment with this model on our dataset and later present the analysis.
no code implementations • EMNLP 2020 • Sudipta Kar, Gustavo Aguilar, Mirella Lapata, Thamar Solorio
This paper considers the problem of characterizing stories by inferring properties such as theme and style using written synopses and reviews of movies.
no code implementations • 21 Aug 2019 • Mahsa Shafaei, Niloofar Safi Samghabadi, Sudipta Kar, Thamar Solorio
In this paper, our goal is to predict the suitability of the movie content for children and young adults based on scripts.
no code implementations • COLING 2018 • Sudipta Kar, Suraj Maharjan, Thamar Solorio
Folksonomy of movies covers a wide range of heterogeneous information about movies, like the genre, plot structure, visual experiences, soundtracks, metadata, and emotional experiences from watching a movie.
no code implementations • COLING 2018 • Niloofar Safi Samghabadi, Deepthi Mave, Sudipta Kar, Thamar Solorio
This paper presents our system for "TRAC 2018 Shared Task on Aggression Identification".
no code implementations • SEMEVAL 2016 • Marc Franco-Salvador, Sudipta Kar, Thamar Solorio, Paolo Rosso
In this work we describe the system built for the three English subtasks of the SemEval 2016 Task 3 by the Department of Computer Science of the University of Houston (UH) and the Pattern Recognition and Human Language Technology (PRHLT) research center - Universitat Polit`ecnica de Val`encia: UH-PRHLT.
1 code implementation • NAACL 2018 • Suraj Maharjan, Sudipta Kar, Manuel Montes-y-Gomez, Fabio A. Gonzalez, Thamar Solorio
Books have the power to make us feel happiness, sadness, pain, surprise, or sorrow.
no code implementations • LREC 2018 • Sudipta Kar, Suraj Maharjan, A. Pastor López-Monroy, Thamar Solorio
In this paper, we set out to the task of collecting a corpus of movie plot synopses and tags.
no code implementations • SEMEVAL 2017 • Sudipta Kar, Suraj Maharjan, Thamar Solorio
In this paper, we present our systems for the {``}SemEval-2017 Task-5 on Fine-Grained Sentiment Analysis on Financial Microblogs and News{''}.