Search Results for author: Deepanway Ghosal

Found 31 papers, 22 papers with code

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

1 code implementation15 Apr 2024 Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

These models do not explicitly focus on the presence of concepts or events and their temporal ordering in the output audio with respect to the input prompt.

Audio Generation

PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

2 code implementations20 Mar 2024 Yew Ken Chia, Vernon Toh Yan Han, Deepanway Ghosal, Lidong Bing, Soujanya Poria

As recognizing patterns and abstracting concepts are key to general intelligence, we introduce PuzzleVQA, a collection of puzzles based on abstract patterns.

Multimodal Reasoning

Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning

2 code implementations6 Mar 2024 Deepanway Ghosal, Vernon Toh Yan Han, Chia Yew Ken, Soujanya Poria

We present a new dataset, AlgoPuzzleVQA designed to challenge and evaluate the capabilities of multimodal language models in solving algorithmic puzzles that necessitate both visual understanding, language understanding, and complex algorithmic reasoning.

Multimodal Reasoning Question Answering +1

Caught in the Quicksand of Reasoning, Far from AGI Summit: Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions

1 code implementation17 Jan 2024 Pengfei Hong, Deepanway Ghosal, Navonil Majumder, Somak Aditya, Rada Mihalcea, Soujanya Poria

Recent advancements in Large Language Models (LLMs) have showcased striking results on existing logical reasoning benchmarks, with some models even surpassing human performance.

Arithmetic Reasoning Code Generation +3

Mustango: Toward Controllable Text-to-Music Generation

1 code implementation14 Nov 2023 Jan Melechovsky, Zixun Guo, Deepanway Ghosal, Navonil Majumder, Dorien Herremans, Soujanya Poria

Through extensive experiments, we show that the quality of the music generated by Mustango is state-of-the-art, and the controllability through music-specific text prompts greatly outperforms other models such as MusicGen and AudioLDM2.

Data Augmentation Denoising +4

Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

1 code implementation5 Jul 2023 Deepanway Ghosal, Yew Ken Chia, Navonil Majumder, Soujanya Poria

Interestingly, despite being introduced four years ago, T5-based LLMs, such as FLAN-T5, continue to outperform the latest decoder-based LLMs, such as LLAMA and VICUNA, on tasks that require general problem-solving skills.

Language Modelling Large Language Model

ReTAG: Reasoning Aware Table to Analytic Text Generation

no code implementations19 May 2023 Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer

The task of table summarization involves generating text that both succinctly and accurately represents the table or a specific set of highlighted cells within a table.

Data-to-Text Generation Descriptive +2

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

1 code implementation24 Apr 2023 Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Soujanya Poria

The immense scale of the recent large language models (LLM) allows many interesting properties, such as, instruction- and chain-of-thought-based fine-tuning, that has significantly improved zero- and few-shot performance in many natural language processing (NLP) tasks.

AudioCaps Audio Generation

Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering

1 code implementation29 Oct 2022 Deepanway Ghosal, Navonil Majumder, Rada Mihalcea, Soujanya Poria

We show the efficacy of our proposed approach in different tasks -- abductive reasoning, commonsense question answering, science question answering, and sentence completion.

Binary Classification Science Question Answering +2

Multiview Contextual Commonsense Inference: A New Dataset and Task

1 code implementation6 Oct 2022 Siqi Shen, Deepanway Ghosal, Navonil Majumder, Henry Lim, Rada Mihalcea, Soujanya Poria

Our results show that the proposed pre-training objectives are effective at adapting the pre-trained T5-Large model for the contextual commonsense inference task.

 Ranked #1 on Multiview Contextual Commonsense Inference on CICERO (using extra training data)

Multiview Contextual Commonsense Inference

Generating Intermediate Steps for NLI with Next-Step Supervision

no code implementations31 Aug 2022 Deepanway Ghosal, Somak Aditya, Monojit Choudhury

The Natural Language Inference (NLI) task often requires reasoning over multiple steps to reach the conclusion.

Data Augmentation Natural Language Inference

Exemplars-guided Empathetic Response Generation Controlled by the Elements of Human Communication

1 code implementation22 Jun 2021 Navonil Majumder, Deepanway Ghosal, Devamanyu Hazarika, Alexander Gelbukh, Rada Mihalcea, Soujanya Poria

We empirically show that these approaches yield significant improvements in empathetic response quality in terms of both automated and human-evaluated metrics.

Empathetic Response Generation Passage Retrieval +2

Recognizing Emotion Cause in Conversations

1 code implementation22 Dec 2020 Soujanya Poria, Navonil Majumder, Devamanyu Hazarika, Deepanway Ghosal, Rishabh Bhardwaj, Samson Yu Bai Jian, Pengfei Hong, Romila Ghosh, Abhinaba Roy, Niyati Chhaya, Alexander Gelbukh, Rada Mihalcea

We address the problem of recognizing emotion cause in conversations, define two novel sub-tasks of this problem, and provide a corresponding dialogue-level dataset, along with strong Transformer-based baselines.

Causal Emotion Entailment Emotion Cause Extraction

Improving Zero Shot Learning Baselines with Commonsense Knowledge

no code implementations11 Dec 2020 Abhinaba Roy, Deepanway Ghosal, Erik Cambria, Navonil Majumder, Rada Mihalcea, Soujanya Poria

Zero shot learning -- the problem of training and testing on a completely disjoint set of classes -- relies greatly on its ability to transfer knowledge from train classes to test classes.

Word Embeddings Zero-Shot Learning

Persuasive Dialogue Understanding: the Baselines and Negative Results

no code implementations19 Nov 2020 Hui Chen, Deepanway Ghosal, Navonil Majumder, Amir Hussain, Soujanya Poria

Persuasion aims at forming one's opinion and action via a series of persuasive messages containing persuader's strategies.

Attribute Dialogue Understanding +7

MIME: MIMicking Emotions for Empathetic Response Generation

1 code implementation EMNLP 2020 Navonil Majumder, Pengfei Hong, Shanshan Peng, Jiankun Lu, Deepanway Ghosal, Alexander Gelbukh, Rada Mihalcea, Soujanya Poria

Current approaches to empathetic response generation view the set of emotions expressed in the input text as a flat structure, where all the emotions are treated uniformly.

Empathetic Response Generation Response Generation

Visual Interest Prediction with Attentive Multi-Task Transfer Learning

no code implementations26 May 2020 Deepanway Ghosal, Maheshkumar H. Kolekar

Visual interest & affect prediction is a very interesting area of research in the area of computer vision.

Multi-Task Learning

KinGDOM: Knowledge-Guided DOMain adaptation for sentiment analysis

1 code implementation ACL 2020 Deepanway Ghosal, Devamanyu Hazarika, Abhinaba Roy, Navonil Majumder, Rada Mihalcea, Soujanya Poria

Cross-domain sentiment analysis has received significant attention in recent years, prompted by the need to combat the domain gap between different applications that make use of sentiment analysis.

Domain Adaptation Sentiment Analysis

DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation

2 code implementations IJCNLP 2019 Deepanway Ghosal, Navonil Majumder, Soujanya Poria, Niyati Chhaya, Alexander Gelbukh

Emotion recognition in conversation (ERC) has received much attention, lately, from researchers due to its potential widespread applications in diverse areas, such as health-care, education, and human resources.

Emotion Classification Emotion Recognition in Conversation

A Multi-task Ensemble Framework for Emotion, Sentiment and Intensity Prediction

no code implementations3 Aug 2018 Md. Shad Akhtar, Deepanway Ghosal, Asif Ekbal, Pushpak Bhattacharyya, Sadao Kurohashi

In this paper, through multi-task ensemble framework we address three problems of emotion and sentiment analysis i. e. "emotion classification & intensity", "valence, arousal & dominance for emotion" and "valence & arousal} for sentiment".

Emotion Classification General Classification +1

A Multilayer Perceptron based Ensemble Technique for Fine-grained Financial Sentiment Analysis

no code implementations EMNLP 2017 Md. Shad Akhtar, Abhishek Kumar, Deepanway Ghosal, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we propose a novel method for combining deep learning and classical feature based models using a Multi-Layer Perceptron (MLP) network for financial sentiment analysis.

Sentiment Analysis Stock Prediction +1

IITP at SemEval-2017 Task 5: An Ensemble of Deep Learning and Feature Based Models for Financial Sentiment Analysis

no code implementations SEMEVAL 2017 Deepanway Ghosal, Shobhit Bhatnagar, Md. Shad Akhtar, Asif Ekbal, Pushpak Bhattacharyya

In this paper we propose an ensemble based model which combines state of the art deep learning sentiment analysis algorithms like Convolution Neural Network (CNN) and Long Short Term Memory (LSTM) along with feature based models to identify optimistic or pessimistic sentiments associated with companies and stocks in financial texts.

Sentiment Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.