Search Results for author: Utkarsh Tyagi

Found 12 papers, 6 papers with code

Do Vision-Language Models Understand Compound Nouns?

no code implementations • 30 Mar 2024 • Sonal Kumar, Sreyan Ghosh, S Sakshi, Utkarsh Tyagi, Dinesh Manocha

We curate Compun, a novel benchmark with 400 unique and commonly used CNs, to evaluate the effectiveness of VLMs in interpreting CNs.

Image Retrieval Language Modelling +2

Paper
Add Code

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

no code implementations • 30 Mar 2024 • Chandra Kiran Reddy Evuru, Sreyan Ghosh, Sonal Kumar, Ramaneswaran S, Utkarsh Tyagi, Dinesh Manocha

We present CoDa (Constrained Generation based Data Augmentation), a controllable, effective, and training-free data augmentation technique for low-resource (data-scarce) NLP.

Data Augmentation Instruction Following

Paper
Add Code

DALE: Generative Data Augmentation for Low-Resource Legal NLP

1 code implementation • 24 Oct 2023 • Sreyan Ghosh, Chandra Kiran Evuru, Sonal Kumar, S Ramaneswaran, S Sakshi, Utkarsh Tyagi, Dinesh Manocha

We present DALE, a novel and effective generative Data Augmentation framework for low-resource LEgal NLP.

Data Augmentation Denoising +2

Paper
Code

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

no code implementations • 12 Oct 2023 • Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Evuru, S. Ramaneswaran, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha

In this paper, we propose CompA, a collection of two expert-annotated benchmarks with a majority of real-world audio samples, to evaluate compositional reasoning in ALMs.

Attribute Audio Classification +1

Paper
Add Code

AdVerb: Visually Guided Audio Dereverberation

no code implementations • ICCV 2023 • Sanjoy Chowdhury, Sreyan Ghosh, Subhrajyoti Dasgupta, Anton Ratnarajah, Utkarsh Tyagi, Dinesh Manocha

We present AdVerb, a novel audio-visual dereverberation framework that uses visual cues in addition to the reverberant sound to estimate clean audio.

Speaker Verification Speech Enhancement +2

Paper
Add Code

ASPIRE: Language-Guided Augmentation for Robust Image Classification

no code implementations • 19 Aug 2023 • Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Sakshi Singh, Sanjoy Chowdhury, Dinesh Manocha

This paper presents ASPIRE (Language-guided data Augmentation for SPurIous correlation REmoval), a simple yet effective solution for expanding the training dataset with synthetic images without spurious features.

Classification Data Augmentation +2

Paper
Add Code

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

1 code implementation • 1 Jun 2023 • Sreyan Ghosh, Utkarsh Tyagi, Manan Suri, Sonal Kumar, S Ramaneswaran, Dinesh Manocha

In addition, we demonstrate the application of ACLM to other domains that suffer from data scarcity (e. g., biomedical).

Data Augmentation Denoising +5

Paper
Code

BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER

1 code implementation • 18 May 2023 • Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Dinesh Manocha

Though data augmentation has shown to be highly effective for low-resource NER in general, existing data augmentation techniques fail to produce factual and diverse augmentations for BioNER.

Data Augmentation named-entity-recognition +2

Paper
Code

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network

1 code implementation • 2 Mar 2023 • Sreyan Ghosh, Manan Suri, Purva Chiniya, Utkarsh Tyagi, Sonal Kumar, Dinesh Manocha

The tremendous growth of social media users interacting in online conversations has led to significant growth in hate speech, affecting people from various demographics.

Paper
Code

A novel multimodal dynamic fusion network for disfluency detection in spoken utterances

no code implementations • 27 Nov 2022 • Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Manan Suri, Rajiv Ratn Shah

Based on early-fusion and self-attention-based multimodal interaction between text and acoustic modalities, in this paper, we propose a novel multimodal architecture for disfluency detection from individual utterances.

Paper
Add Code

MMER: Multimodal Multi-task Learning for Speech Emotion Recognition

1 code implementation • 31 Mar 2022 • Sreyan Ghosh, Utkarsh Tyagi, S Ramaneswaran, Harshvardhan Srivastava, Dinesh Manocha

In this paper, we propose MMER, a novel Multimodal Multi-task learning approach for Speech Emotion Recognition.

Ranked #2 on Speech Emotion Recognition on IEMOCAP (using extra training data)

Multi-Task Learning Speech Emotion Recognition

Paper
Code

M-MELD: A Multilingual Multi-Party Dataset for Emotion Recognition in Conversations

1 code implementation • 31 Mar 2022 • Sreyan Ghosh, S Ramaneswaran, Utkarsh Tyagi, Harshvardhan Srivastava, Samden Lepcha, S Sakshi, Dinesh Manocha

Expression of emotions is a crucial part of daily human communication.

Emotion Recognition

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.