Search Results for author: Ramani Duraiswami

Found 7 papers, 2 papers with code

FAST: Factorizable Attention for Speeding up Transformers

no code implementations12 Feb 2024 Armin Gerami, Monte Hoover, Pranav S. Dulepet, Ramani Duraiswami

Motivated by the factorization inherent in the original fast multipole method and the improved fast Gauss transform we introduce a factorable form of attention that operates efficiently in high dimensions.

A Closer Look at the Limitations of Instruction Tuning

no code implementations3 Feb 2024 Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S, Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha

Our findings reveal that responses generated solely from pre-trained knowledge consistently outperform responses by models that learn any form of new knowledge from IT on open-source datasets.

Hallucination

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

no code implementations12 Oct 2023 Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Evuru, S. Ramaneswaran, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha

In this paper, we propose CompA, a collection of two expert-annotated benchmarks with a majority of real-world audio samples, to evaluate compositional reasoning in ALMs.

Attribute Audio Classification +1

RECAP: Retrieval-Augmented Audio Captioning

1 code implementation18 Sep 2023 Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramani Duraiswami, Dinesh Manocha

We present RECAP (REtrieval-Augmented Audio CAPtioning), a novel and effective audio captioning system that generates captions conditioned on an input audio and other captions similar to the audio retrieved from a datastore.

AudioCaps Audio captioning +2

Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning

no code implementations11 Feb 2015 Yuancheng Luo, Dmitry N. Zotkin, Ramani Duraiswami

From a machine learning perspective, the human ability localize sounds can be modeled as a non-parametric and non-linear regression problem between binaural spectral features of sound received at the ears (input) and their sound-source directions (output).

Active Learning regression

Software for Computing the Spheroidal Wave Functions Using Arbitrary Precision Arithmetic

1 code implementation1 Aug 2014 Ross Adelman, Nail A. Gumerov, Ramani Duraiswami

The spheroidal wave functions, which are the solutions to the Helmholtz equation in spheroidal coordinates, are notoriously difficult to compute.

Mathematical Software Numerical Analysis

Automatic online tuning for fast Gaussian summation

no code implementations NeurIPS 2008 Vlad I. Morariu, Balaji V. Srinivasan, Vikas C. Raykar, Ramani Duraiswami, Larry S. Davis

To solve the second problem, we present an online tuning approach that results in a black box method that automatically chooses the evaluation method and its parameters to yield the best performance for the input data, desired accuracy, and bandwidth.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.