Search Results for author: Ramani Duraiswami

Found 7 papers, 2 papers with code

FAST: Factorizable Attention for Speeding up Transformers

no code implementations • 12 Feb 2024 • Armin Gerami, Monte Hoover, Pranav S. Dulepet, Ramani Duraiswami

Motivated by the factorization inherent in the original fast multipole method and the improved fast Gauss transform we introduce a factorable form of attention that operates efficiently in high dimensions.

Paper
Add Code

A Closer Look at the Limitations of Instruction Tuning

no code implementations • 3 Feb 2024 • Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S, Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha

Our findings reveal that responses generated solely from pre-trained knowledge consistently outperform responses by models that learn any form of new knowledge from IT on open-source datasets.

Hallucination

Paper
Add Code

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

no code implementations • 12 Oct 2023 • Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Evuru, S. Ramaneswaran, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha

In this paper, we propose CompA, a collection of two expert-annotated benchmarks with a majority of real-world audio samples, to evaluate compositional reasoning in ALMs.

Attribute Audio Classification +1

Paper
Add Code

RECAP: Retrieval-Augmented Audio Captioning

1 code implementation • 18 Sep 2023 • Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramani Duraiswami, Dinesh Manocha

We present RECAP (REtrieval-Augmented Audio CAPtioning), a novel and effective audio captioning system that generates captions conditioned on an input audio and other captions similar to the audio retrieved from a datastore.

AudioCaps Audio captioning +2

Paper
Code

Gaussian Process Models for HRTF based Sound-Source Localization and Active-Learning

no code implementations • 11 Feb 2015 • Yuancheng Luo, Dmitry N. Zotkin, Ramani Duraiswami

From a machine learning perspective, the human ability localize sounds can be modeled as a non-parametric and non-linear regression problem between binaural spectral features of sound received at the ears (input) and their sound-source directions (output).

Active Learning regression

Paper
Add Code

Software for Computing the Spheroidal Wave Functions Using Arbitrary Precision Arithmetic

1 code implementation • 1 Aug 2014 • Ross Adelman, Nail A. Gumerov, Ramani Duraiswami

The spheroidal wave functions, which are the solutions to the Helmholtz equation in spheroidal coordinates, are notoriously difficult to compute.

Mathematical Software Numerical Analysis

Paper
Code

Automatic online tuning for fast Gaussian summation

no code implementations • NeurIPS 2008 • Vlad I. Morariu, Balaji V. Srinivasan, Vikas C. Raykar, Ramani Duraiswami, Larry S. Davis

To solve the second problem, we present an online tuning approach that results in a black box method that automatically chooses the evaluation method and its parameters to yield the best performance for the input data, desired accuracy, and bandwidth.

BIG-bench Machine Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.