Search Results for author: Siddharth Gururani

Found 9 papers, 3 papers with code

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

1 code implementation • 22 Feb 2024 • Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue

We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time.

Music Generation

Paper
Code

Multilingual Multiaccented Multispeaker TTS with RADTTS

no code implementations • 24 Jan 2023 • Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro

We work to create a multilingual speech synthesis system which can generate speech with the proper accent while retaining the characteristics of an individual voice.

Speech Synthesis

Paper
Add Code

SPACE: Speech-driven Portrait Animation with Controllable Expression

no code implementations • ICCV 2023 • Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu

It uses a multi-stage approach, combining the controllability of facial landmarks with the high-quality synthesis power of a pretrained face generator.

Paper
Add Code

Anomalous behaviour in loss-gradient based interpretability methods

no code implementations • 15 Jul 2022 • Vinod Subramanian, Siddharth Gururani, Emmanouil Benetos, Mark Sandler

Loss-gradients are used to interpret the decision making process of deep learning models.

Decision Making

Paper
Add Code

dMelodies: A Music Dataset for Disentanglement Learning

2 code implementations • 29 Jul 2020 • Ashis Pati, Siddharth Gururani, Alexander Lerch

In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.

Benchmarking Disentanglement

Paper
Code

Visual Attention for Musical Instrument Recognition

no code implementations • 17 Jun 2020 • Karn Watcharasupat, Siddharth Gururani, Alexander Lerch

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.

Information Retrieval Instrument Recognition +2

Paper
Add Code

Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features

no code implementations • 21 Nov 2019 • Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto

This paper presents a simple yet effective method to achieve prosody transfer from a reference speech signal to synthesized speech.

Paper
Add Code

An Attention Mechanism for Musical Instrument Recognition

1 code implementation • 9 Jul 2019 • Siddharth Gururani, Mohit Sharma, Alexander Lerch

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.

Instrument Recognition

Paper
Code

Music Performance Analysis: A Survey

no code implementations • 29 Jun 2019 • Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani

Music Information Retrieval (MIR) tends to focus on the analysis of audio signals.

Information Retrieval Music Information Retrieval +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.