Search Results for author: Siddharth Gururani

Found 9 papers, 3 papers with code

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

1 code implementation • 22 Feb 2024 • Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue

We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time.

Music Generation

Multilingual Multiaccented Multispeaker TTS with RADTTS

no code implementations • 24 Jan 2023 • Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro

We work to create a multilingual speech synthesis system which can generate speech with the proper accent while retaining the characteristics of an individual voice.

Speech Synthesis

SPACE: Speech-driven Portrait Animation with Controllable Expression

no code implementations • ICCV 2023 • Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu

SPACE uses a multi-stage approach, combining the controllability of facial landmarks with the high-quality synthesis power of a pretrained face generator.

dMelodies: A Music Dataset for Disentanglement Learning

2 code implementations • 29 Jul 2020 • Ashis Pati, Siddharth Gururani, Alexander Lerch

In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.

Benchmarking Disentanglement

Visual Attention for Musical Instrument Recognition

no code implementations • 17 Jun 2020 • Karn Watcharasupat, Siddharth Gururani, Alexander Lerch

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.

Information Retrieval • Instrument Recognition • +2

Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features

no code implementations • 21 Nov 2019 • Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto

This paper presents a simple yet effective method to achieve prosody transfer from a reference speech signal to synthesized speech.

An Attention Mechanism for Musical Instrument Recognition

1 code implementation • 9 Jul 2019 • Siddharth Gururani, Mohit Sharma, Alexander Lerch

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.

Instrument Recognition
