1 code implementation • 22 Feb 2024 • Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue
We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time.
no code implementations • 24 Jan 2023 • Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro
We work to create a multilingual speech synthesis system which can generate speech with the proper accent while retaining the characteristics of an individual voice.
no code implementations • ICCV 2023 • Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu
It uses a multi-stage approach, combining the controllability of facial landmarks with the high-quality synthesis power of a pretrained face generator.
no code implementations • 15 Jul 2022 • Vinod Subramanian, Siddharth Gururani, Emmanouil Benetos, Mark Sandler
Loss-gradients are used to interpret the decision making process of deep learning models.
2 code implementations • 29 Jul 2020 • Ashis Pati, Siddharth Gururani, Alexander Lerch
In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.
no code implementations • 17 Jun 2020 • Karn Watcharasupat, Siddharth Gururani, Alexander Lerch
In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.
no code implementations • 21 Nov 2019 • Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto
This paper presents a simple yet effective method to achieve prosody transfer from a reference speech signal to synthesized speech.
1 code implementation • 9 Jul 2019 • Siddharth Gururani, Mohit Sharma, Alexander Lerch
While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.
no code implementations • 29 Jun 2019 • Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani
Music Information Retrieval (MIR) tends to focus on the analysis of audio signals.