no code implementations • 11 Feb 2024 • Madhav Khirwar, Ankur Narang
Air pollution represents a pivotal environmental challenge globally, playing a major role in climate change via greenhouse gas emissions and negatively affecting the health of billions.
no code implementations • 24 Nov 2023 • Madhav Khirwar, Ankur Narang
Greenhouse gases are pivotal drivers of climate change, necessitating precise quantification and source identification to foster mitigation strategies.
no code implementations • 27 Oct 2023 • Neeraj Kumar, Ankur Narang, Brejesh lall
In this paper, we present a Diffusion GAN based approach (Prosodic Diff-TTS) to generate the corresponding high-fidelity speech based on the style description and content text as an input to generate speech samples within only 4 denoising steps.
no code implementations • 21 Dec 2022 • Neeraj Kumar, Ankur Narang, Brejesh lall
A lot of normalization techniques have been proposed but the success of normalization in low resource downstream NLP and speech tasks is limited.
no code implementations • 19 Feb 2021 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall, Mujtaba Hasan, Pranshu Agarwal, Dipankar Sarkar
We propose a novel method OneShotAu2AV to generate an animated video of arbitrary length using an audio clip and a single unseen image of a person as an input.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Mujtaba Hasan
High-quality video generation with expressive facial movements is a challenging problem that involves complex learning steps for generative adversarial networks.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall
The multi-modal adaptive normalization uses the various features of audio and video such as Mel spectrogram, pitch, energy from audio signals and predicted keypoint heatmap/optical flow and a single image to learn the respective affine parameters to generate highly expressive video.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall
High quality multi-speaker speech synthesis while considering prosody and in a few shot manner is an area of active research with many real-world applications.
no code implementations • 14 Nov 2020 • Dipankar Sarkar, Sumit Rai, Ankur Narang
Federated learning has allowed the training of statistical models over remote devices without the transfer of raw client data.
no code implementations • 12 Nov 2020 • Dipankar Sarkar, Ankur Narang, Sumit Rai
The Federated Learning setting has a central server coordinating the training of a model on a network of devices.
no code implementations • 7 Feb 2019 • Abhishek Laddha, Mohamed Hanoosh, Debdoot Mukherjee, Parth Patwa, Ankur Narang
In the subsequent steps, we predict the message cluster instead of the message.
1 code implementation • 17 Dec 2012 • Suman K. Bera, Sourav Dutta, Ankur Narang, Souvik Bhattacherjee
In this work, we present several novel algorithms for the problem of approximate detection of duplicates in data streams.