Search Results for author: Ankur Narang

Found 12 papers, 1 paper with code

GeoFormer: A Vision and Sequence Transformer-based Approach for Greenhouse Gas Monitoring

no code implementations • 11 Feb 2024 • Madhav Khirwar, Ankur Narang

Air pollution represents a pivotal environmental challenge globally, playing a major role in climate change via greenhouse gas emissions and negatively affecting the health of billions.

Time Series

GeoViT: A Versatile Vision Transformer Architecture for Geospatial Image Analysis

no code implementations • 24 Nov 2023 • Madhav Khirwar, Ankur Narang

Greenhouse gases are pivotal drivers of climate change, necessitating precise quantification and source identification to foster mitigation strategies.

Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN

no code implementations • 27 Oct 2023 • Neeraj Kumar, Ankur Narang, Brejesh Lall

In this paper, we present a Diffusion GAN based approach (Prosodic Diff-TTS) that takes a style description and content text as input and generates high-fidelity speech samples within only 4 denoising steps.

Decoder, Denoising

KL Regularized Normalization Framework for Low Resource Tasks

no code implementations • 21 Dec 2022 • Neeraj Kumar, Ankur Narang, Brejesh Lall

Many normalization techniques have been proposed, but their success in low-resource downstream NLP and speech tasks is limited.

One Shot Audio to Animated Video Generation

no code implementations • 19 Feb 2021 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh Lall, Mujtaba Hasan, Pranshu Agarwal, Dipankar Sarkar

We propose a novel method, OneShotAu2AV, to generate an animated video of arbitrary length using an audio clip and a single unseen image of a person as input.

Video Generation

Robust One Shot Audio to Video Generation

no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Mujtaba Hasan

High-quality video generation with expressive facial movements is a challenging problem that involves complex learning steps for generative adversarial networks.

Generative Adversarial Network, Marketing (+3 more)

Multi Modal Adaptive Normalization for Audio to Video Generation

no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh Lall

The multi-modal adaptive normalization uses features from several modalities (the Mel spectrogram, pitch, and energy of the audio signal, the predicted keypoint heatmap or optical flow, and a single image) to learn the respective affine parameters and generate highly expressive video.

Optical Flow Estimation, SSIM (+1 more)

Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis

no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh Lall

High-quality, prosody-aware multi-speaker speech synthesis in a few-shot setting is an area of active research with many real-world applications.

Cultural Vocal Bursts Intensity Prediction, Speech Synthesis

CatFedAvg: Optimising Communication-efficiency and Classification Accuracy in Federated Learning

no code implementations • 14 Nov 2020 • Dipankar Sarkar, Sumit Rai, Ankur Narang

Federated learning has allowed the training of statistical models over remote devices without the transfer of raw client data.

Classification, Federated Learning (+1 more)

Fed-Focal Loss for imbalanced data classification in Federated Learning

no code implementations • 12 Nov 2020 • Dipankar Sarkar, Ankur Narang, Sumit Rai

The Federated Learning setting has a central server coordinating the training of a model on a network of devices.

Classification, Federated Learning (+2 more)

Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams

1 code implementation • 17 Dec 2012 • Suman K. Bera, Sourav Dutta, Ankur Narang, Souvik Bhattacherjee

In this work, we present several novel algorithms for the problem of approximate detection of duplicates in data streams.

Management
