Search Results for author: Ravi Shankar

Found 8 papers, 0 papers with code

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

no code implementations • 3 Mar 2024 • Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others.

Automatic Speech Recognition Keyword Spotting +5

Paper
Add Code

A Comparative Study of Data Augmentation Techniques for Deep Learning Based Emotion Recognition

no code implementations • 9 Nov 2022 • Ravi Shankar, Abdouh Harouna Kenfack, Arjun Somayazulu, Archana Venkataraman

In parallel to these models, researchers have proposed several data augmentation techniques to increase the size and variability of existing labeled datasets.

Data Augmentation Emotion Recognition

Paper
Add Code

A Diffeomorphic Flow-based Variational Framework for Multi-speaker Emotion Conversion

no code implementations • 9 Nov 2022 • Ravi Shankar, Hsi-Wei Hsieh, Nicolas Charon, Archana Venkataraman

We term this new architecture a variational CycleGAN (VCGAN).

Paper
Add Code

Knowledge Graph - Deep Learning: A Case Study in Question Answering in Aviation Safety Domain

no code implementations • LREC 2022 • Ankush Agarwal, Raj Gite, Shreya Laddha, Pushpak Bhattacharyya, Satyanarayan Kar, Asif Ekbal, Prabhjit Thind, Rajesh Zele, Ravi Shankar

We construct a Knowledge Graph from Aircraft Accident reports and contribute this resource to the community of researchers.

Natural Language Queries Passage Retrieval +3

Paper
Add Code

Adaptive Speech Duration Modification using a Deep-Generative Framework

no code implementations • 29 Sep 2021 • Ravi Shankar, Archana Venkataraman

We propose the first method to adaptively modify the duration of a given speechsignal.

Decoder Dynamic Time Warping +1

Paper
Add Code

A Deep-Bayesian Framework for Adaptive Speech Duration Modification

no code implementations • 11 Jul 2021 • Ravi Shankar, Archana Venkataraman

During inference, we generate the attention map as a proxy for the similarity matrix between the given input speech and an unknown target speech signal.

Decoder Dynamic Time Warping +1

Paper
Add Code

Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network

no code implementations • 25 Jul 2020 • Ravi Shankar, Hsi-Wei Hsieh, Nicolas Charon, Archana Venkataraman

Finally, the predictor uses the original spectrum and the modified F0 contour to generate a corresponding target spectrum.

Decoder

Paper
Add Code

Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator

no code implementations • 25 Jul 2020 • Ravi Shankar, Jacob Sager, Archana Venkataraman

We introduce a novel method for emotion conversion in speech that does not require parallel training data.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.