Search Results for author: Vinay Kothapally

Found 6 papers, 0 papers with code

Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation

no code implementations22 Nov 2022 Vinay Kothapally, John H. L. Hansen

Several speech processing systems have demonstrated considerable performance improvements when deep complex neural networks (DCNN) are coupled with self-attention (SA) networks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking

no code implementations22 Nov 2022 Vinay Kothapally, J. H. L. Hansen

With the advancements in deep learning approaches, the performance of speech enhancing systems in the presence of background noise have shown significant improvements.

Speech Dereverberation

Deep Neural Mel-Subband Beamformer for In-car Speech Separation

no code implementations22 Nov 2022 Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

While current deep learning (DL)-based beamforming techniques have been proved effective in speech separation, they are often designed to process narrow-band (NB) frequencies independently which results in higher computational costs and inference times, making them unsuitable for real-world use.

Speech Separation

Joint Neural AEC and Beamforming with Double-Talk Detection

no code implementations9 Nov 2021 Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

We train the proposed model in an end-to-end approach to eliminate background noise and echoes from far-end audio devices, which include nonlinear distortions.

Acoustic echo cancellation Denoising +2

Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition

no code implementations15 Oct 2019 Salar Jafarlou, Soheil Khorram, Vinay Kothapally, John H. L. Hansen

In the present study, we address this issue by investigating variants of large receptive field CNNs (LRF-CNNs) which include deeply recursive networks, dilated convolutional neural networks, and stacked hourglass networks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.