Search Results for author: Shankar M Venkatesan

Found 6 papers, 0 papers with code

Deep Fence Estimation using Stereo Guidance and Adversarial Learning

no code implementations • 3 Jul 2020 • Paritosh Mittal, Shankar M Venkatesan, Viswanath Veera, Aloknath De

People capture memorable images of events and exhibits that are often occluded by a wire mesh loosely termed as fence.

Paper
Add Code

Audio-Visual Decision Fusion for WFST-based and seq2seq Models

no code implementations • 29 Jan 2020 • Rohith Aralikatti, Sharad Roy, Abhinav Thanda, Dilip Kumar Margam, Pujitha Appan Kandala, Tanay Sharma, Shankar M Venkatesan

In this work, we propose novel methods to fuse information from audio and visual modalities at inference time.

speech-recognition Speech Recognition

Paper
Add Code

LipReading with 3D-2D-CNN BLSTM-HMM and word-CTC models

no code implementations • 25 Jun 2019 • Dilip Kumar Margam, Rohith Aralikatti, Tanay Sharma, Abhinav Thanda, Pujitha A K, Sharad Roy, Shankar M Venkatesan

We also verify the method on a second dataset of $81$ speakers which we collected.

Lipreading

Paper
Add Code

Global SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks

no code implementations • 12 Apr 2018 • Rohith Aralikatti, Dilip Margam, Tanay Sharma, Thanda Abhinav, Shankar M Venkatesan

This paper demonstrates two novel methods to estimate the global SNR of speech signals.

speech-recognition Speech Recognition

Paper
Add Code

Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic Speech Recognition

no code implementations • 10 Jan 2017 • Abhinav Thanda, Shankar M Venkatesan

Multi-task learning (MTL) involves the simultaneous training of two or more related tasks over shared representations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Audio Visual Speech Recognition using Deep Recurrent Neural Networks

no code implementations • 9 Nov 2016 • Abhinav Thanda, Shankar M Venkatesan

The frame labels obtained from the acoustic model are then used to perform a non-linear dimensionality reduction of the visual features using a deep bottleneck network.

Audio-Visual Speech Recognition Automatic Speech Recognition +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.