no code implementations • 31 May 2023 • Kaousheik Jayakumar, Vrunda N. Sukhadia, A. Arunkumar, S. Umesh
Building a multilingual Automatic Speech Recognition (ASR) system in a linguistically diverse country like India is challenging because of differences in scripts and the limited availability of speech data.
no code implementations • 3 Nov 2022 • Vrunda N. Sukhadia, A. Arunkumar, S. Umesh
Our method gives a relative improvement of ~4% over the joint encoder-decoder self-supervised model built with simple pooling of data, which serves as our baseline.
no code implementations • 18 Feb 2022 • Vrunda N. Sukhadia, S. Umesh
We therefore propose using the embeddings tapped from these encoder layers as features for a downstream Conformer target-domain model, and show that they provide significant improvements.