Search Results for author: Anusha Prakash

Found 6 papers, 0 papers with code

Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

no code implementations • 13 Feb 2023 • Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense.

Speech Synthesis

Paper
Add Code

HMM-based data augmentation for E2E systems for building conversational speech synthesis systems

no code implementations • 22 Dec 2022 • Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

This paper proposes an approach to build a high-quality text-to-speech (TTS) system for technical domains using data augmentation.

Data Augmentation Language Modelling +1

Paper
Add Code

Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

no code implementations • 1 Nov 2022 • Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal

Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video.

Chunking Speech Synthesis +1

Paper
Add Code

The Importance of Accurate Alignments in End-to-End Speech Synthesis

no code implementations • 31 Oct 2022 • Anusha Prakash, Hema A Murthy

With the arrival of end-to-end (E2E) systems, it was observed that very good quality speech could be synthesised with large amounts of data.

Speech Synthesis

Paper
Add Code

Dual Script E2E framework for Multilingual and Code-Switching ASR

no code implementations • 2 Jun 2021 • Mari Ganesh Kumar, Jom Kuriakose, Anand Thyagachandran, Arun Kumar A, Ashish Seth, Lodagala Durga Prasad, Saish Jaiswal, Anusha Prakash, Hema Murthy

In the first system, the E2E model is trained on the CLS representation, and we use a novel data-driven back-end to recover the native language script.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Multi-Relational Question Answering from Narratives: Machine Reading and Reasoning in Simulated Worlds

no code implementations • ACL 2018 • Igor Labutov, Bishan Yang, Anusha Prakash, Amos Azaria

Question Answering (QA), as a research field, has primarily focused on either knowledge bases (KBs) or free text as a source of knowledge.

Question Answering Reading Comprehension

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.