Search Results for author: Kunal Dhawan

Found 9 papers, 3 papers with code

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

no code implementations18 Oct 2023 Tae Jin Park, He Huang, Ante Jukic, Kunal Dhawan, Krishna C. Puvvada, Nithin Koluguri, Nikolay Karpov, Aleksandr Laptev, Jagadeesh Balam, Boris Ginsburg

We present the NVIDIA NeMo team's multi-channel speech recognition system for the 7th CHiME Challenge Distant Automatic Speech Recognition (DASR) Task, focusing on the development of a multi-channel, multi-speaker speech recognition system tailored to transcribe speech from distributed microphones and microphone arrays.

Automatic Speech Recognition speaker-diarization +3

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition

no code implementations19 Sep 2023 Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg

Discrete audio representation, aka audio tokenization, has seen renewed interest driven by its potential to facilitate the application of text language modeling approaches in audio domain.

Language Modelling Quantization +4

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

no code implementations11 Sep 2023 Tae Jin Park, Kunal Dhawan, Nithin Koluguri, Jagadeesh Balam

In addition, these findings point to the potential of using LLMs to improve speaker diarization and other speech processing tasks by capturing semantic and contextual cues.

speaker-diarization Speaker Diarization

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer

1 code implementation14 Jun 2023 Kunal Dhawan, Dima Rekesh, Boris Ginsburg

Code-Switching (CS) multilingual Automatic Speech Recognition (ASR) models can transcribe speech containing two or more alternating languages during a conversation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Phonetic Word Embeddings

1 code implementation30 Sep 2021 Rahul Sharma, Kunal Dhawan, Balakrishna Pailla

This work presents a novel methodology for calculating the phonetic similarity between words taking motivation from the human perception of sounds.

Benchmarking Word Embeddings

Joint Language Identification of Code-Switching Speech using Attention based E2E Network

no code implementations15 Jul 2019 Sreeram Ganji, Kunal Dhawan, Kumar Priyadarshi, Rohit Sinha

For the automatic recognition of code-switching speech, the conventional approaches often employ an LID system for detecting the languages present within an utterance.

Language Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.