no code implementations • 16 Oct 2023 • Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, MingBin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang
Recent advances in deep learning and automatic speech recognition have improved the accuracy of end-to-end speech recognition systems, but recognition of personal content such as contact names remains a challenge.
no code implementations • 10 Oct 2023 • Zhihong Lei, MingBin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu
Recent advances in deep learning and automatic speech recognition (ASR) have enabled the end-to-end (E2E) ASR system and boosted the accuracy to a new level.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 5 Oct 2023 • Leonardo Emili, Thiago Fraga-Silva, Ernest Pusateri, Markus Nußbaum-Thom, Youssef Oualil
We study model pruning methods applied to Transformer-based neural network language models for automatic speech recognition.
no code implementations • 29 Jun 2022 • Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin
Virtual assistants make use of automatic speech recognition (ASR) to help users answer entity-centric queries.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 21 Jun 2021 • Mandana Saebi, Ernest Pusateri, Aaksha Meghawat, Christophe Van Gysel
High-quality automatic speech recognition (ASR) is essential for virtual assistants (VAs) to work well.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 14 Feb 2021 • Sashank Gondala, Lyan Verwimp, Ernest Pusateri, Manos Tsagkias, Christophe Van Gysel
We customize entropy pruning by allowing for a keep list of infrequent n-grams that require a more relaxed pruning threshold, and propose three methods to construct the keep list.
no code implementations • 26 May 2020 • Christophe Van Gysel, Manos Tsagkias, Ernest Pusateri, Ilya Oparin
We focus on improving the effectiveness of a Virtual Assistant (VA) in recognizing emerging entities in spoken queries.
no code implementations • 26 Aug 2019 • Ernest Pusateri, Christophe Van Gysel, Rami Botros, Sameer Badaskar, Mirko Hannemann, Youssef Oualil, Ilya Oparin
In this work, we uncover a theoretical connection between two language model interpolation techniques, count merging and Bayesian interpolation.