Search Results for author: Tim Ng

Found 7 papers, 1 papers with code

Conformer-Based Speech Recognition On Extreme Edge-Computing Devices

no code implementations • 16 Dec 2023 • MingBin Xu, Alex Jin, Sicheng Wang, Mu Su, Tim Ng, Henry Mason, Shiyi Han, Zhihong Lei Yaqiao Deng, Zhen Huang, Mahesh Krishnamoorthy

With increasingly more powerful compute capabilities and resources in today's devices, traditionally compute-intensive automatic speech recognition (ASR) has been moving from the cloud to devices to better protect user privacy.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Towards Real-World Streaming Speech Translation for Code-Switched Speech

1 code implementation • 19 Oct 2023 • Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal

Code-switching (CS), i. e. mixing different languages in a single sentence, is a common phenomenon in communication and can be challenging in many Natural Language Processing (NLP) settings.

Sentence Translation

Paper
Code

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

no code implementations • 16 Oct 2023 • Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, MingBin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang

Recent advances in deep learning and automatic speech recognition have improved the accuracy of end-to-end speech recognition systems, but recognition of personal content such as contact names remains a challenge.

Automatic Speech Recognition speech-recognition +1

Paper
Add Code

Acoustic Model Fusion for End-to-end Speech Recognition

no code implementations • 10 Oct 2023 • Zhihong Lei, MingBin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu

Recent advances in deep learning and automatic speech recognition (ASR) have enabled the end-to-end (E2E) ASR system and boosted the accuracy to a new level.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

A Treatise On FST Lattice Based MMI Training

no code implementations • 17 Oct 2022 • Adnan Haider, Tim Ng, Zhen Huang, Xingyu Na, Antti Veikko Rosti

Maximum mutual information (MMI) has become one of the two de facto methods for sequence-level training of speech recognition acoustic models.

speech-recognition Speech Recognition

Paper
Add Code

Online Automatic Speech Recognition with Listen, Attend and Spell Model

no code implementations • 12 Aug 2020 • Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal

The Listen, Attend and Spell (LAS) model and other attention-based automatic speech recognition (ASR) models have known limitations when operated in a fully online mode.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

no code implementations • 4 Oct 2019 • Zhen Huang, Tim Ng, Leo Liu, Henry Mason, Xiaodan Zhuang, Daben Liu

The most popular way to train very deep CNNs is to use shortcut connections (SC) together with batch normalization (BN).

Inference Optimization speech-recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.