Search Results for author: Qiujia Li

Found 18 papers, 4 papers with code

Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR

no code implementations17 Jan 2024 Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman

Meanwhile, the heterogeneous nature and imbalanced data abundance of different languages may cause performance degradation, leading to asynchronous peak performance for different languages during training, especially on tail ones.

Massive End-to-end Models for Short Search Queries

no code implementations22 Sep 2023 Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara Sainath, Pedro Moreno Mengibar

In this work, we investigate two popular end-to-end automatic speech recognition (ASR) models, namely Connectionist Temporal Classification (CTC) and RNN-Transducer (RNN-T), for offline recognition of voice search queries, with up to 2B model parameters.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition

no code implementations20 Mar 2023 Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland

The performance of the student model can be further enhanced when multiple teachers are used jointly, achieving word error rate reductions (WERRs) of 17. 5% and 10. 6%.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-trained Models

no code implementations7 Oct 2021 Xiaoyu Yang, Qiujia Li, Philip C. Woodland

Self-supervised pre-training is an effective approach to leveraging a large amount of unlabelled data to reduce word error rates (WERs) of automatic speech recognition (ASR) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

no code implementations7 Oct 2021 Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland

As end-to-end automatic speech recognition (ASR) models reach promising performance, various downstream tasks rely on good confidence estimators for these systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition

1 code implementation1 Jul 2021 Qiujia Li, Chao Zhang, Philip C. Woodland

Commonly used automatic speech recognition (ASR) systems can be classified into frame-synchronous and label-synchronous categories, based on whether the speech is decoded on a per-frame or per-label basis.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Discriminative Neural Clustering for Speaker Diarisation

1 code implementation22 Oct 2019 Qiujia Li, Florian L. Kreyssig, Chao Zhang, Philip C. Woodland

In this paper, we propose Discriminative Neural Clustering (DNC) that formulates data clustering with a maximum number of clusters as a supervised sequence-to-sequence learning problem.

Clustering Data Augmentation

Integrating Source-channel and Attention-based Sequence-to-sequence Models for Speech Recognition

no code implementations14 Sep 2019 Qiujia Li, Chao Zhang, Philip C. Woodland

This paper proposes a novel automatic speech recognition (ASR) framework called Integrated Source-Channel and Attention (ISCA) that combines the advantages of traditional systems based on the noisy source-channel model (SC) and end-to-end style systems using attention-based sequence-to-sequence models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks

no code implementations30 Oct 2018 Anton Ragni, Qiujia Li, Mark Gales, Yu Wang

These errors are not accounted for by the standard confidence estimation schemes and are hard to rectify in the upstream and downstream processing.

Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation

4 code implementations30 Oct 2018 Qiujia Li, Preben Ness, Anton Ragni, Mark Gales

The standard approach to mitigate errors made by an automatic speech recognition system is to use confidence scores associated with each predicted word.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Shape and Material from Sound

no code implementations NeurIPS 2017 Zhoutong Zhang, Qiujia Li, Zhengjia Huang, Jiajun Wu, Josh Tenenbaum, Bill Freeman

Hearing an object falling onto the ground, humans can recover rich information including its rough shape, material, and falling height.

Object

Cannot find the paper you are looking for? You can Submit a new open access paper.