Search Results for author: Leibny Paola Garcia

Found 11 papers, 4 papers with code

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

no code implementations • 9 Mar 2024 • Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe

Performance evaluation using large language models reveals the advantage of the linguistic hint by achieving 14. 1% and 5. 5% relative improvement on test sets of the ASRU and SEAME datasets, respectively.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

no code implementations • 16 Feb 2024 • Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

Recently, Denoising Diffusion Probabilistic Models (DDPMs) have attained leading performances across a diverse range of generative tasks.

Denoising Speech Enhancement +1

Paper
Add Code

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

1 code implementation • 27 Nov 2023 • Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, WenHan Chao, Leibny Paola Garcia

There is a positive correlation between PSR scores and ASR performance, suggesting that phonetic information extracted by monolingual SSL models can be used for downstream tasks in cross-lingual settings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

Enhancing Code-switching Speech Recognition with Interactive Language Biases

no code implementations • 29 Sep 2023 • Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur

Languages usually switch within a multilingual speech signal, especially in a bilingual society.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

1 code implementation • 26 Sep 2023 • Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola Garcia, Amir Manbachi

While significant advancements in artificial intelligence (AI) have catalyzed progress across various domains, its full potential in understanding visual perception remains underexplored.

Brain Computer Interface

Paper
Code

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts

no code implementations • 1 Jun 2023 • Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola Garcia, Daniel Povey, Sanjeev Khudanpur

Imperfectly transcribed speech is a prevalent issue in human-annotated speech corpora, which degrades the performance of ASR models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

EURO: ESPnet Unsupervised ASR Open-source Toolkit

1 code implementation • 30 Nov 2022 • Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola Garcia, Hung-Yi Lee, Shinji Watanabe, Sanjeev Khudanpur

This paper describes the ESPnet Unsupervised ASR Open-source Toolkit (EURO), an end-to-end open-source toolkit for unsupervised automatic speech recognition (UASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

7,891

Paper
Code

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

1 code implementation • 26 Oct 2022 • Hexin Liu, HaiHua Xu, Leibny Paola Garcia, Andy W. H. Khong, Yi He, Sanjeev Khudanpur

The comparison of the proposed methods indicates that incorporating language information is more effective than disentangling for reducing language confusion in CS speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters

no code implementations • 21 Oct 2022 • Yu Xuan, Xiangyu Zhang, Shuyue Stella Li, Zihan Shen, Xin Xie, Leibny Paola Garcia, Roberto Togneri

Compared with the state-of-the-art MSF-ANC method, CRLS shows improved performance.

Paper
Add Code

PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection

no code implementations • 6 Oct 2022 • Shuyue Stella Li, Xiangyu Zhang, Shu Zhou, Hongchao Shu, Ruixing Liang, Hexin Liu, Leibny Paola Garcia

In this work, we propose a highly Portable Quantum Language Model (PQLM) that can easily transmit information to downstream tasks on classical machines.

Language Modelling Sentence Embedding +3

Paper
Add Code

End-to-End Lyrics Recognition with Self-supervised Learning

no code implementations • 26 Sep 2022 • Xiangyu Zhang, Shuyue Stella Li, Zhanhong He, Roberto Togneri, Leibny Paola Garcia

Lyrics recognition is an important task in music processing.

Contrastive Learning Domain Generalization +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.