Search Results for author: Yingying Gao

Found 8 papers, 1 paper with code

Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network

no code implementations • 20 Feb 2024 • Yanan Chen, Zihao Cui, Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

In this study, we present a novel weighting prediction approach, which explicitly learns the task relationships from downstream training information to address the core challenge of universal speech enhancement.

Data Augmentation, Speech Enhancement

GenDistiller: Distilling Pre-trained Language Models based on Generative Models

no code implementations • 20 Oct 2023 • Yingying Gao, Shilei Zhang, Zihao Cui, Yanhan Xu, Chao Deng, Junlan Feng

Self-supervised pre-trained models such as HuBERT and WavLM leverage unlabeled speech data for representation learning and offer significant improvements on numerous downstream tasks.

Knowledge Distillation, Language Modelling, +1

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

no code implementations • 26 Jun 2022 • Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

Spoken language understanding (SLU) treats automatic speech recognition (ASR) and natural language understanding (NLU) as a unified task and usually suffers from data scarcity.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +4

Multiple Confidence Gates For Joint Training Of SE And ASR

no code implementations • 1 Apr 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Joint training of a speech enhancement (SE) model and an automatic speech recognition (ASR) model is a common solution for robust ASR in noisy environments.

Robust Speech Recognition, Speech Enhancement, +1

Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE

no code implementations • 25 Feb 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Yanan Chen, Junlan Feng, Shilei Zhang

Therefore, we previously proposed a harmonic gated compensation network (HGCN) to predict the full harmonic locations based on the unmasked harmonics and process the result of a coarse enhancement module to recover the masked harmonics.

HGCN: Harmonic gated compensation network for speech enhancement

1 code implementation • 30 Jan 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Mask processing in the time-frequency (T-F) domain with neural networks has been one of the mainstream approaches to single-channel speech enhancement.

Action Detection, Activity Detection, +1
