Search Results for author: Hangting Chen

Found 11 papers, 4 papers with code

Gull: A Generative Multifunctional Audio Codec

no code implementations • 7 Apr 2024 • Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng

We introduce Gull, a generative multifunctional audio codec.

Audio Compression Audio Source Separation +3

Paper
Add Code

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation

no code implementations • 24 Dec 2023 • Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng

In this paper, we present CaRE-SEP, a consistent and relevant embedding network for general sound separation to encourage a comprehensive reconsideration of query usage in audio separation.

Paper
Add Code

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

no code implementations • 25 Sep 2023 • Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang

To address this issue, we introduce an automatic in-the-wild speech data preprocessing framework (AutoPrep) in this paper, which is designed to enhance speech quality, generate speaker labels, and produce transcriptions automatically.

Automatic Speech Recognition Speech Enhancement +3

Paper
Add Code

Complexity Scaling for Speech Denoising

no code implementations • 14 Sep 2023 • Hangting Chen, Jianwei Yu, Chao Weng

A series of MPT networks present high performance covering a wide range of computational complexities on the DNS challenge dataset.

Denoising Speech Denoising

Paper
Add Code

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

1 code implementation • 21 Aug 2023 • Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng

Echo cancellation and noise reduction are essential for full-duplex communication, yet most existing neural networks have high computational costs and are inflexible in tuning model complexity.

Dimensionality Reduction

Paper
Code

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction

1 code implementation • 19 Aug 2023 • Jinchuan Tian, Jianwei Yu, Hangting Chen, Brian Yan, Chao Weng, Dong Yu, Shinji Watanabe

While the vanilla transducer does not have a prior preference for any of the valid paths, this work intends to enforce the preferred paths and achieve controllable alignment prediction.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

7,891

Paper
Code

High Fidelity Speech Enhancement with Band-split RNN

1 code implementation • 1 Dec 2022 • Jianwei Yu, Yi Luo, Hangting Chen, Rongzhi Gu, Chao Weng

Despite the rapid progress in speech enhancement (SE) research, enhancing the quality of desired speech in environments with strong noise and interfering speakers remains challenging.

Speech Enhancement Vocal Bursts Intensity Prediction

Paper
Code

Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output

1 code implementation • 5 Feb 2021 • Hangting Chen, Yang Yi, Dang Feng, Pengyuan Zhang

The proposed framework facilitates iterative signal refinement with the guide of beamforming and seeks to reach the upper bound of the MVDR-based methods.

blind source separation Speech Separation

Paper
Code

Power pooling: An adaptive pooling function for weakly labelled sound event detection

no code implementations • 20 Oct 2020 • Yuzhuo Liu, Hangting Chen, YunWang, Pengyuan Zhang

While this paper focuses on sound event detection applications, the proposed method can be applied to MIL tasks in other domains.

Event Detection Multiple Instance Learning +1

Paper
Add Code

Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment

no code implementations • 1 Jul 2020 • Hangting Chen, Pengyuan Zhang

Deep attractor networks (DANs) perform speech separation with discriminative embeddings and speaker attractors.

Speech Separation

Paper
Add Code

Integrating the Data Augmentation Scheme with Various Classifiers for Acoustic Scene Modeling

no code implementations • 15 Jul 2019 • Hangting Chen, Zuozhen Liu, Zongming Liu, Pengyuan Zhang, Yonghong Yan

This technical report describes the IOA team's submission for TASK1A of DCASE2019 challenge.

Acoustic Scene Classification Data Augmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.