Search Results for author: Sebastian Braun

Found 20 papers, 9 papers with code

ICASSP 2024 Speech Signal Improvement Challenge

no code implementations • 25 Jan 2024 • Nicolae Catalin Ristea, Ando Saabas, Ross Cutler, Babak Naderi, Sebastian Braun, Solomiya Branets

The ICASSP 2024 Speech Signal Improvement Grand Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems.

Paper
Add Code

CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

no code implementations • 8 Dec 2023 • Ruihan Yang, Hannes Gamper, Sebastian Braun

We introduce a multi-modal diffusion model tailored for the bi-directional conditional generation of video and audio.

Audio Generation

Paper
Add Code

Adapting Frechet Audio Distance for Generative Music Evaluation

2 code implementations • 2 Nov 2023 • Azalea Gui, Hannes Gamper, Sebastian Braun, Dimitra Emmanouilidou

The growing popularity of generative music models underlines the need for perceptually relevant, objective music quality metrics.

FAD

Paper
Code

ICASSP 2023 Acoustic Echo Cancellation Challenge

1 code implementation • 22 Sep 2023 • Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as well as including a full-band version of AECMOS.

Acoustic echo cancellation Speech Enhancement

340

Paper
Code

Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments

no code implementations • 14 Mar 2023 • Julian Neri, Sebastian Braun

While large state-of-the-art DNNs can achieve excellent separation from anechoic mixtures of speech, the main challenge is to create compact and causal models that can separate reverberant mixtures at inference time.

Speech Separation

Paper
Add Code

ICASSP 2023 Speech Signal Improvement Challenge

no code implementations • 12 Mar 2023 • Ross Cutler, Ando Saabas, Babak Naderi, Nicolae-Cătălin Ristea, Sebastian Braun, Solomiya Branets

The ICASSP 2023 Speech Signal Improvement Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems.

Paper
Add Code

Task splitting for DNN-based acoustic echo and noise removal

no code implementations • 13 May 2022 • Sebastian Braun, Maria Luis Valero

Neural networks have led to tremendous performance gains for single-task speech enhancement, such as noise suppression and acoustic echo cancellation (AEC).

Acoustic echo cancellation Speech Enhancement

Paper
Add Code

ICASSP 2022 Acoustic Echo Cancellation Challenge

1 code implementation • 27 Feb 2022 • Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Hannes Gamper, Sebastian Braun, Karsten Sørensen, Robert Aichner

This is the third AEC challenge and it is enhanced by including mobile scenarios, adding speech recognition rate in the challenge goal metrics, and making the default sample rate 48 kHz.

Acoustic echo cancellation Speech Enhancement +2

340

Paper
Code

ICASSP 2022 Deep Noise Suppression Challenge

1 code implementation • 27 Feb 2022 • Harishchandra Dubey, Vishak Gopal, Ross Cutler, Ashkan Aazami, Sergiy Matusevych, Sebastian Braun, Sefik Emre Eskimez, Manthan Thakker, Takuya Yoshioka, Hannes Gamper, Robert Aichner

We open-source datasets and test sets for researchers to train their deep noise suppression models, as well as a subjective evaluation framework based on ITU-T P. 835 to rate and rank-order the challenge entries.

973

Paper
Code

Effect of noise suppression losses on speech distortion and ASR performance

no code implementations • 23 Nov 2021 • Sebastian Braun, Hannes Gamper

Deep learning based speech enhancement has made rapid development towards improving quality, while models are becoming more compact and usable for real-time on-the-edge inference.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses

no code implementations • 16 Nov 2021 • Viet Anh Trinh, Sebastian Braun

Our results show that the proposed function effectively improves the speech enhancement performance compared to a baseline trained in a supervised way on the noisy VoxCeleb dataset.

Disentanglement Speech Enhancement +2

Paper
Add Code

Performance optimizations on deep noise suppression models

no code implementations • 8 Oct 2021 • Jerry Chee, Sebastian Braun, Vishak Gopal, Ross Cutler

We study the role of magnitude structured pruning as an architecture search to speed up the inference time of a deep noise suppression (DNS) model.

Paper
Add Code

Low complexity online convolutional beamforming

no code implementations • 14 Jul 2021 • Sebastian Braun, Ivan Tashev

Convolutional beamformers integrate the multichannel linear prediction model into beamformers, which provide good performance and optimality for joint dereverberation and noise reduction tasks.

Paper
Add Code

On training targets for noise-robust voice activity detection

no code implementations • 15 Feb 2021 • Sebastian Braun, Ivan Tashev

The task of voice activity detection (VAD) is an often required module in various speech processing, analysis and classification tasks.

Action Detection Activity Detection

Paper
Add Code

Towards efficient models for real-time deep noise suppression

no code implementations • 22 Jan 2021 • Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev

It is shown that the achievable speech quality is a function of network complexity, and show which models have better tradeoffs.

Speech Enhancement

Paper
Add Code

Interspeech 2021 Deep Noise Suppression Challenge

2 code implementations • 6 Jan 2021 • Chandan K A Reddy, Harishchandra Dubey, Kazuhito Koishida, Arun Nair, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan

In this version of the challenge organized at INTERSPEECH 2021, we are expanding both our training and test datasets to accommodate full band scenarios.

Denoising

973

Paper
Code

DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation

2 code implementations • 22 Oct 2020 • Ali Aroudi, Sebastian Braun

Many deep learning techniques are available to perform source separation and reduce background noise.

Paper
Code

ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets and Testing Framework

1 code implementation • 10 Sep 2020 • Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios.

Acoustic echo cancellation Audio and Speech Processing Sound

340

Paper
Code

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results

1 code implementation • 16 May 2020 • Chandan K. A. Reddy, Vishak Gopal, Ross Cutler, Ebrahim Beyrami, Roger Cheng, Harishchandra Dubey, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-sourced a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement

973

Paper
Code

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Speech Quality and Testing Framework

1 code implementation • 23 Jan 2020 • Chandan K. A. Reddy, Ebrahim Beyrami, Harishchandra Dubey, Vishak Gopal, Roger Cheng, Ross Cutler, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-source a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement

973

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.