Search Results for author: Sebastian Braun

Found 20 papers, 9 papers with code

ICASSP 2024 Speech Signal Improvement Challenge

no code implementations25 Jan 2024 Nicolae Catalin Ristea, Ando Saabas, Ross Cutler, Babak Naderi, Sebastian Braun, Solomiya Branets

The ICASSP 2024 Speech Signal Improvement Grand Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems.

CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

no code implementations8 Dec 2023 Ruihan Yang, Hannes Gamper, Sebastian Braun

We introduce a multi-modal diffusion model tailored for the bi-directional conditional generation of video and audio.

Audio Generation

Adapting Frechet Audio Distance for Generative Music Evaluation

2 code implementations2 Nov 2023 Azalea Gui, Hannes Gamper, Sebastian Braun, Dimitra Emmanouilidou

The growing popularity of generative music models underlines the need for perceptually relevant, objective music quality metrics.

FAD

ICASSP 2023 Acoustic Echo Cancellation Challenge

1 code implementation22 Sep 2023 Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as well as including a full-band version of AECMOS.

Acoustic echo cancellation Speech Enhancement

Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments

no code implementations14 Mar 2023 Julian Neri, Sebastian Braun

While large state-of-the-art DNNs can achieve excellent separation from anechoic mixtures of speech, the main challenge is to create compact and causal models that can separate reverberant mixtures at inference time.

Speech Separation

ICASSP 2023 Speech Signal Improvement Challenge

no code implementations12 Mar 2023 Ross Cutler, Ando Saabas, Babak Naderi, Nicolae-Cătălin Ristea, Sebastian Braun, Solomiya Branets

The ICASSP 2023 Speech Signal Improvement Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems.

Task splitting for DNN-based acoustic echo and noise removal

no code implementations13 May 2022 Sebastian Braun, Maria Luis Valero

Neural networks have led to tremendous performance gains for single-task speech enhancement, such as noise suppression and acoustic echo cancellation (AEC).

Acoustic echo cancellation Speech Enhancement

ICASSP 2022 Acoustic Echo Cancellation Challenge

1 code implementation27 Feb 2022 Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Hannes Gamper, Sebastian Braun, Karsten Sørensen, Robert Aichner

This is the third AEC challenge and it is enhanced by including mobile scenarios, adding speech recognition rate in the challenge goal metrics, and making the default sample rate 48 kHz.

Acoustic echo cancellation Speech Enhancement +2

ICASSP 2022 Deep Noise Suppression Challenge

1 code implementation27 Feb 2022 Harishchandra Dubey, Vishak Gopal, Ross Cutler, Ashkan Aazami, Sergiy Matusevych, Sebastian Braun, Sefik Emre Eskimez, Manthan Thakker, Takuya Yoshioka, Hannes Gamper, Robert Aichner

We open-source datasets and test sets for researchers to train their deep noise suppression models, as well as a subjective evaluation framework based on ITU-T P. 835 to rate and rank-order the challenge entries.

Effect of noise suppression losses on speech distortion and ASR performance

no code implementations23 Nov 2021 Sebastian Braun, Hannes Gamper

Deep learning based speech enhancement has made rapid development towards improving quality, while models are becoming more compact and usable for real-time on-the-edge inference.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses

no code implementations16 Nov 2021 Viet Anh Trinh, Sebastian Braun

Our results show that the proposed function effectively improves the speech enhancement performance compared to a baseline trained in a supervised way on the noisy VoxCeleb dataset.

Disentanglement Speech Enhancement +2

Performance optimizations on deep noise suppression models

no code implementations8 Oct 2021 Jerry Chee, Sebastian Braun, Vishak Gopal, Ross Cutler

We study the role of magnitude structured pruning as an architecture search to speed up the inference time of a deep noise suppression (DNS) model.

Low complexity online convolutional beamforming

no code implementations14 Jul 2021 Sebastian Braun, Ivan Tashev

Convolutional beamformers integrate the multichannel linear prediction model into beamformers, which provide good performance and optimality for joint dereverberation and noise reduction tasks.

On training targets for noise-robust voice activity detection

no code implementations15 Feb 2021 Sebastian Braun, Ivan Tashev

The task of voice activity detection (VAD) is an often required module in various speech processing, analysis and classification tasks.

Action Detection Activity Detection

Towards efficient models for real-time deep noise suppression

no code implementations22 Jan 2021 Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev

It is shown that the achievable speech quality is a function of network complexity, and show which models have better tradeoffs.

Speech Enhancement

Interspeech 2021 Deep Noise Suppression Challenge

2 code implementations6 Jan 2021 Chandan K A Reddy, Harishchandra Dubey, Kazuhito Koishida, Arun Nair, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan

In this version of the challenge organized at INTERSPEECH 2021, we are expanding both our training and test datasets to accommodate full band scenarios.

Denoising

DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation

2 code implementations22 Oct 2020 Ali Aroudi, Sebastian Braun

Many deep learning techniques are available to perform source separation and reduce background noise.

ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets and Testing Framework

1 code implementation10 Sep 2020 Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios.

Acoustic echo cancellation Audio and Speech Processing Sound

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results

1 code implementation16 May 2020 Chandan K. A. Reddy, Vishak Gopal, Ross Cutler, Ebrahim Beyrami, Roger Cheng, Harishchandra Dubey, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-sourced a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Speech Quality and Testing Framework

1 code implementation23 Jan 2020 Chandan K. A. Reddy, Ebrahim Beyrami, Harishchandra Dubey, Vishak Gopal, Roger Cheng, Ross Cutler, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-source a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.