no code implementations • 24 Apr 2023 • Kristina Tesch, Timo Gerkmann
In a multi-channel separation task with multiple speakers, we aim to recover all individual speech signals from the mixture.
no code implementations • 4 Nov 2022 • Kristina Tesch, Timo Gerkmann
In a scenario with multiple persons talking simultaneously, the spatial characteristics of the signals are the most distinct feature for extracting the target signal.
1 code implementation • 27 Jun 2022 • Kristina Tesch, Timo Gerkmann
The key advantage of using multiple microphones for speech enhancement is that spatial filtering can be used to complement the tempo-spectral processing.
1 code implementation • 22 Jun 2022 • Kristina Tesch, Nils-Hendrik Mohrmann, Timo Gerkmann
Employing deep neural networks (DNNs) to directly learn filters for multi-channel speech enhancement has potentially two key advantages over a traditional approach combining a linear spatial filter with an independent tempo-spectral post-filter: 1) non-linear spatial filtering allows to overcome potential restrictions originating from a linear processing model and 2) joint processing of spatial and tempo-spectral information allows to exploit interdependencies between different sources of information.
no code implementations • 22 Apr 2021 • Kristina Tesch, Timo Gerkmann
Rather, the MMSE optimal filter is a joint spatial and spectral nonlinear function.