Search Results for author: Emanuël A. P. Habets

Found 18 papers, 2 papers with code

Data-driven Joint Detection and Localization of Acoustic Reflectors

no code implementations9 Feb 2024 H. Nazim Bicer, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

Room geometry inference algorithms rely on the localization of acoustic reflectors to identify boundary surfaces of an enclosure.

Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone

no code implementations28 Aug 2023 Cagdas Tuna, Altan Akat, H. Nazim Bicer, Andreas Walther, Emanuël A. P. Habets

Motivated by the increasing popularity of commercially available soundbars, this article presents a data-driven 3D RGI method using RIRs measured from a linear loudspeaker array to a single microphone.

Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TV

no code implementations23 Mar 2023 Matteo Torcoli, Emanuël A. P. Habets

When dialogue and background sounds are not separately available from the production stage, Dialogue Separation (DS) can estimate them to enable personalization.

Action Detection Activity Detection

Beamformer-Guided Target Speaker Extraction

no code implementations15 Mar 2023 Mohamed Elminshawi, Srikanth Raj Chetupalli, Emanuël A. P. Habets

By allowing for time-varying embeddings in the single-channel TSE block, the proposed method fully exploits the correspondence between the front-end beamformer output and the target speech in the microphone signal.

Target Speaker Extraction

Contrastive Representation Learning for Acoustic Parameter Estimation

no code implementations22 Feb 2023 Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

A study is presented in which a contrastive learning approach is used to extract low-dimensional representations of the acoustic environment from single-channel, reverberant speech signals.

Contrastive Learning Data Augmentation +2

Audiovisual Database with 360 Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research

no code implementations27 Dec 2022 Thomas Robotham, Ashutosh Singla, Olli S. Rummukainen, Alexander Raake, Emanuël A. P. Habets

Research into multi-modal perception, human cognition, behavior, and attention can benefit from high-fidelity content that may recreate real-life-like scenes when rendered on head-mounted displays.

AID: Open-source Anechoic Interferer Dataset

1 code implementation5 Aug 2022 Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

A dataset of anechoic recordings of various sound sources encountered in domestic environments is presented.

Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion

no code implementations28 Jun 2022 Ahmad Aloradi, Wolfgang Mack, Mohamed Elminshawi, Emanuël A. P. Habets

Classical speaker verification (SV) approaches estimate a fixed-dimensional embedding from a speech utterance that encodes the speaker's voice characteristics.

Speaker Verification

Blind Reverberation Time Estimation in Dynamic Acoustic Conditions

no code implementations23 Feb 2022 Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

The estimation of reverberation time from real-world signals plays a central role in a wide range of applications.

New Insights on Target Speaker Extraction

no code implementations1 Feb 2022 Mohamed Elminshawi, Wolfgang Mack, Srikanth Raj Chetupalli, Soumitro Chakrabarty, Emanuël A. P. Habets

However, such studies have been conducted on a few datasets and have not considered recent deep neural network architectures for SS that have shown impressive separation performance.

Speaker Separation Target Speaker Extraction

Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms

no code implementations3 Jan 2022 Wolfgang Mack, Julian Wechsler, Emanuël A. P. Habets

The impact of attention on DOA estimators and different training strategies for attention and DOA DNNs are not yet studied in depth.

Direction of Arrival Estimation Speech Enhancement

An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments

no code implementations9 Nov 2020 Shrishti Saha Shetu, Soumitro Chakrabarty, Emanuël A. P. Habets

Audio-visual speech enhancement (AVSE) methods use both audio and visual features for the task of speech enhancement and the use of visual features has been shown to be particularly effective in multi-speaker scenarios.

Optical Flow Estimation Speech Enhancement

Efficient Training Data Generation for Phase-Based DOA Estimation

no code implementations9 Nov 2020 Fabian Hübner, Wolfgang Mack, Emanuël A. P. Habets

By an evaluation using data from measured room impulse responses, we demonstrate that a model trained with the proposed training data generation method performs comparably to models trained with data generated based on the source-image method.

Informed Source Extraction With Application to Acoustic Echo Reduction

no code implementations9 Nov 2020 Mohamed Elminshawi, Wolfgang Mack, Emanuël A. P. Habets

Recent deep learning-based methods leverage a speaker discriminative model that maps a reference snippet uttered by the target speaker into a single embedding vector that encapsulates the characteristics of the target speaker.

Acoustic echo cancellation Speaker Separation

Cannot find the paper you are looking for? You can Submit a new open access paper.