no code implementations • COLING (MWE) 2020 • Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa
When repairing a device, humans employ a series of tools that corresponds to the arrangement of the device components.
no code implementations • 26 May 2023 • Matías Pizarro, Dorothea Kolossa, Asja Fischer
Adversarial attacks can mislead automatic speech recognition (ASR) systems into predicting an arbitrary target text, thus posing a clear security threat.
Automatic Speech Recognition (ASR)
1 code implementation • SemEval (NAACL) 2022 • Wentao Yu, Benedikt Boenninghoff, Jonas Roehrig, Dorothea Kolossa
This work presents an ensemble system based on various uni-modal and bi-modal model architectures developed for SemEval 2022 Task 5: MAMI (Multimedia Automatic Misogyny Identification).
no code implementations • 14 Dec 2021 • Matias Pizarro, Dorothea Kolossa, Asja Fischer
We perform an empirical analysis of hybrid ASR models trained on data pre-processed in such a way.
Automatic Speech Recognition (ASR)
1 code implementation • 30 Sep 2021 • Wentao Yu, Jan Freiwald, Sören Tewes, Fabien Huennemeyer, Dorothea Kolossa
We discuss the outcomes of these systems, which are very similar and show only small improvements, pointing to a need for a deeper understanding of federated learning for speech recognition.
no code implementations • 10 Sep 2021 • Wentao Yu, Steffen Zeiler, Dorothea Kolossa
To address the inherent difficulties, we propose a new fusion strategy: a recurrent integration network is trained to fuse the state posteriors of multiple single-modality models, guided by a set of model-based and signal-based stream reliability measures.
1 code implementation • 30 Jun 2021 • Benedikt Boenninghoff, Robert M. Nickel, Dorothea Kolossa
The PAN 2021 authorship verification (AV) challenge is part of a three-year strategy, moving from a cross-topic/closed-set AV task to a cross-topic/open-set AV task over a collection of fanfiction texts.
1 code implementation • 21 Jun 2021 • Benedikt Boenninghoff, Dorothea Kolossa, Robert M. Nickel
We address two fundamental problems in authorship verification (AV): topic variability and miscalibration.
1 code implementation • 7 Jun 2021 • Christopher Schymura, Benedikt Bönninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa
Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e.g., a microphone array).
no code implementations • 19 Apr 2021 • Wentao Yu, Steffen Zeiler, Dorothea Kolossa
While audio-visual speech recognition can significantly improve the recognition rate of end-to-end models in such poor conditions, it is not obvious how to best utilize any available information on acoustic and visual signal quality and reliability in these models.
no code implementations • 1 Mar 2021 • Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa
The detection of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques.
1 code implementation • 28 Feb 2021 • Christopher Schymura, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa
Here, attention captures temporal dependencies in the audio signal by focusing on specific frames that are relevant for estimating the activity and direction-of-arrival of sound events at the current time step.
Automatic Speech Recognition (ASR)
1 code implementation • 23 Feb 2021 • Julio Wissing, Benedikt Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura
Estimating the positions of multiple speakers can be helpful for tasks like automatic speech recognition or speaker diarization.
Automatic Speech Recognition (ASR)
1 code implementation • 10 Feb 2021 • Thorsten Eisenhofer, Lea Schönherr, Joel Frank, Lars Speckemeier, Dorothea Kolossa, Thorsten Holz
In this paper, we propose a different perspective: we accept the presence of adversarial examples against ASR systems, but we require them to be perceivable by human listeners.
Automatic Speech Recognition (ASR)
no code implementations • COLING 2020 • Benedikt Boenninghoff, Steffen Zeiler, Robert Nickel, Dorothea Kolossa
In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task.
1 code implementation • 21 Oct 2020 • Hojjat Aghakhani, Lea Schönherr, Thorsten Eisenhofer, Dorothea Kolossa, Thorsten Holz, Christopher Kruegel, Giovanni Vigna
In a more realistic scenario, when the target audio waveform is played over the air in different rooms, VENOMAVE maintains a success rate of up to 73.3%.
Automatic Speech Recognition (ASR)
no code implementations • 16 Oct 2020 • Mahdie Karbasi, Stefan Bleeck, Dorothea Kolossa
The estimation of speech intelligibility is still far from being a solved problem.
Automatic Speech Recognition (ASR)
no code implementations • 23 Aug 2020 • Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa
The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts.
no code implementations • 28 May 2020 • Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa
In this work, we extend the Gaussian model for the VAE to a Student-$t$ model, which allows independent control of the "heaviness" of the respective tails of the implied probability densities.
1 code implementation • 24 May 2020 • Sina Däubener, Lea Schönherr, Asja Fischer, Dorothea Kolossa
The neural networks for uncertainty quantification simultaneously diminish the vulnerability to the attack, which is reflected in a lower recognition accuracy of the malicious target text in comparison to a standard hybrid ASR system.
Automatic Speech Recognition (ASR)
no code implementations • LREC 2020 • Nima Nabizadeh, Dorothea Kolossa, Martin Heckmann
In this paper, we therefore focus on information extraction (IE) from the instructional text in repair manuals.
1 code implementation • ICML 2020 • Joel Frank, Thorsten Eisenhofer, Lea Schönherr, Asja Fischer, Dorothea Kolossa, Thorsten Holz
Based on this analysis, we demonstrate how the frequency representation can be used to identify deep fake images in an automated way, surpassing state-of-the-art methods.
no code implementations • 12 Dec 2019 • Ahmed Hussen Abdelaziz, Shuo-Yiin Chang, Nelson Morgan, Erik Edwards, Dorothea Kolossa, Dan Ellis, David A. Moses, Edward F. Chang
The emerging field of neural speech recognition (NSR) using electrocorticography has recently attracted remarkable research interest for studying how human brains recognize speech in quiet and noisy surroundings.
Automatic Speech Recognition (ASR)
2 code implementations • 17 Oct 2019 • Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel
Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not.
no code implementations • WS 2019 • Diana Kleingarn, Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa
Humans use a variety of approaches to reference objects in the external world, including verbal descriptions, hand and head gestures, eye gaze, or any combination of these.
2 code implementations • 20 Aug 2019 • Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa
Authorship verification tries to answer the question of whether two documents with unknown authors were written by the same author.
no code implementations • 5 Aug 2019 • Lea Schönherr, Thorsten Eisenhofer, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa
In this paper, we demonstrate the first algorithm that produces generic adversarial examples, which remain robust in an over-the-air attack that is not adapted to the specific environment.
Automatic Speech Recognition (ASR)
1 code implementation • 29 Mar 2019 • Ivo Trowitzsch, Christopher Schymura, Dorothea Kolossa, Klaus Obermayer
This work presents an approach that robustly binds localization with the detection of sound events in a binaural robotic system.
Sound • Audio and Speech Processing
1 code implementation • 14 Mar 2019 • Christopher Schymura, Dorothea Kolossa
This paper presents a framework that extends the well-established theory of nonlinear dynamical systems with the notion of dynamic stream weights for an arbitrary number of sensory observations.
Automatic Speech Recognition (ASR)
no code implementations • 16 Aug 2018 • Lea Schönherr, Katharina Kohls, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa
We use this backpropagation to learn the degrees of freedom for the adversarial perturbation of the input signal, i.e., we apply a psychoacoustic model and manipulate the acoustic signal below the thresholds of human perception.
Cryptography and Security • Sound • Audio and Speech Processing