Search Results for author: Dorothea Kolossa

Found 30 papers, 15 papers with code

Hierarchy-aware Learning of Sequential Tool Usage via Semi-automatically Constructed Taxonomies

no code implementations COLING (MWE) 2020 Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa

When repairing a device, humans employ a series of tools that corresponds to the arrangement of the device components.

DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution

no code implementations26 May 2023 Matías Pizarro, Dorothea Kolossa, Asja Fischer

Adversarial attacks can mislead automatic speech recognition (ASR) systems into predicting an arbitrary target text, thus posing a clear security threat.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

RubCSG at SemEval-2022 Task 5: Ensemble learning for identifying misogynous MEMEs

1 code implementation SemEval (NAACL) 2022 Wentao Yu, Benedikt Boenninghoff, Jonas Roehrig, Dorothea Kolossa

This work presents an ensemble system based on various uni-modal and bi-modal model architectures developed for the SemEval 2022 Task 5: MAMI-Multimedia Automatic Misogyny Identification.

Ensemble Learning

Federated Learning in ASR: Not as Easy as You Think

1 code implementation30 Sep 2021 Wentao Yu, Jan Freiwald, Sören Tewes, Fabien Huennemeyer, Dorothea Kolossa

We discuss the outcomes of these systems, which both show great similarities and only small improvements, pointing to a need for a deeper understanding of federated learning for speech recognition.

Federated Learning speech-recognition +1

Large-vocabulary Audio-visual Speech Recognition in Noisy Environments

no code implementations10 Sep 2021 Wentao Yu, Steffen Zeiler, Dorothea Kolossa

To address the inherent difficulties, we propose a new fusion strategy: a recurrent integration network is trained to fuse the state posteriors of multiple single-modality models, guided by a set of model-based and signal-based stream reliability measures.

Audio-Visual Speech Recognition Lipreading +3

O2D2: Out-Of-Distribution Detector to Capture Undecidable Trials in Authorship Verification

1 code implementation30 Jun 2021 Benedikt Boenninghoff, Robert M. Nickel, Dorothea Kolossa

The PAN 2021 authorship verification (AV) challenge is part of a three-year strategy, moving from a cross-topic/closed-set AV task to a cross-topic/open-set AV task over a collection of fanfiction texts.

Authorship Verification

Self-Calibrating Neural-Probabilistic Model for Authorship Verification Under Covariate Shift

1 code implementation21 Jun 2021 Benedikt Boenninghoff, Dorothea Kolossa, Robert M. Nickel

We are addressing two fundamental problems in authorship verification (AV): Topic variability and miscalibration.

Authorship Verification

PILOT: Introducing Transformers for Probabilistic Sound Event Localization

1 code implementation7 Jun 2021 Christopher Schymura, Benedikt Bönninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa

Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e. g. a microphone array).

Event Detection

Fusing information streams in end-to-end audio-visual speech recognition

no code implementations19 Apr 2021 Wentao Yu, Steffen Zeiler, Dorothea Kolossa

While audio-visual speech recognition can significantly improve the recognition rate of end-to-end models in such poor conditions, it is not obvious how to best utilize any available information on acoustic and visual signal quality and reliability in these models.

Audio-Visual Speech Recognition Lip Reading +2

Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

no code implementations1 Mar 2021 Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

The detection of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques.

General Classification

Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization

1 code implementation28 Feb 2021 Christopher Schymura, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa

Herein, attentions allow for capturing temporal dependencies in the audio signal by focusing on specific frames that are relevant for estimating the activity and direction-of-arrival of sound events at the current time-step.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Dompteur: Taming Audio Adversarial Examples

1 code implementation10 Feb 2021 Thorsten Eisenhofer, Lea Schönherr, Joel Frank, Lars Speckemeier, Dorothea Kolossa, Thorsten Holz

In this paper we propose a different perspective: We accept the presence of adversarial examples against ASR systems, but we require them to be perceivable by human listeners.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

VenoMave: Targeted Poisoning Against Speech Recognition

1 code implementation21 Oct 2020 Hojjat Aghakhani, Lea Schönherr, Thorsten Eisenhofer, Dorothea Kolossa, Thorsten Holz, Christopher Kruegel, Giovanni Vigna

In a more realistic scenario, when the target audio waveform is played over the air in different rooms, VENOMAVE maintains a success rate of up to 73. 3%.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Deep Bayes Factor Scoring for Authorship Verification

no code implementations23 Aug 2020 Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa

The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts.

Authorship Verification Metric Learning

Variational Autoencoder with Embedded Student-$t$ Mixture Model for Authorship Attribution

no code implementations28 May 2020 Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa

In this work, we are extending the Gaussian model for the VAE to a Student-$t$ model, which allows for an independent control of the "heaviness" of the respective tails of the implied probability densities.

Authorship Attribution General Classification

Detecting Adversarial Examples for Speech Recognition via Uncertainty Quantification

1 code implementation24 May 2020 Sina Däubener, Lea Schönherr, Asja Fischer, Dorothea Kolossa

The neural networks for uncertainty quantification simultaneously diminish the vulnerability to the attack, which is reflected in a lower recognition accuracy of the malicious target text in comparison to a standard hybrid ASR system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Leveraging Frequency Analysis for Deep Fake Image Recognition

1 code implementation ICML 2020 Joel Frank, Thorsten Eisenhofer, Lea Schönherr, Asja Fischer, Dorothea Kolossa, Thorsten Holz

Based on this analysis, we demonstrate how the frequency representation can be used to identify deep fake images in an automated way, surpassing state-of-the-art methods.

Image Forensics

On Neural Phone Recognition of Mixed-Source ECoG Signals

no code implementations12 Dec 2019 Ahmed Hussen Abdelaziz, Shuo-Yiin Chang, Nelson Morgan, Erik Edwards, Dorothea Kolossa, Dan Ellis, David A. Moses, Edward F. Chang

The emerging field of neural speech recognition (NSR) using electrocorticography has recently attracted remarkable research interest for studying how human brains recognize speech in quiet and noisy surroundings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

2 code implementations17 Oct 2019 Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel

Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not.

Authorship Verification Decision Making

Speaker-adapted neural-network-based fusion for multimodal reference resolution

no code implementations WS 2019 Diana Kleingarn, Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa

Humans use a variety of approaches to reference objects in the external world, including verbal descriptions, hand and head gestures, eye gaze or any combination of them.

Similarity Learning for Authorship Verification in Social Media

2 code implementations20 Aug 2019 Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not.

Authorship Verification

Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems

no code implementations5 Aug 2019 Lea Schönherr, Thorsten Eisenhofer, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

In this paper, we demonstrate the first algorithm that produces generic adversarial examples, which remain robust in an over-the-air attack that is not adapted to the specific environment.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Joining Sound Event Detection and Localization Through Spatial Segregation

1 code implementation29 Mar 2019 Ivo Trowitzsch, Christopher Schymura, Dorothea Kolossa, Klaus Obermayer

This work presents an approach that robustly binds localization with the detection of sound events in a binaural robotic system.

Sound Audio and Speech Processing

Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights

1 code implementation14 Mar 2019 Christopher Schymura, Dorothea Kolossa

This paper presents a framework that extends the well-established theory of nonlinear dynamical systems with the notion of dynamic stream weights for an arbitrary number of sensory observations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Adversarial Attacks Against Automatic Speech Recognition Systems via Psychoacoustic Hiding

no code implementations16 Aug 2018 Lea Schönherr, Katharina Kohls, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

We use this backpropagation to learn the degrees of freedom for the adversarial perturbation of the input signal, i. e., we apply a psychoacoustic model and manipulate the acoustic signal below the thresholds of human perception.

Cryptography and Security Sound Audio and Speech Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.