Search Results for author: Dorothea Kolossa

Found 30 papers, 15 papers with code

Hierarchy-aware Learning of Sequential Tool Usage via Semi-automatically Constructed Taxonomies

no code implementations • COLING (MWE) 2020 • Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa

When repairing a device, humans employ a series of tools that corresponds to the arrangement of the device components.

DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution

no code implementations • 26 May 2023 • Matías Pizarro, Dorothea Kolossa, Asja Fischer

Adversarial attacks can mislead automatic speech recognition (ASR) systems into predicting an arbitrary target text, thus posing a clear security threat.

Automatic Speech Recognition (ASR) +1

RubCSG at SemEval-2022 Task 5: Ensemble learning for identifying misogynous MEMEs

1 code implementation • SemEval (NAACL) 2022 • Wentao Yu, Benedikt Boenninghoff, Jonas Roehrig, Dorothea Kolossa

This work presents an ensemble system based on various uni-modal and bi-modal model architectures developed for the SemEval 2022 Task 5: MAMI-Multimedia Automatic Misogyny Identification.

Ensemble Learning

Robustifying automatic speech recognition by extracting slowly varying features

no code implementations • 14 Dec 2021 • Matias Pizarro, Dorothea Kolossa, Asja Fischer

We perform an empirical analysis of hybrid ASR models trained on data pre-processed in such a way.

Automatic Speech Recognition (ASR) +1

Federated Learning in ASR: Not as Easy as You Think

1 code implementation • 30 Sep 2021 • Wentao Yu, Jan Freiwald, Sören Tewes, Fabien Huennemeyer, Dorothea Kolossa

We discuss the outcomes of these systems, which both show great similarities and only small improvements, pointing to a need for a deeper understanding of federated learning for speech recognition.

Federated Learning speech-recognition +1

Large-vocabulary Audio-visual Speech Recognition in Noisy Environments

no code implementations • 10 Sep 2021 • Wentao Yu, Steffen Zeiler, Dorothea Kolossa

To address the inherent difficulties, we propose a new fusion strategy: a recurrent integration network is trained to fuse the state posteriors of multiple single-modality models, guided by a set of model-based and signal-based stream reliability measures.

Audio-Visual Speech Recognition Lipreading +3

O2D2: Out-Of-Distribution Detector to Capture Undecidable Trials in Authorship Verification

1 code implementation • 30 Jun 2021 • Benedikt Boenninghoff, Robert M. Nickel, Dorothea Kolossa

The PAN 2021 authorship verification (AV) challenge is part of a three-year strategy, moving from a cross-topic/closed-set AV task to a cross-topic/open-set AV task over a collection of fanfiction texts.

Authorship Verification

Self-Calibrating Neural-Probabilistic Model for Authorship Verification Under Covariate Shift

1 code implementation • 21 Jun 2021 • Benedikt Boenninghoff, Dorothea Kolossa, Robert M. Nickel

We are addressing two fundamental problems in authorship verification (AV): Topic variability and miscalibration.

Authorship Verification

PILOT: Introducing Transformers for Probabilistic Sound Event Localization

1 code implementation • 7 Jun 2021 • Christopher Schymura, Benedikt Bönninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa

Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e.g., a microphone array).

Event Detection

Fusing information streams in end-to-end audio-visual speech recognition

no code implementations • 19 Apr 2021 • Wentao Yu, Steffen Zeiler, Dorothea Kolossa

While audio-visual speech recognition can significantly improve the recognition rate of end-to-end models in such poor conditions, it is not obvious how to best utilize any available information on acoustic and visual signal quality and reliability in these models.

Audio-Visual Speech Recognition Lip Reading +2

Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

no code implementations • 1 Mar 2021 • Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

The detection of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques.

General Classification

Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization

1 code implementation • 28 Feb 2021 • Christopher Schymura, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa

Herein, attentions allow for capturing temporal dependencies in the audio signal by focusing on specific frames that are relevant for estimating the activity and direction-of-arrival of sound events at the current time-step.

Automatic Speech Recognition (ASR) +1

Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain

1 code implementation • 23 Feb 2021 • Julio Wissing, Benedikt Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura

Estimating the positions of multiple speakers can be helpful for tasks like automatic speech recognition or speaker diarization.

Automatic Speech Recognition (ASR) +4

Dompteur: Taming Audio Adversarial Examples

1 code implementation • 10 Feb 2021 • Thorsten Eisenhofer, Lea Schönherr, Joel Frank, Lars Speckemeier, Dorothea Kolossa, Thorsten Holz

In this paper we propose a different perspective: We accept the presence of adversarial examples against ASR systems, but we require them to be perceivable by human listeners.

Automatic Speech Recognition (ASR) +1

Variational Autoencoder with Embedded Student-t Mixture Model for Authorship Attribution

no code implementations • COLING 2020 • Benedikt Boenninghoff, Steffen Zeiler, Robert Nickel, Dorothea Kolossa

In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task.

Authorship Attribution

VenoMave: Targeted Poisoning Against Speech Recognition

1 code implementation • 21 Oct 2020 • Hojjat Aghakhani, Lea Schönherr, Thorsten Eisenhofer, Dorothea Kolossa, Thorsten Holz, Christopher Kruegel, Giovanni Vigna

In a more realistic scenario, when the target audio waveform is played over the air in different rooms, VENOMAVE maintains a success rate of up to 73.3%.

Automatic Speech Recognition (ASR) +3

Non-intrusive speech intelligibility prediction using automatic speech recognition derived measures

no code implementations • 16 Oct 2020 • Mahdie Karbasi, Stefan Bleeck, Dorothea Kolossa

The estimation of speech intelligibility is still far from being a solved problem.

Automatic Speech Recognition (ASR) +2

Deep Bayes Factor Scoring for Authorship Verification

no code implementations • 23 Aug 2020 • Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa

The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts.

Authorship Verification Metric Learning

Variational Autoencoder with Embedded Student-$t$ Mixture Model for Authorship Attribution

no code implementations • 28 May 2020 • Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa

In this work, we are extending the Gaussian model for the VAE to a Student-$t$ model, which allows for an independent control of the "heaviness" of the respective tails of the implied probability densities.

Authorship Attribution General Classification

Detecting Adversarial Examples for Speech Recognition via Uncertainty Quantification

1 code implementation • 24 May 2020 • Sina Däubener, Lea Schönherr, Asja Fischer, Dorothea Kolossa

The neural networks for uncertainty quantification simultaneously diminish the vulnerability to the attack, which is reflected in a lower recognition accuracy of the malicious target text in comparison to a standard hybrid ASR system.

Automatic Speech Recognition (ASR) +3

MyFixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair Manuals

no code implementations • LREC 2020 • Nima Nabizadeh, Dorothea Kolossa, Martin Heckmann

In this paper, we, therefore, focus on information extraction (IE) from the instructional text in repair manuals.

Leveraging Frequency Analysis for Deep Fake Image Recognition

1 code implementation • ICML 2020 • Joel Frank, Thorsten Eisenhofer, Lea Schönherr, Asja Fischer, Dorothea Kolossa, Thorsten Holz

Based on this analysis, we demonstrate how the frequency representation can be used to identify deep fake images in an automated way, surpassing state-of-the-art methods.

Image Forensics

On Neural Phone Recognition of Mixed-Source ECoG Signals

no code implementations • 12 Dec 2019 • Ahmed Hussen Abdelaziz, Shuo-Yiin Chang, Nelson Morgan, Erik Edwards, Dorothea Kolossa, Dan Ellis, David A. Moses, Edward F. Chang

The emerging field of neural speech recognition (NSR) using electrocorticography has recently attracted remarkable research interest for studying how human brains recognize speech in quiet and noisy surroundings.

Automatic Speech Recognition (ASR) +1

Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

2 code implementations • 17 Oct 2019 • Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel

Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not.

Authorship Verification Decision Making

Speaker-adapted neural-network-based fusion for multimodal reference resolution

no code implementations • WS 2019 • Diana Kleingarn, Nima Nabizadeh, Martin Heckmann, Dorothea Kolossa

Humans use a variety of approaches to reference objects in the external world, including verbal descriptions, hand and head gestures, eye gaze or any combination of them.

Similarity Learning for Authorship Verification in Social Media

2 code implementations • 20 Aug 2019 • Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not.

Authorship Verification

Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems

no code implementations • 5 Aug 2019 • Lea Schönherr, Thorsten Eisenhofer, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

In this paper, we demonstrate the first algorithm that produces generic adversarial examples, which remain robust in an over-the-air attack that is not adapted to the specific environment.

Automatic Speech Recognition (ASR) +1

Joining Sound Event Detection and Localization Through Spatial Segregation

1 code implementation • 29 Mar 2019 • Ivo Trowitzsch, Christopher Schymura, Dorothea Kolossa, Klaus Obermayer

This work presents an approach that robustly binds localization with the detection of sound events in a binaural robotic system.

Sound Audio and Speech Processing

Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights

1 code implementation • 14 Mar 2019 • Christopher Schymura, Dorothea Kolossa

This paper presents a framework that extends the well-established theory of nonlinear dynamical systems with the notion of dynamic stream weights for an arbitrary number of sensory observations.

Automatic Speech Recognition (ASR) +1

Adversarial Attacks Against Automatic Speech Recognition Systems via Psychoacoustic Hiding

no code implementations • 16 Aug 2018 • Lea Schönherr, Katharina Kohls, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

We use this backpropagation to learn the degrees of freedom for the adversarial perturbation of the input signal, i.e., we apply a psychoacoustic model and manipulate the acoustic signal below the thresholds of human perception.

Cryptography and Security Sound Audio and Speech Processing
