Search Results for author: Dominik Wagner

Found 13 papers, 2 papers with code

Diagonalisation SGD: Fast & Convergent SGD for Non-Differentiable Models via Reparameterisation and Smoothing

1 code implementation19 Feb 2024 Dominik Wagner, Basim Khajwal, C. -H. Luke Ong

It is well-known that the reparameterisation gradient estimator, which exhibits low variance in practice, is biased for non-differentiable models.

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

no code implementations6 Dec 2023 Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

We compare the proposed system to unimodal baselines and show that the multimodal approach achieves lower equal-error-rates (EERs), while using only a fraction of the training data.

Automatic Speech Recognition Language Modelling +3

A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem

no code implementations30 May 2023 Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer

Most stuttering detection and classification research has viewed stuttering as a multi-class classification problem or a binary detection task for each dysfluency type; however, this does not match the nature of stuttering, in which one dysfluency seldom comes alone but rather co-occurs with others.

Classification Cross-corpus +2

Fast and Correct Gradient-Based Optimisation for Probabilistic Programming via Smoothing

no code implementations9 Jan 2023 Basim Khajwal, C. -H. Luke Ong, Dominik Wagner

Thus we can prove stochastic gradient descent with the reparameterisation gradient estimator to be correct when applied to the smoothed problem.

Probabilistic Programming Variational Inference

Dysfluencies Seldom Come Alone -- Detection as a Multi-Label Problem

no code implementations28 Oct 2022 Sebastian P. Bayerl, Dominik Wagner, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer

This work explores an approach based on a modified wav2vec 2. 0 system for end-to-end stuttering detection and classification as a multi-label problem.

Multi-class Classification speech-recognition +1

Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?

no code implementations27 Oct 2022 Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet

The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech.

Binary Classification

The Influence of Dataset Partitioning on Dysfluency Detection Systems

1 code implementation7 Jun 2022 Sebastian P. Bayerl, Dominik Wagner, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer

This paper empirically investigates the influence of different data splits and splitting strategies on the performance of dysfluency detection systems.

Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0

no code implementations7 Apr 2022 Sebastian P. Bayerl, Dominik Wagner, Elmar Nöth, Korbinian Riedhammer

This paper shows that fine-tuning wav2vec 2. 0 [1] for the classification of stuttering on a sizeable English corpus containing stuttered speech, in conjunction with multi-task learning, boosts the effectiveness of the general-purpose wav2vec 2. 0 features for detecting stuttering in speech; both within and across languages.

Multi-Task Learning speech-recognition +1

Detecting Vocal Fatigue with Neural Embeddings

no code implementations7 Apr 2022 Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Korbinian Riedhammer, Tobias Bocklet

Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended utilization.

Densities of Almost Surely Terminating Probabilistic Programs are Differentiable Almost Everywhere

no code implementations8 Apr 2020 Carol Mak, C. -H. Luke Ong, Hugo Paquet, Dominik Wagner

We give SPCF a sampling-style operational semantics a la Borgstrom et al., and study the associated weight (commonly referred to as the density) function and value function on the set of possible execution traces.

Cannot find the paper you are looking for? You can Submit a new open access paper.