no code implementations • 18 Dec 2023 • Philipp Seeberger, Tobias Bocklet, Korbinian Riedhammer
User-generated information content has become an important information source in crisis situations.
no code implementations • 16 Aug 2023 • Franziska Braun, Sebastian P. Bayerl, Paula A. Pérez-Toro, Florian Hönig, Hartmut Lehfeld, Thomas Hillemacher, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer
Automated dementia screening enables early detection and intervention, reducing costs to healthcare systems and increasing quality of life for those affected.
no code implementations • 30 May 2023 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer
Most stuttering detection and classification research has viewed stuttering as a multi-class classification problem or a binary detection task for each dysfluency type; however, this does not match the nature of stuttering, in which one dysfluency seldom comes alone but rather co-occurs with others.
no code implementations • 28 Oct 2022 • Sebastian P. Bayerl, Dominik Wagner, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer
This work explores an approach based on a modified wav2vec 2. 0 system for end-to-end stuttering detection and classification as a multi-label problem.
no code implementations • 28 Oct 2022 • Ilja Baumann, Dominik Wagner, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
Recent findings show that pre-trained wav2vec 2. 0 models are reliable feature extractors for various speaker characteristics classification tasks.
no code implementations • 27 Oct 2022 • Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech.
no code implementations • 24 Oct 2022 • Maximilian Bundscherer, Thomas H. Schmitt, Sebastian Bayerl, Thomas Auerbach, Tobias Bocklet
This paper describes a machine learning approach to determine the abrasive belt wear of wide belt sanders used in industrial processes based on acoustic data, regardless of the sanding process-related parameters, Feed speed, Grit Size, and Type of material.
no code implementations • 16 Jun 2022 • Ilja Baumann, Dominik Wagner, Sebastian Bayerl, Tobias Bocklet
In this work, the task is to determine whether spoken nonwords have been uttered correctly.
1 code implementation • 7 Jun 2022 • Sebastian P. Bayerl, Dominik Wagner, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer
This paper empirically investigates the influence of different data splits and splitting strategies on the performance of dysfluency detection systems.
no code implementations • 7 Apr 2022 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Korbinian Riedhammer, Tobias Bocklet
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended utilization.
no code implementations • 11 Aug 2020 • Munir Georges, Jonathan Huang, Tobias Bocklet
Deep neural networks (DNN) have recently been widely used in speaker recognition systems, achieving state-of-the-art performance on various benchmarks.
no code implementations • LREC 2014 • Tobias Bocklet, Andreas Maier, Korbinian Riedhammer, Ulrich Eysholdt, Elmar N{\"o}th
In this paper we describe Erlangen-CLP, a large speech database of children with Cleft Lip and Palate.