no code implementations • 16 Feb 2023 • Pierre Beckmann, Guillaume Köstner, Inês Hipólito
We propose a non-representationalist framework for deep learning relying on a novel method: computational phenomenology, a dialogue between the first-person perspective (relying on phenomenology) and the mechanisms of computational models.
1 code implementation • 24 Jun 2022 • Gasser Elbanna, Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Karl El Hajal, Milos Cernak
Our results indicate that the hybrid model with a convolutional transformer as the encoder yields superior performance in most HEAR challenge tasks.
Ranked #1 on Self-Supervised Learning on CREMA-D
1 code implementation • 30 Mar 2022 • Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow, Lara Orlandic, Pablo Mainar, Mikolaj Kegler, Pierre Beckmann, Milos Cernak
To that end, we introduce a set of five datasets for task load detection in speech.
2 code implementations • 7 Oct 2021 • Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Milos Cernak
To facilitate the process, here, we present the Speech Emotion Recognition Adaptation Benchmark (SERAB), a framework for evaluating the performance and generalization capacity of different approaches for utterance-level SER.
2 code implementations • 22 Oct 2019 • Pierre Beckmann, Mikolaj Kegler, Milos Cernak
Recent breakthroughs in deep learning often rely on representation learning and knowledge transfer.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +7
2 code implementations • 20 Oct 2019 • Mikolaj Kegler, Pierre Beckmann, Milos Cernak
To address these limitations, here we propose an end-to-end framework for speech inpainting, the context-based retrieval of missing or severely distorted parts of time-frequency representation of speech.