no code implementations • 14 Mar 2022 • Rosanna Turrisi, Leonardo Badino
Based on the similarity between the target speaker and the healthy/dysarthric source speakers, we then define the healthy/dysarthric score of the target speaker that we leverage to perform dysarthria detection.
no code implementations • 6 Apr 2021 • Rosanna Turrisi, Arianna Braccia, Marco Emanuele, Simone Giulietti, Maura Pugliatti, Mariachiara Sensi, Luciano Fadiga, Leonardo Badino
The corpus aims at providing a resource for the development of ASR-based assistive technologies for patients with dysarthria.
no code implementations • 6 Apr 2021 • Rosanna Turrisi, Leonardo Badino
Interestingly, MSDA-WJDOT provides a similarity score between the source and the target, i. e. between speakers in this case.
no code implementations • 5 Dec 2019 • Ander Arriandiaga, Giovanni Morrone, Luca Pasa, Leonardo Badino, Chiara Bartolozzi
In order to overcome this limitation, we propose the use of event-driven cameras and exploit compression, high temporal resolution and low latency, for low cost and low latency motion feature extraction, going towards online embedded audio-visual speech processing.
no code implementations • 16 Apr 2019 • Luca Pasa, Giovanni Morrone, Leonardo Badino
In this paper, we analyzed how audio-visual speech enhancement can help to perform the ASR task in a cocktail party scenario.
1 code implementation • 6 Nov 2018 • Giovanni Morrone, Luca Pasa, Vadim Tikhanoff, Sonia Bergamaschi, Luciano Fadiga, Leonardo Badino
In this paper, we address the problem of enhancing the speech of a speaker of interest in a cocktail party scenario when visual information of the speaker of interest is available.
Ranked #1 on Speech Enhancement on GRID corpus (mixed-speech)
no code implementations • 4 Sep 2018 • Rosanna Turrisi, Raffaele Tavarone, Leonardo Badino
We address the problem of reconstructing articulatory movements, given audio and/or phonetic labels.