Search Results for author: Giovanni Morrone

Found 8 papers, 2 papers with code

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

no code implementations21 Mar 2023 Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini

Finally, we also show that the separated signals can be readily used also for automatic speech recognition, reaching performance close to using oracle sources in some configurations.

Action Detection Activity Detection +4

Conversational Speech Separation: an Evaluation Study for Streaming Applications

no code implementations31 May 2022 Giovanni Morrone, Samuele Cornell, Enrico Zovato, Alessio Brutti, Stefano Squartini

Continuous speech separation (CSS) is a recently proposed framework which aims at separating each speaker from an input mixture signal in a streaming fashion.

Speech Separation

Audio-Visual Speech Inpainting with Deep Learning

no code implementations9 Oct 2020 Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen

In this paper, we present a deep-learning-based framework for audio-visual speech inpainting, i. e., the task of restoring the missing parts of an acoustic speech signal from reliable audio context and uncorrupted visual information.

Multi-Task Learning

Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras

no code implementations5 Dec 2019 Ander Arriandiaga, Giovanni Morrone, Luca Pasa, Leonardo Badino, Chiara Bartolozzi

In order to overcome this limitation, we propose the use of event-driven cameras and exploit compression, high temporal resolution and low latency, for low cost and low latency motion feature extraction, going towards online embedded audio-visual speech processing.

Optical Flow Estimation Speech Separation

An Analysis of Speech Enhancement and Recognition Losses in Limited Resources Multi-talker Single Channel Audio-Visual ASR

no code implementations16 Apr 2019 Luca Pasa, Giovanni Morrone, Leonardo Badino

In this paper, we analyzed how audio-visual speech enhancement can help to perform the ASR task in a cocktail party scenario.

Speech Enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

1 code implementation6 Nov 2018 Giovanni Morrone, Luca Pasa, Vadim Tikhanoff, Sonia Bergamaschi, Luciano Fadiga, Leonardo Badino

In this paper, we address the problem of enhancing the speech of a speaker of interest in a cocktail party scenario when visual information of the speaker of interest is available.

Speech Enhancement Speech Separation

Cannot find the paper you are looking for? You can Submit a new open access paper.