no code implementations • 25 Apr 2024 • Giampiero Salvi
This is done employing hidden Markov models and using the SpeechDat database to train their parameters.
no code implementations • 12 Jan 2024 • Giampiero Salvi
This paper describes the use of connectionist techniques in phonetic speech recognition with strong latency constraints.
no code implementations • 11 Jan 2024 • Giampiero Salvi
The advantage of this measure is its simplicity as the posterior probabilities of each class are available in connectionist phoneme recognition.
no code implementations • 13 Jul 2023 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi
We address the video prediction task by putting forth a novel model that combines (i) our recently proposed hierarchical residual vector quantized variational autoencoder (HR-VQVAE), and (ii) a novel spatiotemporal PixelCNN (ST-PixelCNN).
1 code implementation • 9 Aug 2022 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi
We propose a multi-layer variational autoencoder method, we call HR-VQVAE, that learns hierarchical discrete representations of the data.
no code implementations • 11 Jun 2021 • Jerome Abdelnour, Jean Rouat, Giampiero Salvi
We also test the addition of a MALiMo module in our model on both CLEAR2 and DAQA.
no code implementations • 11 Sep 2020 • Mohammad Adiban, Arash Safari, Giampiero Salvi
In this study, we introduce a novel unsupervised countermeasure for smart grid power systems, based on generative adversarial networks (GANs).
no code implementations • 28 Feb 2019 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat
The AQA task consists of analyzing an acoustic scene composed by a combination of elementary sounds and answering questions that relate the position and properties of these sounds.
1 code implementation • 26 Feb 2019 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi
It then uses this information to learn a mapping between its own actions and those performed by a human in a shared environment.
1 code implementation • 26 Nov 2018 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat
We introduce the task of acoustic question answering (AQA) in the area of acoustic reasoning.
1 code implementation • 8 Apr 2018 • Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi
We first show that the phenomenon of variance reduction by diversified sampling generalizes in particular to non-stationary point processes.
1 code implementation • 27 Nov 2017 • Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor
The model is based on an affordance network, i. e., a mapping between robot actions, robot perceptions, and the perceived effects of these actions upon objects.
no code implementations • 24 Nov 2017 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi
A growing field in robotics and Artificial Intelligence (AI) research is human-robot collaboration, whose target is to enable effective teamwork between humans and robots.
no code implementations • 24 Nov 2017 • Kalin Stefanov, Jonas Beskow, Giampiero Salvi
Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings.
no code implementations • 3 Oct 2016 • Akash Kumar Dhaka, Giampiero Salvi
We propose the application of a semi-supervised learning method to improve the performance of acoustic modelling for automatic speech recognition based on deep neural net- works.
no code implementations • 29 Jun 2016 • Akash Kumar Dhaka, Giampiero Salvi
We present a systematic analysis on the performance of a phonetic recogniser when the window of input features is not symmetric with respect to the current frame.
no code implementations • LREC 2014 • Giampiero Salvi, Niklas Vanhainen
This paper presents a plugin that adds automatic speech recognition (ASR) functionality to the WaveSurfer sound manipulation and visualisation program.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2014 • Niklas Vanhainen, Giampiero Salvi
This paper presents results for large vocabulary continuous speech recognition (LVCSR) in Swedish.