Search Results for author: Giampiero Salvi

Found 18 papers, 5 papers with code

Developing Acoustic Models for Automatic Speech Recognition in Swedish

no code implementations 25 Apr 2024 Giampiero Salvi

This is done employing hidden Markov models and using the SpeechDat database to train their parameters.

Automatic Speech Recognition speech-recognition +1

Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints

no code implementations 12 Jan 2024 Giampiero Salvi

This paper describes the use of connectionist techniques in phonetic speech recognition with strong latency constraints.

Language Modelling speech-recognition +1

Segment Boundary Detection via Class Entropy Measurements in Connectionist Phoneme Recognition

no code implementations 11 Jan 2024 Giampiero Salvi

The advantage of this measure is its simplicity as the posterior probabilities of each class are available in connectionist phoneme recognition.

Boundary Detection

S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction

no code implementations 13 Jul 2023 Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi

We address the video prediction task by putting forth a novel model that combines (i) our recently proposed hierarchical residual vector quantized variational autoencoder (HR-VQVAE), and (ii) a novel spatiotemporal PixelCNN (ST-PixelCNN).

Video Prediction

STEP-GAN: A Step-by-Step Training for Multi Generator GANs with application to Cyber Security in Power Systems

no code implementations 11 Sep 2020 Mohammad Adiban, Arash Safari, Giampiero Salvi

In this study, we introduce a novel unsupervised countermeasure for smart grid power systems, based on generative adversarial networks (GANs).

One-Class Classification

From Visual to Acoustic Question Answering

no code implementations 28 Feb 2019 Jerome Abdelnour, Giampiero Salvi, Jean Rouat

The AQA task consists of analyzing an acoustic scene composed of a combination of elementary sounds and answering questions that relate the position and properties of these sounds.

Acoustic Question Answering Position +2

Active Mini-Batch Sampling using Repulsive Point Processes

1 code implementation 8 Apr 2018 Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi

We first show that the phenomenon of variance reduction by diversified sampling generalizes in particular to non-stationary point processes.

Point Processes

Language Bootstrapping: Learning Word Meanings From Perception-Action Association

1 code implementation 27 Nov 2017 Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor

The model is based on an affordance network, i.e., a mapping between robot actions, robot perceptions, and the perceived effects of these actions upon objects.

Language Acquisition speech-recognition +1

Interactive Robot Learning of Gestures, Language and Affordances

no code implementations 24 Nov 2017 Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi

A growing field in robotics and Artificial Intelligence (AI) research is human-robot collaboration, whose target is to enable effective teamwork between humans and robots.

Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition

no code implementations 24 Nov 2017 Kalin Stefanov, Jonas Beskow, Giampiero Salvi

Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings.

Language Acquisition

Semi-supervised Learning with Sparse Autoencoders in Phone Classification

no code implementations 3 Oct 2016 Akash Kumar Dhaka, Giampiero Salvi

We propose the application of a semi-supervised learning method to improve the performance of acoustic modelling for automatic speech recognition based on deep neural networks.

Acoustic Modelling Automatic Speech Recognition +4

Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing

no code implementations 29 Jun 2016 Akash Kumar Dhaka, Giampiero Salvi

We present a systematic analysis of the performance of a phonetic recogniser when the window of input features is not symmetric with respect to the current frame.

Low-latency processing

The WaveSurfer Automatic Speech Recognition Plugin

no code implementations LREC 2014 Giampiero Salvi, Niklas Vanhainen

This paper presents a plugin that adds automatic speech recognition (ASR) functionality to the WaveSurfer sound manipulation and visualisation program.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
