Search Results for author: Giampiero Salvi

We address the video prediction task by putting forth a novel model that combines (i) our recently proposed hierarchical residual vector quantized variational autoencoder (HR-VQVAE), and (ii) a novel spatiotemporal PixelCNN (ST-PixelCNN).

Video Prediction

Paper
Add Code

Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation

1 code implementation • 9 Aug 2022 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi

We propose a multi-layer variational autoencoder method, we call HR-VQVAE, that learns hierarchical discrete representations of the data.

Image Generation Image Reconstruction

Paper
Code

NAAQA: A Neural Architecture for Acoustic Question Answering

no code implementations • 11 Jun 2021 • Jerome Abdelnour, Jean Rouat, Giampiero Salvi

We also test the addition of a MALiMo module in our model on both CLEAR2 and DAQA.

Acoustic Question Answering Question Answering +2

Paper
Add Code

STEP-GAN: A Step-by-Step Training for Multi Generator GANs with application to Cyber Security in Power Systems

no code implementations • 11 Sep 2020 • Mohammad Adiban, Arash Safari, Giampiero Salvi

In this study, we introduce a novel unsupervised countermeasure for smart grid power systems, based on generative adversarial networks (GANs).

One-Class Classification

Paper
Add Code

From Visual to Acoustic Question Answering

no code implementations • 28 Feb 2019 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat

The AQA task consists of analyzing an acoustic scene composed by a combination of elementary sounds and answering questions that relate the position and properties of these sounds.

Acoustic Question Answering Position +2

Paper
Add Code

Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions

1 code implementation • 26 Feb 2019 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi

It then uses this information to learn a mapping between its own actions and those performed by a human in a shared environment.

Action Recognition Temporal Action Localization

Paper
Code

CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning

1 code implementation • 26 Nov 2018 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat

We introduce the task of acoustic question answering (AQA) in the area of acoustic reasoning.

Acoustic Question Answering Question Answering +1

Paper
Code

Active Mini-Batch Sampling using Repulsive Point Processes

1 code implementation • 8 Apr 2018 • Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi

We first show that the phenomenon of variance reduction by diversified sampling generalizes in particular to non-stationary point processes.

Point Processes

Paper
Code

Language Bootstrapping: Learning Word Meanings From Perception-Action Association

1 code implementation • 27 Nov 2017 • Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor

The model is based on an affordance network, i. e., a mapping between robot actions, robot perceptions, and the perceived effects of these actions upon objects.

Language Acquisition speech-recognition +1

Paper
Code

Interactive Robot Learning of Gestures, Language and Affordances

no code implementations • 24 Nov 2017 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi

A growing field in robotics and Artificial Intelligence (AI) research is human-robot collaboration, whose target is to enable effective teamwork between humans and robots.

Paper
Add Code

Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition

no code implementations • 24 Nov 2017 • Kalin Stefanov, Jonas Beskow, Giampiero Salvi

Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings.

Language Acquisition

Paper
Add Code

Semi-supervised Learning with Sparse Autoencoders in Phone Classification

no code implementations • 3 Oct 2016 • Akash Kumar Dhaka, Giampiero Salvi

We propose the application of a semi-supervised learning method to improve the performance of acoustic modelling for automatic speech recognition based on deep neural net- works.

Acoustic Modelling Automatic Speech Recognition +4

Paper
Add Code

Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing

no code implementations • 29 Jun 2016 • Akash Kumar Dhaka, Giampiero Salvi

We present a systematic analysis on the performance of a phonetic recogniser when the window of input features is not symmetric with respect to the current frame.

Low-latency processing

Paper
Add Code

The WaveSurfer Automatic Speech Recognition Plugin

no code implementations • LREC 2014 • Giampiero Salvi, Niklas Vanhainen

This paper presents a plugin that adds automatic speech recognition (ASR) functionality to the WaveSurfer sound manipulation and visualisation program.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish

no code implementations • LREC 2014 • Niklas Vanhainen, Giampiero Salvi

This paper presents results for large vocabulary continuous speech recognition (LVCSR) in Swedish.

Language Modelling speech-recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.