Search Results for author: Andrea Vanzo

Found 12 papers, 3 papers with code

Anatomy of Industrial Scale Multilingual ASR

no code implementations • 15 Apr 2024 • Francis McCann Ramirez, Luka Chkhetiani, Andrew Ehrenberg, Robert McHardy, Rami Botros, Yash Khare, Andrea Vanzo, Taufiquzzaman Peyash, Gabriel Oexle, Michael Liang, Ilya Sklyar, Enver Fakhan, Ahmed Etefy, Daniel McCrystal, Sam Flamini, Domenic Donato, Takuya Yoshioka

This paper describes AssemblyAI's industrial-scale automatic speech recognition (ASR) system, designed to meet the requirements of large-scale, multilingual ASR serving various application needs.

Anatomy Automatic Speech Recognition +4

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

no code implementations • 10 Apr 2024 • Kevin Zhang, Luka Chkhetiani, Francis McCann Ramirez, Yash Khare, Andrea Vanzo, Michael Liang, Sergio Ramirez Martin, Gabriel Oexle, Ruben Bousbib, Taufiquzzaman Peyash, Michael Nguyen, Dillon Pulliam, Domenic Donato

This paper presents Conformer-1, an end-to-end Automatic Speech Recognition (ASR) model trained on an extensive dataset of 570k hours of speech audio data, 91% of which was acquired from publicly available sources.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Going for GOAL: A Resource for Grounded Football Commentaries

1 code implementation • 8 Nov 2022 • Alessandro Suglia, José Lopes, Emanuele Bastianelli, Andrea Vanzo, Shubham Agarwal, Malvina Nikandrou, Lu Yu, Ioannis Konstas, Verena Rieser

As the course of a game is unpredictable, so are commentaries, which makes them a unique resource to investigate dynamic language grounding.

Moment Retrieval Retrieval

An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games

no code implementations • EACL 2021 • Alessandro Suglia, Yonatan Bisk, Ioannis Konstas, Antonio Vergari, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

Guessing games are a prototypical instance of the "learning by interacting" paradigm.

Question Answering Visual Question Answering

Encoding Syntactic Constituency Paths for Frame-Semantic Parsing with Graph Convolutional Networks

no code implementations • 26 Nov 2020 • Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

We study the problem of integrating syntactic information from constituency trees into a neural model in Frame-semantic parsing sub-tasks, namely Target Identification (TI), Frame Identification (FI), and Semantic Role Labeling (SRL).

Semantic Parsing Semantic Role Labeling +1

SLURP: A Spoken Language Understanding Resource Package

1 code implementation • EMNLP 2020 • Emanuele Bastianelli, Andrea Vanzo, Pawel Swietojanski, Verena Rieser

Spoken Language Understanding infers semantic meaning directly from audio data, and thus promises to reduce error propagation and misunderstandings in end-user applications.

Ranked #3 on Slot Filling on SLURP (using extra training data)

Intent Classification Slot Filling +1

Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games

no code implementations • COLING 2020 • Alessandro Suglia, Antonio Vergari, Ioannis Konstas, Yonatan Bisk, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

However, as shown by Suglia et al. (2020), existing models fail to learn truly multi-modal representations, relying instead on gold category labels for objects in the scene both at training and inference time.

Object

CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

no code implementations • ACL 2020 • Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank, Oliver Lemon

To remedy this, we present GROLLA, an evaluation framework for Grounded Language Learning with Attributes with three sub-tasks: 1) Goal-oriented evaluation; 2) Object attribute prediction evaluation; and 3) Zero-shot evaluation.

Attribute Grounded language learning

Hierarchical Multi-Task Natural Language Understanding for Cross-domain Conversational AI: HERMIT NLU

1 code implementation • WS 2019 • Andrea Vanzo, Emanuele Bastianelli, Oliver Lemon

We present a new neural architecture for wide-coverage Natural Language Understanding in Spoken Dialogue Systems.

Natural Language Understanding Sentence +1

MuMMER: Socially Intelligent Human-Robot Interaction in Public Spaces

no code implementations • 15 Sep 2019 • Mary Ellen Foster, Bart Craenen, Amol Deshmukh, Oliver Lemon, Emanuele Bastianelli, Christian Dondrup, Ioannis Papaioannou, Andrea Vanzo, Jean-Marc Odobez, Olivier Canévet, Yuanzhouhan Cao, Weipeng He, Angel Martínez-González, Petr Motlicek, Rémy Siegfried, Rachid Alami, Kathleen Belhassein, Guilhem Buisan, Aurélie Clodic, Amandine Mayima, Yoan Sallami, Guillaume Sarthou, Phani-Teja Singamaneni, Jules Waldhart, Alexandre Mazel, Maxime Caniot, Marketta Niemelä, Päivi Heikkilä, Hanna Lammi, Antti Tammela

In the EU-funded MuMMER project, we have developed a social robot designed to interact naturally and flexibly with users in public spaces such as a shopping mall.

Motion Planning

Structured Learning for Context-aware Spoken Language Understanding of Robotic Commands

no code implementations • WS 2017 • Andrea Vanzo, Danilo Croce, Roberto Basili, Daniele Nardi

Service robots are expected to operate in specific environments, where the presence of humans plays a key role.

Spoken Language Understanding

A context-based model for Sentiment Analysis in Twitter

no code implementations • COLING 2014 • Andrea Vanzo, Danilo Croce, Roberto Basili

Sentiment Analysis Text Classification

