1 code implementation • 2 May 2024 • Moreno La Quatra, Alkis Koudounas, Lorenzo Vaiani, Elena Baralis, Luca Cagliero, Paolo Garza, Sabato Marco Siniscalchi
Limited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities.
no code implementations • 31 Mar 2024 • Alkis Koudounas, Flavio Giobergia
We identify subgroups of audio recordings based on combinations of these metadata and compute each subgroup's performance (e. g., Word Error Rate) and the difference in performance (''divergence'') w. r. t the overall population.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 1 Mar 2024 • Federico Borra, Claudio Savelli, Giacomo Rosso, Alkis Koudounas, Flavio Giobergia
In Natural Language Generation (NLG), contemporary Large Language Models (LLMs) face several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency-centric metrics.
no code implementations • 2 Oct 2023 • Flavio Giobergia, Alkis Koudounas, Elena Baralis
Exploring exoplanets has transformed our understanding of the universe by revealing many planetary systems that defy our current understanding.
no code implementations • 14 Sep 2023 • Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis
Existing work focuses on a few spoken language understanding (SLU) tasks, and explanations are difficult to interpret for most users.
1 code implementation • 14 Jun 2023 • Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis
Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.