no code implementations • 17 Jan 2024 • Iván López-Espejo, Aditya Joglekar, Antonio M. Peinado, Jesper Jensen
Pre-emphasis filtering, compensating for the natural energy decay of speech at higher frequencies, has been considered as a common pre-processing step in a number of speech processing tasks over the years.
no code implementations • 3 May 2023 • Iván López-Espejo, Santi Prieto, Alfonso Ortega, Eduardo Lleida
Despite the maturity of modern speaker verification technology, its performance still significantly degrades when facing non-neutrally-phonated (e. g., shouted and whispered) speech.
no code implementations • 19 Nov 2022 • Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen
In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance.
no code implementations • 20 Nov 2021 • Iván López-Espejo, Zheng-Hua Tan, John Hansen, Jesper Jensen
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams and has become a fast-growing technology thanks to the paradigm shift introduced by deep learning a few years ago.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 6 Aug 2020 • Santi Prieto, Alfonso Ortega, Iván López-Espejo, Eduardo Lleida
These compensation techniques are borrowed from the area of robustness for automatic speech recognition and, in this work, we apply them to compensate the mismatch between shouted and normal conditions in speaker verification.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 30 May 2020 • Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
Despite their great performance over the years, handcrafted speech features are not necessarily optimal for any particular speech application.
no code implementations • 15 Sep 2019 • Iván López-Espejo
In this report, I investigate the use of end-to-end deep residual learning with dilated convolutions for myocardial infarction (MI) detection and localization from electrocardiogram (ECG) signals.
no code implementations • 22 Jun 2019 • Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
Our results show that this multi-task deep residual network is able to achieve a KWS accuracy relative improvement of around 32% with respect to a system that does not deal with external speakers.