Search Results for author: Arthur Pimentel

Found 7 papers, 1 papers with code

An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning

1 code implementation • 13 Mar 2024 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk

Lastly, we show that the proposed recipe can be applied to other distillation methodologies, such as the recent DPWavLM.

Denoising Knowledge Distillation +2

Paper
Code

On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''

no code implementations • 25 Sep 2023 • Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk

Recent advances with self-supervised learning have allowed speech recognition systems to achieve state-of-the-art (SOTA) word error rates (WER) while requiring only a fraction of the labeled training data needed by its predecessors.

Data Augmentation Model Compression +4

Paper
Add Code

VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks

no code implementations • 22 Sep 2023 • Heitor R. Guimarães, Arthur Pimentel, Anderson Avila, Tiago H. Falk

Keyword spotting (KWS) refers to the task of identifying a set of predefined words in audio streams.

Adversarial Robustness Keyword Spotting +2

Paper
Add Code

On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications

no code implementations • 23 May 2023 • Vamsikrishna Chemudupati, Marzieh Tahaei, Heitor Guimaraes, Arthur Pimentel, Anderson Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago Falk

Large self-supervised pre-trained speech models have achieved remarkable success across various speech-processing tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild'' Edge Applications

no code implementations • 9 May 2023 • Heitor Guimarães, Arthur Pimentel, Anderson Avila, Mehdi Rezagholizadeh, Tiago H. Falk

Later, these representations serve as input to downstream models to solve a number of tasks, such as keyword spotting or emotion recognition.

Emotion Recognition intent-classification +2

Paper
Add Code

RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness

no code implementations • 18 Feb 2023 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk

The proposed layer-wise distillation recipe is evaluated on top of three well-established universal representations, as well as with three downstream tasks.

Knowledge Distillation Multi-Task Learning

Paper
Add Code

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement

no code implementations • 12 Nov 2022 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk

Self-supervised speech representation learning aims to extract meaningful factors from the speech signal that can later be used across different downstream tasks, such as speech and/or emotion recognition.

Data Augmentation Emotion Recognition +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.