Search Results for author: Michael I Mandel

Found 7 papers, 3 papers with code

Music autotagging as captioning

no code implementations • NLP4MusA 2020 • Tian Cai, Michael I Mandel, Di He

Paper
Add Code

ImportantAug: a data augmentation agent for speech

1 code implementation • ICASSP 2022 • Viet Anh Trinh, Hassan Salami Kavaki, Michael I Mandel

We introduce ImportantAug, a technique to augment training data for speech classification and recognition models by adding noise to unimportant regions of the speech and not to important regions.

Ranked #1 on Keyword Spotting on Google Speech Commands (Google Speech Command-Musan metric)

Data Augmentation Keyword Spotting +1

Paper
Code

Large scale evaluation of importance maps in automatic speech recognition

no code implementations • 21 May 2020 • Viet Anh Trinh, Michael I Mandel

In this paper, we propose a metric that we call the structured saliency benchmark (SSBM) to evaluate importance maps computed for automatic speech recognizers on individual utterances.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement

no code implementations • 14 Nov 2019 • Soumi Maiti, Michael I Mandel

In previous work, we showed that PR systems generate high quality speech for a single speaker using two neural vocoders, WaveNet and WaveGlow.

Resynthesis Speech Enhancement

Paper
Add Code

Onssen: an open-source speech separation and enhancement library

1 code implementation • 3 Nov 2019 • Zhaoheng Ni, Michael I Mandel

onssen is a library mainly for deep learning separation and enhancement algorithms.

Deep Clustering speech-recognition +2

205

Paper
Code

Parametric Resynthesis with neural vocoders

1 code implementation • 16 Jun 2019 • Soumi Maiti, Michael I Mandel

We propose to utilize the high quality speech generation capability of neural vocoders for noise suppression.

Resynthesis

2,275

Paper
Code

Speech denoising by parametric resynthesis

no code implementations • 2 Apr 2019 • Soumi Maiti, Michael I Mandel

In comparison to two denoising systems, the oracle Wiener mask and a DNN-based mask predictor, our model equals the oracle Wiener mask in subjective quality and intelligibility and surpasses the realistic system.

Denoising Resynthesis +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.