Semi-Supervised Speech Recognition via Local Prior Matching

24 Feb 2020 · Wei-Ning Hsu, Ann Lee, Gabriel Synnaeve, Awni Hannun

For sequence transduction tasks like speech recognition, a strong structured prior model encodes rich information about the target space, implicitly ruling out invalid sequences by assigning them low probability. In this work, we propose local prior matching (LPM), a semi-supervised objective that distills knowledge from a strong prior (e.g. a language model) to provide learning signal to a discriminative model trained on unlabeled speech. We demonstrate that LPM is theoretically well-motivated, simple to implement, and superior to existing knowledge distillation techniques under comparable settings. Starting from a baseline trained on 100 hours of labeled speech, with an additional 360 hours of unlabeled data, LPM recovers 54% and 73% of the word error rate on clean and noisy test sets relative to a fully supervised model on the same data.

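The recipe described in the abstract can be read as: propose hypotheses for each unlabeled utterance with the current ASR model, score those hypotheses with a strong prior such as a language model, normalize the scores into a local prior over the proposed hypotheses, and train the ASR model toward that prior. The snippet below is a minimal PyTorch-style sketch of that idea under stated assumptions: `asr_model.beam_search`, `asr_model.log_prob`, and `lm.score` are hypothetical interfaces, and the weighting shown (length-normalized LM scores passed through a softmax over the beam) is one plausible instantiation rather than the authors' exact implementation.

```python
import torch
import torch.nn.functional as F


def local_prior_matching_loss(asr_model, lm, unlabeled_audio,
                              beam_size=8, lm_temperature=1.0):
    """Sketch of a local-prior-matching-style loss for one unlabeled utterance.

    Steps:
      1. Propose hypotheses with the current ASR model (e.g. beam search).
      2. Score each hypothesis with a pretrained language model.
      3. Normalize the (length-normalized) LM scores over the beam to form
         a local prior over the proposed hypotheses.
      4. Minimize the prior-weighted negative log-likelihood of the
         hypotheses under the ASR model.

    `asr_model.beam_search`, `asr_model.log_prob`, and `lm.score` are assumed
    interfaces for illustration only, not part of any specific library.
    """
    # 1. Propose hypotheses; no gradients are needed for the proposal step.
    with torch.no_grad():
        hypotheses = asr_model.beam_search(unlabeled_audio, beam_size=beam_size)

        # 2.-3. Length-normalized LM log-probabilities, turned into a
        # distribution over the beam via a (temperature-scaled) softmax.
        lm_scores = torch.stack(
            [lm.score(hyp) / max(len(hyp), 1) for hyp in hypotheses]
        )
        prior = F.softmax(lm_scores / lm_temperature, dim=0)

    # 4. Prior-weighted negative log-likelihood under the ASR model;
    # this is the term through which gradients reach the acoustic model.
    log_probs = torch.stack(
        [asr_model.log_prob(unlabeled_audio, hyp) for hyp in hypotheses]
    )
    return -(prior * log_probs).sum()
```

In this reading, the labeled data is still trained with the usual supervised criterion, and the loss above is added for unlabeled batches; the proposal, LM scoring, and normalization choices are the knobs that distinguish LPM-style training from plain pseudo-labeling.
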
| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Speech Recognition | LibriSpeech test-clean | Local Prior Matching (Large Model) | Word Error Rate (WER) | 7.19 | #53 |
| Speech Recognition | LibriSpeech test-other | Local Prior Matching (Large Model) | Word Error Rate (WER) | 20.84 | #47 |
| Speech Recognition | LibriSpeech test-other | Local Prior Matching (Large Model, ConvLM LM) | Word Error Rate (WER) | 15.28 | #45 |
