Self-training and Pre-training are Complementary for Speech Recognition

Self-training and unsupervised pre-training have emerged as effective approaches to improve speech recognition systems using unlabeled data. However, it is not clear whether they learn similar patterns or whether they can be effectively combined. In this paper, we show that pseudo-labeling and pre-training with wav2vec 2.0 are complementary in a variety of labeled data setups. Using just 10 minutes of labeled data from Libri-light together with 53k hours of unlabeled data from LibriVox, this approach achieves WERs of 3.0%/5.2% on the clean and other test sets of Librispeech, rivaling the best published systems trained on 960 hours of labeled data only a year ago. Training on all labeled data of Librispeech achieves WERs of 1.5%/3.1%.
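The abstract describes a four-stage pipeline: unsupervised pre-training on unlabeled audio, supervised fine-tuning on the available transcripts, pseudo-labeling of the unlabeled audio with the fine-tuned model, and a final supervised training run on the combined data. The sketch below is a hypothetical illustration of that flow, not code from the paper; the stage callables (pretrain, finetune, transcribe, train_final) are placeholders rather than any real toolkit API, and the use of language-model decoding during pseudo-labeling is an assumption based on common practice.

```python
# Hypothetical sketch of combining wav2vec 2.0 pre-training with self-training
# (pseudo-labeling). The stage functions are placeholders, not a real API.

def self_training_with_pretraining(
    unlabeled_audio,   # e.g. 53k hours of LibriVox audio (no transcripts)
    labeled_data,      # e.g. 10 min / 100 h / 960 h of transcribed Librispeech
    pretrain,          # callable: unlabeled audio -> encoder (wav2vec 2.0 objective)
    finetune,          # callable: (encoder, labeled data) -> acoustic model
    transcribe,        # callable: (model, audio) -> transcript (often LM-decoded)
    train_final,       # callable: mixed labeled data -> final model
):
    """Outline of the combined pipeline: pre-train, fine-tune, pseudo-label, retrain."""
    # 1) Unsupervised pre-training on the unlabeled audio.
    encoder = pretrain(unlabeled_audio)

    # 2) Supervised fine-tuning of the pre-trained encoder on the labeled data.
    teacher = finetune(encoder, labeled_data)

    # 3) Self-training: generate pseudo-labels for the unlabeled audio.
    pseudo_labeled = [(audio, transcribe(teacher, audio)) for audio in unlabeled_audio]

    # 4) Final supervised training on real plus pseudo-labeled data.
    return train_final(list(labeled_data) + pseudo_labeled)
```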


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Speech Recognition | LibriSpeech test-clean | wav2vec_wav2letter | Word Error Rate (WER) | 2.7 | #34 |
| Speech Recognition | LibriSpeech test-clean | Conv + Transformer + wav2vec2.0 + pseudo labeling | Word Error Rate (WER) | 1.5 | #4 |
| Speech Recognition | LibriSpeech test-other | Conv + Transformer + wav2vec2.0 + pseudo labeling | Word Error Rate (WER) | 3.1 | #5 |
| Speech Recognition | LibriSpeech train-clean-100 (test-clean) | wav2vec_wav2letter | Word Error Rate (WER) | 2.8 | #1 |
| Speech Recognition | LibriSpeech train-clean-100 (test-other) | wav2vec_wav2letter | Word Error Rate (WER) | 3.6 | #1 |

Methods


No methods listed for this paper.