TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Recognition	Libri-Light test-clean	CPC unlab-60k	ABX-within	5.83	# 1
Speech Recognition	Libri-Light test-clean	CPC unlab-60k	ABX-across	7.56	# 1
Speech Recognition	Libri-Light test-clean	TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	29.3	# 2
Speech Recognition	Libri-Light test-clean	CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	43.9	# 3
Speech Recognition	Libri-Light test-other	CPC unlab-60k	ABX-within	8.14	# 1
Speech Recognition	Libri-Light test-other	CPC unlab-60k	ABX-across	13.42	# 1
Speech Recognition	Libri-Light test-other	TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	56.6	# 2
Speech Recognition	Libri-Light test-other	CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	69.5	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/libri-light-a-benchmark-for-asr-with-limited/speech-recognition-on-libri-light-test-clean)](https://paperswithcode.com/sota/speech-recognition-on-libri-light-test-clean?p=libri-light-a-benchmark-for-asr-with-limited)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/libri-light-a-benchmark-for-asr-with-limited/speech-recognition-on-libri-light-test-other)](https://paperswithcode.com/sota/speech-recognition-on-libri-light-test-other?p=libri-light-a-benchmark-for-asr-with-limited)`

Libri-Light: A Benchmark for ASR with Limited or No Supervision

17 Dec 2019 · Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdel-rahman Mohamed, Emmanuel Dupoux ·

We introduce a new collection of spoken English audio suitable for training speech recognition systems under limited or no supervision. It is derived from open-source audio books from the LibriVox project. It contains over 60K hours of audio, which is, to our knowledge, the largest freely-available corpus of speech. The audio has been segmented using voice activity detection and is tagged with SNR, speaker ID and genre descriptions. Additionally, we provide baseline systems and evaluation metrics working under three settings: (1) the zero resource/unsupervised setting (ABX), (2) the semi-supervised setting (PER, CER) and (3) the distant supervision setting (WER). Settings (2) and (3) use limited textual resources (10 minutes to 10 hours) aligned with the speech. Setting (3) uses large amounts of unaligned text. They are evaluated on the standard LibriSpeech dev and test sets for comparison with the supervised state-of-the-art.

PDF Abstract

Code

Add Remove Mark official

facebookresearch/libri-light official

446

k2-fsa/libriheavy

134

Tasks

Add Remove

speech-recognition

Speech Recognition

Datasets

Introduced in the Paper:

Libri-Light

Used in the Paper:

LibriSpeech

Results from the Paper

Edit

Ranked #1 on Speech Recognition on Libri-Light test-other (ABX-within metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Speech Recognition	Libri-Light test-clean	CPC unlab-60k	ABX-within	5.83	# 1	Compare
Speech Recognition	Libri-Light test-clean	CPC unlab-60k	ABX-across	7.56	# 1	Compare
Speech Recognition	Libri-Light test-clean	TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	29.3	# 2	Compare
Speech Recognition	Libri-Light test-clean	CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	43.9	# 3	Compare
Speech Recognition	Libri-Light test-other	CPC unlab-60k	ABX-within	8.14	# 1	Compare
Speech Recognition	Libri-Light test-other	CPC unlab-60k	ABX-across	13.42	# 1	Compare
Speech Recognition	Libri-Light test-other	TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	56.6	# 2	Compare
Speech Recognition	Libri-Light test-other	CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM	Word Error Rate (WER)	69.5	# 3	Compare

Methods

Add Remove

Test

Edit Social Preview

Libri-Light: A Benchmark for ASR with Limited or No Supervision

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove