TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Recognition	Hub5'00 CallHome	Espresso	Word Error Rate (WER)	19.1	# 1
Speech Recognition	Hub5'00 SwitchBoard	Espresso	Eval2000	9.2	# 1
Speech Recognition	LibriSpeech test-clean	Espresso	Word Error Rate (WER)	2.8	# 36
Speech Recognition	LibriSpeech test-other	Espresso	Word Error Rate (WER)	8.7	# 38
Speech Recognition	WSJ eval92	Espresso	Word Error Rate (WER)	3.4	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/espresso-a-fast-end-to-end-neural-speech/speech-recognition-on-hub500-callhome)](https://paperswithcode.com/sota/speech-recognition-on-hub500-callhome?p=espresso-a-fast-end-to-end-neural-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/espresso-a-fast-end-to-end-neural-speech/speech-recognition-on-hub500-switchboard)](https://paperswithcode.com/sota/speech-recognition-on-hub500-switchboard?p=espresso-a-fast-end-to-end-neural-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/espresso-a-fast-end-to-end-neural-speech/speech-recognition-on-wsj-eval92)](https://paperswithcode.com/sota/speech-recognition-on-wsj-eval92?p=espresso-a-fast-end-to-end-neural-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/espresso-a-fast-end-to-end-neural-speech/speech-recognition-on-librispeech-test-clean)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-clean?p=espresso-a-fast-end-to-end-neural-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/espresso-a-fast-end-to-end-neural-speech/speech-recognition-on-librispeech-test-other)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-other?p=espresso-a-fast-end-to-end-neural-speech)`

Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

18 Sep 2019 · Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur ·

We present Espresso, an open-source, modular, extensible end-to-end neural automatic speech recognition (ASR) toolkit based on the deep learning library PyTorch and the popular neural machine translation toolkit fairseq. Espresso supports distributed training across GPUs and computing nodes, and features various decoding approaches commonly employed in ASR, including look-ahead word-based language model fusion, for which a fast, parallelized decoder is implemented. Espresso achieves state-of-the-art ASR performance on the WSJ, LibriSpeech, and Switchboard data sets among other end-to-end systems without data augmentation, and is 4--11x faster for decoding than similar systems (e.g. ESPnet).

PDF Abstract

Code

Add Remove Mark official

freewym/espresso official

941

Tasks

Add Remove

Automatic Speech Recognition

Automatic Speech Recognition (ASR)

Data Augmentation

Language Modelling

Machine Translation

speech-recognition

Speech Recognition

Translation

Datasets

LibriSpeech 2000 HUB5 English CALLHOME American English Speech

Results from the Paper

Edit

Ranked #1 on Speech Recognition on Hub5'00 CallHome

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Speech Recognition	Hub5'00 CallHome	Espresso	Word Error Rate (WER)	19.1	# 1	Compare
Speech Recognition	Hub5'00 SwitchBoard	Espresso	Eval2000	9.2	# 1	Compare
Speech Recognition	LibriSpeech test-clean	Espresso	Word Error Rate (WER)	2.8	# 36	Compare
Speech Recognition	LibriSpeech test-other	Espresso	Word Error Rate (WER)	8.7	# 38	Compare
Speech Recognition	WSJ eval92	Espresso	Word Error Rate (WER)	3.4	# 9	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove