TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Lipreading	CAS-VSR-W1k (LRW-1000)	3D Conv + ResNet-18 + MS-TCN	Top-1 Accuracy	41.4%	# 7
Lipreading	Lip Reading in the Wild	3D Conv + ResNet-18 + MS-TCN	Top-1 Accuracy	85.30	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lipreading-using-temporal-convolutional/lipreading-on-lrw-1000)](https://paperswithcode.com/sota/lipreading-on-lrw-1000?p=lipreading-using-temporal-convolutional)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lipreading-using-temporal-convolutional/lipreading-on-lip-reading-in-the-wild)](https://paperswithcode.com/sota/lipreading-on-lip-reading-in-the-wild?p=lipreading-using-temporal-convolutional)`

Lipreading using Temporal Convolutional Networks

23 Jan 2020 · Brais Martinez, Pingchuan Ma, Stavros Petridis, Maja Pantic ·

Lip-reading has attracted a lot of research attention lately thanks to advances in deep learning. The current state-of-the-art model for recognition of isolated words in-the-wild consists of a residual network and Bidirectional Gated Recurrent Unit (BGRU) layers. In this work, we address the limitations of this model and we propose changes which further improve its performance. Firstly, the BGRU layers are replaced with Temporal Convolutional Networks (TCN). Secondly, we greatly simplify the training procedure, which allows us to train the model in one single stage. Thirdly, we show that the current state-of-the-art methodology produces models that do not generalize well to variations on the sequence length, and we addresses this issue by proposing a variable-length augmentation. We present results on the largest publicly-available datasets for isolated word recognition in English and Mandarin, LRW and LRW1000, respectively. Our proposed model results in an absolute improvement of 1.2% and 3.2%, respectively, in these datasets which is the new state-of-the-art performance.

PDF Abstract

Code

Add Remove Mark official

mpc001/Lipreading_using_Temporal_Co…

362

Yondijr/FlowerPower

Tasks

Add Remove

Lipreading

Lip Reading

Datasets

LRW

CAS-VSR-W1k (LRW-1000)

Results from the Paper

Edit

Ranked #7 on Lipreading on CAS-VSR-W1k (LRW-1000)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Lipreading	CAS-VSR-W1k (LRW-1000)	3D Conv + ResNet-18 + MS-TCN	Top-1 Accuracy	41.4%	# 7		Compare
Lipreading	Lip Reading in the Wild	3D Conv + ResNet-18 + MS-TCN	Top-1 Accuracy	85.30	# 9		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Lipreading using Temporal Convolutional Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove