Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition

8 Feb 2020 · Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li

Despite the recent success of deep learning in continuous sign language recognition (CSLR), deep models typically focus on the most discriminative features, ignoring other potentially non-trivial and informative content. This characteristic heavily constrains their ability to learn the implicit visual grammar behind the collaboration of different visual cues (i.e., hand shape, facial expression and body posture). By injecting multi-cue learning into neural network design, we propose a spatial-temporal multi-cue (STMC) network to solve the vision-based sequence learning problem. Our STMC network consists of a spatial multi-cue (SMC) module and a temporal multi-cue (TMC) module. The SMC module is dedicated to spatial representation and explicitly decomposes visual features of different cues with the aid of a self-contained pose estimation branch. The TMC module models temporal correlations along two parallel paths, i.e., intra-cue and inter-cue, which aims to preserve the uniqueness of each cue and explore the collaboration of multiple cues. Finally, we design a joint optimization strategy to achieve end-to-end sequence learning of the STMC network. To validate the effectiveness of our method, we perform experiments on three large-scale CSLR benchmarks: PHOENIX-2014, CSL and PHOENIX-2014-T. Experimental results demonstrate that the proposed method achieves new state-of-the-art performance on all three benchmarks.
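To make the two-module design concrete, below is a minimal PyTorch sketch of the SMC/TMC decomposition described in the abstract. It is not the authors' implementation: all layer sizes, cue names and the vocabulary size are illustrative, and the paper's pose-estimation branch (which localizes hand and face regions) is stubbed out with parallel linear heads over a shared backbone feature.

```python
# Hypothetical sketch of the STMC structure (not the authors' code).
import torch
import torch.nn as nn

class SMC(nn.Module):
    """Spatial multi-cue module: one feature vector per cue per frame."""
    def __init__(self, cues=("full", "hand", "face", "pose"), dim=256):
        super().__init__()
        # Stand-in for a CNN backbone; the paper decomposes cues via a
        # self-contained pose-estimation branch, simplified away here.
        self.backbone = nn.Conv2d(3, dim, kernel_size=7, stride=4, padding=3)
        self.heads = nn.ModuleDict({c: nn.Linear(dim, dim) for c in cues})

    def forward(self, frames):                      # frames: (B, T, 3, H, W)
        b, t = frames.shape[:2]
        feat = self.backbone(frames.flatten(0, 1))  # (B*T, dim, h, w)
        feat = feat.mean(dim=(2, 3)).view(b, t, -1) # global average pooling
        return {c: head(feat) for c, head in self.heads.items()}

class TMC(nn.Module):
    """Temporal multi-cue module: parallel intra-cue and inter-cue paths."""
    def __init__(self, cues=("full", "hand", "face", "pose"), dim=256):
        super().__init__()
        # Intra-cue path: a temporal conv per cue preserves each cue's uniqueness.
        self.intra = nn.ModuleDict(
            {c: nn.Conv1d(dim, dim, kernel_size=5, padding=2) for c in cues})
        # Inter-cue path: a temporal conv over the concatenated cues
        # models their collaboration.
        self.inter = nn.Conv1d(dim * len(cues), dim, kernel_size=5, padding=2)

    def forward(self, cue_feats):                   # dict of (B, T, dim)
        intra = {c: conv(cue_feats[c].transpose(1, 2)).transpose(1, 2)
                 for c, conv in self.intra.items()}
        stacked = torch.cat([intra[c] for c in self.intra], dim=-1)
        inter = self.inter(stacked.transpose(1, 2)).transpose(1, 2)
        return intra, inter

class STMC(nn.Module):
    """SMC + TMC followed by a BiLSTM and a per-frame gloss classifier."""
    def __init__(self, vocab_size=1000, dim=256):   # vocab_size is illustrative
        super().__init__()
        self.smc, self.tmc = SMC(dim=dim), TMC(dim=dim)
        self.bilstm = nn.LSTM(dim, dim // 2, bidirectional=True, batch_first=True)
        self.classifier = nn.Linear(dim, vocab_size)

    def forward(self, frames):
        _, inter = self.tmc(self.smc(frames))
        out, _ = self.bilstm(inter)
        return self.classifier(out).log_softmax(-1) # log-probs for a CTC loss

# Usage: logits = STMC()(torch.randn(2, 16, 3, 112, 112))  -> (2, 16, 1000)
```

In the paper's joint optimization strategy, CTC-style sequence losses supervise both the intra-cue and inter-cue paths and are trained jointly end to end; the sketch above decodes only the inter-cue path for brevity.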

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Sign Language Recognition | RWTH-PHOENIX-Weather 2014 | STMC | Word Error Rate (WER) | 20.7 | #7 |
| Sign Language Recognition | RWTH-PHOENIX-Weather 2014 T | STMC | Word Error Rate (WER) | 21.0 | #4 |
