TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
One-Shot 3D Action Recognition	NTU RGB+D 120	Average Pooling	Accuracy	42.9%	# 6
Skeleton Based Action Recognition	NTU RGB+D 120	Internal Feature Fusion	Accuracy (Cross-Subject)	58.2%	# 67
Skeleton Based Action Recognition	NTU RGB+D 120	Internal Feature Fusion	Accuracy (Cross-Setup)	60.9%	# 63
Skeleton Based Action Recognition	SYSU 3D	ST-LSTM (Tree)	Accuracy	73.4%	# 9


Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates

26 Jun 2017 · Jun Liu, Amir Shahroudy, Dong Xu, Alex C. Kot, Gang Wang

Skeleton-based human action recognition has attracted a lot of research attention during the past few years. Recent works attempted to utilize recurrent neural networks to model the temporal dependencies between the 3D positional configurations of human body joints for better analysis of human activities in the skeletal data. The proposed work extends this idea to spatial domain as well as temporal domain to better analyze the hidden sources of action-related information within the human skeleton sequences in both of these domains simultaneously. Based on the pictorial structure of Kinect's skeletal data, an effective tree-structure based traversal framework is also proposed. In order to deal with the noise in the skeletal data, a new gating mechanism within LSTM module is introduced, with which the network can learn the reliability of the sequential data and accordingly adjust the effect of the input data on the updating procedure of the long-term context representation stored in the unit's memory cell. Moreover, we introduce a novel multi-modal feature fusion strategy within the LSTM unit in this paper. The comprehensive experimental results on seven challenging benchmark datasets for human action recognition demonstrate the effectiveness of the proposed method.
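The gating idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no equations, so the sketch assumes a Gaussian-shaped trust score computed from the gap between the observed input and a prediction of that input made from context, and it uses a single previous memory cell rather than separate spatial and temporal ones. The names `trust_gate`, `update_cell`, and the sharpness parameter `lam` are hypothetical.

```python
import math


def trust_gate(observed, predicted, lam=0.5):
    """Per-dimension trust in (0, 1].

    Assumption: trust is exp(-lam * error^2), so it equals 1 when the
    observed input matches the context-based prediction and decays as
    the two diverge (for example, when a joint position is noisy).
    """
    return [math.exp(-lam * (x - p) ** 2) for x, p in zip(observed, predicted)]


def update_cell(trust, input_gate, candidate, forget_gate, prev_cell):
    """Blend new input and stored memory, weighted by trust.

    High trust lets the gated candidate through; low trust falls back
    on the gated previous memory. Simplification: one previous cell only.
    """
    return [
        t * i * u + (1.0 - t) * f * c
        for t, i, u, f, c in zip(trust, input_gate, candidate, forget_gate, prev_cell)
    ]


if __name__ == "__main__":
    reliable = trust_gate([0.5], [0.5])  # observation agrees with prediction
    noisy = trust_gate([3.0], [0.5])     # observation far from prediction
    print("trust (reliable input):", reliable)
    print("trust (noisy input):", noisy)
    print("cell after noisy input:", update_cell(noisy, [1.0], [0.9], [1.0], [0.2]))
```

With this form, an unreliable input leaves the memory cell close to its previous value, which is the behaviour the abstract describes as adjusting the effect of the input on the long-term context.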

Code

No code implementations yet.

Tasks

Action Recognition

One-Shot 3D Action Recognition

Skeleton Based Action Recognition

Temporal Action Localization

Datasets

NTU RGB+D

NTU RGB+D 120

SBU

UT-Kinect

Results from the Paper

Ranked #6 on One-Shot 3D Action Recognition on NTU RGB+D 120

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Skeleton Based Action Recognition	NTU RGB+D 120	Internal Feature Fusion	Accuracy (Cross-Subject)	58.2%	# 67
Skeleton Based Action Recognition	NTU RGB+D 120	Internal Feature Fusion	Accuracy (Cross-Setup)	60.9%	# 63
Skeleton Based Action Recognition	SYSU 3D	ST-LSTM (Tree)	Accuracy	73.4%	# 9

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
One-Shot 3D Action Recognition	NTU RGB+D 120	Average Pooling	Accuracy	42.9%	# 6

Methods

LSTM • Sigmoid Activation • Tanh Activation
