TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Skeleton Based Action Recognition	NTU RGB+D	Action Capsules	Accuracy (CV)	96.3	# 30
Skeleton Based Action Recognition	NTU RGB+D	Action Capsules	Accuracy (CS)	90	# 41
Skeleton Based Action Recognition	N-UCLA	Action Capsules	Accuracy	97.3	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/action-capsules-human-skeleton-action/skeleton-based-action-recognition-on-n-ucla)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-n-ucla?p=action-capsules-human-skeleton-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/action-capsules-human-skeleton-action/skeleton-based-action-recognition-on-ntu-rgbd)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-ntu-rgbd?p=action-capsules-human-skeleton-action)`

Action Capsules: Human Skeleton Action Recognition

30 Jan 2023 · Ali Farajzadeh Bavil, Hamed Damirchi, Hamid D. Taghirad ·

Due to the compact and rich high-level representations offered, skeleton-based human action recognition has recently become a highly active research topic. Previous studies have demonstrated that investigating joint relationships in spatial and temporal dimensions provides effective information critical to action recognition. However, effectively encoding global dependencies of joints during spatio-temporal feature extraction is still challenging. In this paper, we introduce Action Capsule which identifies action-related key joints by considering the latent correlation of joints in a skeleton sequence. We show that, during inference, our end-to-end network pays attention to a set of joints specific to each action, whose encoded spatio-temporal features are aggregated to recognize the action. Additionally, the use of multiple stages of action capsules enhances the ability of the network to classify similar actions. Consequently, our network outperforms the state-of-the-art approaches on the N-UCLA dataset and obtains competitive results on the NTURGBD dataset. This is while our approach has significantly lower computational requirements based on GFLOPs measurements.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Recognition

Skeleton Based Action Recognition

Temporal Action Localization

Datasets

NTU RGB+D N-UCLA

Results from the Paper

Edit

Ranked #4 on Skeleton Based Action Recognition on N-UCLA

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Skeleton Based Action Recognition	NTU RGB+D	Action Capsules	Accuracy (CV)	96.3	# 30	Compare
Skeleton Based Action Recognition	NTU RGB+D	Action Capsules	Accuracy (CS)	90	# 41	Compare
Skeleton Based Action Recognition	N-UCLA	Action Capsules	Accuracy	97.3	# 4	Compare

Methods

Add Remove

Capsule Network

Edit Social Preview

Action Capsules: Human Skeleton Action Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove