A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition

24 Apr 2019  ·  Yanli Ji, Feixiang Xu, Yang Yang, Fumin Shen, Heng Tao Shen, Wei-Shi Zheng

Current research on action recognition mainly focuses on single-view and multi-view recognition, which can hardly satisfy the requirements of human-robot interaction (HRI) applications, where actions must be recognized from arbitrary views. The lack of suitable datasets is a further barrier. To provide data for arbitrary-view action recognition, we collect a new large-scale RGB-D action dataset for arbitrary-view action analysis, including RGB videos, depth sequences, and skeleton sequences. The dataset contains action samples captured from 8 fixed viewpoints, as well as varying-view sequences that cover the entire 360-degree range of view angles. In total, 118 participants perform 40 action categories, yielding 25,600 video samples. Our dataset involves more participants and more viewpoints than existing datasets, along with a large number of samples. More importantly, it is the first dataset containing full 360-degree varying-view sequences. The dataset provides sufficient data for multi-view, cross-view, and arbitrary-view action analysis. In addition, we propose a View-guided Skeleton CNN (VS-CNN) to tackle the problem of arbitrary-view action recognition. Experimental results show that the VS-CNN achieves superior performance.
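Since this page includes no code, the following is a minimal PyTorch sketch of the view-guided idea the abstract describes: a shared skeleton feature extractor whose output is routed through per-view classifier branches gated by a predicted viewpoint. The tensor layout (batch, 3 coordinates, frames, joints), the 25-joint skeleton, and the soft view-gating scheme are assumptions for illustration, not the paper's exact VS-CNN architecture.

    # Minimal sketch of a view-guided skeleton classifier (NOT the paper's
    # exact VS-CNN). Assumes skeleton clips arrive as (batch, 3, frames,
    # joints) tensors and that 8 candidate viewpoints gate 8 branches.
    import torch
    import torch.nn as nn

    class ViewGuidedSkeletonCNN(nn.Module):
        def __init__(self, num_classes=40, num_views=8):
            super().__init__()
            # Shared convolutional feature extractor over (time, joint) maps.
            self.features = nn.Sequential(
                nn.Conv2d(3, 32, kernel_size=3, padding=1),
                nn.ReLU(inplace=True),
                nn.MaxPool2d(2),
                nn.Conv2d(32, 64, kernel_size=3, padding=1),
                nn.ReLU(inplace=True),
                nn.AdaptiveAvgPool2d(1),
            )
            # Auxiliary head predicts the viewpoint; its softmax output
            # gates one view-specific classifier branch per candidate view.
            self.view_head = nn.Linear(64, num_views)
            self.branch_heads = nn.ModuleList(
                [nn.Linear(64, num_classes) for _ in range(num_views)]
            )

        def forward(self, x):
            feat = self.features(x).flatten(1)          # (batch, 64)
            view_logits = self.view_head(feat)          # (batch, num_views)
            view_weights = view_logits.softmax(dim=1)   # soft view assignment
            branch_logits = torch.stack(
                [head(feat) for head in self.branch_heads], dim=1
            )                                           # (batch, views, classes)
            # Weighted sum of per-view classifier outputs.
            logits = (view_weights.unsqueeze(-1) * branch_logits).sum(dim=1)
            return logits, view_logits

    # Smoke test with random data shaped like a skeleton clip.
    model = ViewGuidedSkeletonCNN()
    clip = torch.randn(2, 3, 64, 25)  # 2 samples, 64 frames, 25 joints
    logits, view_logits = model(clip)
    print(logits.shape, view_logits.shape)  # (2, 40) and (2, 8)

The auxiliary view head can be trained jointly with a view-label loss, so the gating learns to specialize each branch to a viewpoint while the classifier loss flows through the weighted combination.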


Datasets


Introduced in the Paper:

UESTC RGB-D

Used in the Paper:

NTU RGB+D
Task: Skeleton Based Action Recognition
Dataset: Varying-view RGB-D Action-Skeleton
Model: VS-CNN

Metric            Value   Global Rank
Accuracy (CS)     76%     #1
Accuracy (CV I)   29%     #1
Accuracy (CV II)  71%     #1
Accuracy (AV I)   57%     #1
Accuracy (AV II)  75%     #2
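The metric abbreviations presumably correspond to the cross-subject (CS), cross-view (CV), and arbitrary-view (AV) evaluation protocols defined in the paper. Below is a minimal sketch of how such per-protocol accuracies reduce to single numbers; the `splits` dictionary and its toy label arrays are hypothetical placeholders, not benchmark data.

    # Hedged sketch: per-protocol accuracy from (true, predicted) label
    # arrays grouped by evaluation split. Toy data only.
    import numpy as np

    splits = {
        "CS":   (np.array([0, 1, 2, 1]), np.array([0, 1, 1, 1])),
        "CV I": (np.array([3, 3, 0]),    np.array([3, 2, 1])),
    }

    for name, (y_true, y_pred) in splits.items():
        acc = float((y_true == y_pred).mean())
        print(f"Accuracy ({name}): {acc:.0%}")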

Methods


No methods listed for this paper.