Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition

21 May 2023 · Haojun Xu, Yan Gao, Zheng Hui, Jie Li, Xinbo Gao

How humans understand and recognize the actions of others is a complex neuroscientific problem involving a combination of cognitive mechanisms and neural networks. Research has shown that the human brain contains areas for action recognition that process top-down attentional information, such as the temporoparietal association area, as well as regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe. Skeleton-based action recognition learns a mapping from complex human skeleton movement patterns to behavior categories. Although existing studies encode meaningful node relationships and synthesize action representations for classification with good results, few of them incorporate a priori knowledge to aid the learning of latent representations for better performance. LA-GCN is a graph convolutional network assisted by knowledge from large-scale language models (LLMs). First, the LLM knowledge is mapped into an a priori global relationship (GPR) topology and an a priori category relationship (CPR) topology between nodes. The GPR topology guides the generation of new "bone" representations, emphasizing essential node information at the data level. The CPR topology simulates category prior knowledge in human brain regions; it is encoded by the proposed PC-AC module and used as additional supervision, forcing the model to learn class-distinguishable features. In addition, to improve the efficiency of information transfer in topology modeling, we propose a multi-hop attention graph convolution that aggregates each node's k-order neighbors simultaneously, speeding up model convergence. LA-GCN reaches state-of-the-art performance on the NTU RGB+D, NTU RGB+D 120, and NW-UCLA datasets.
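
Since the abstract only sketches how the LLM-derived prior topology and the multi-hop attention graph convolution work, the following is a minimal PyTorch sketch of both ideas, not the authors' implementation. It assumes hypothetical joint-name text embeddings (`joint_text_emb`), builds a GPR-style prior adjacency from their cosine similarities, and then mixes 1..K-hop neighborhoods of that adjacency with learned per-hop attention weights inside a single layer.

```python
# Minimal sketch (not the paper's code) of two ideas from the abstract, assuming
# PyTorch. X is a (N, C, T, V) skeleton tensor with V joints; `joint_text_emb`
# (V, D) stands in for LLM/text-encoder embeddings of the joint names.

import torch
import torch.nn as nn
import torch.nn.functional as F


def gpr_topology(joint_text_emb: torch.Tensor) -> torch.Tensor:
    """Build a global prior relationship (GPR) matrix from text embeddings:
    cosine similarity between joint-name embeddings, softmax-normalized per row
    so it can be used like a soft adjacency matrix."""
    e = F.normalize(joint_text_emb, dim=-1)   # (V, D)
    sim = e @ e.t()                           # (V, V) cosine similarities
    return F.softmax(sim, dim=-1)             # soft prior adjacency


class MultiHopAttentionGC(nn.Module):
    """Graph convolution that aggregates 1..K-hop neighborhoods in one layer,
    weighting each hop with a learned attention score (a sketch of the
    multi-hop attention idea, not the paper's exact layer)."""

    def __init__(self, in_channels: int, out_channels: int, A: torch.Tensor, k_hops: int = 3):
        super().__init__()
        # Precompute adjacency powers A^1, ..., A^K (k-order neighborhoods).
        hops = [torch.linalg.matrix_power(A, k + 1) for k in range(k_hops)]
        self.register_buffer("hops", torch.stack(hops))     # (K, V, V)
        self.hop_attn = nn.Parameter(torch.zeros(k_hops))   # per-hop attention logits
        self.proj = nn.Conv2d(in_channels, out_channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (N, C, T, V)
        w = torch.softmax(self.hop_attn, dim=0)                   # (K,)
        agg = torch.einsum("k,kuv,nctv->nctu", w, self.hops, x)   # all hops at once
        return self.proj(agg)                                     # (N, C_out, T, V)


if __name__ == "__main__":
    V, D = 25, 768                        # e.g. NTU RGB+D joint count, BERT-sized embeddings
    joint_text_emb = torch.randn(V, D)    # placeholder for real LLM joint-name embeddings
    A = gpr_topology(joint_text_emb)
    layer = MultiHopAttentionGC(3, 64, A, k_hops=3)
    out = layer(torch.randn(2, 3, 64, V))
    print(out.shape)                      # torch.Size([2, 64, 64, 25])
```

Aggregating all adjacency powers in one weighted sum is what lets a single layer reach each node's k-order neighbors simultaneously, which is the convergence argument the abstract makes for multi-hop attention graph convolution.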


Results from the Paper


Task                               Dataset        Model   Metric Name               Metric Value  Global Rank
Skeleton Based Action Recognition  NTU RGB+D      LA-GCN  Accuracy (CV)             97.2          #9
Skeleton Based Action Recognition  NTU RGB+D      LA-GCN  Accuracy (CS)             93.5          #4
Skeleton Based Action Recognition  NTU RGB+D      LA-GCN  Ensembled Modalities      6             #17
Skeleton Based Action Recognition  NTU RGB+D 120  LA-GCN  Accuracy (Cross-Subject)  90.7          #2
Skeleton Based Action Recognition  NTU RGB+D 120  LA-GCN  Accuracy (Cross-Setup)    91.8          #2
Skeleton Based Action Recognition  NTU RGB+D 120  LA-GCN  Ensembled Modalities      6             #18
Skeleton Based Action Recognition  N-UCLA         LA-GCN  Accuracy                  97.6          #2
