Temporal Convolutional Networks for Action Segmentation and Detection

The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal features from video frames and then feeding them into a temporal classifier that captures high-level temporal patterns. We introduce a new class of temporal models, which we call Temporal Convolutional Networks (TCNs), that use a hierarchy of temporal convolutions to perform fine-grained action segmentation or detection. Our Encoder-Decoder TCN uses pooling and upsampling to efficiently capture long-range temporal patterns whereas our Dilated TCN uses dilated convolutions. We show that TCNs are capable of capturing action compositions, segment durations, and long-range dependencies, and are over a magnitude faster to train than competing LSTM-based Recurrent Neural Networks. We apply these models to three challenging fine-grained datasets and show large improvements over the state of the art.
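The two architectures named in the abstract can be summarized with a short sketch. The PyTorch code below is an illustrative reimplementation, not the authors' released code: the class names `EncoderDecoderTCN` and `DilatedTCN`, the channel widths, kernel sizes, and layer counts are assumptions chosen for brevity, and details such as the paper's normalized-ReLU activation and exact hyperparameters are omitted. Both models take per-frame feature vectors of shape (batch, features, time) and emit per-frame action logits.

```python
# Minimal sketch of the two TCN variants described in the abstract (PyTorch).
# Hyperparameters (channels, kernel sizes, depth) are illustrative assumptions,
# not the paper's settings.
import torch
import torch.nn as nn


class EncoderDecoderTCN(nn.Module):
    """Encoder-Decoder TCN: temporal convolutions with pooling on the way down
    and upsampling on the way up, so each output frame sees a long temporal context."""

    def __init__(self, in_dim, num_classes, channels=(64, 96), kernel_size=25):
        super().__init__()
        enc, dec, prev = [], [], in_dim
        for ch in channels:                       # encoder: conv -> ReLU -> pool
            enc += [nn.Conv1d(prev, ch, kernel_size, padding=kernel_size // 2),
                    nn.ReLU(),
                    nn.MaxPool1d(2)]
            prev = ch
        for ch in reversed(channels):             # decoder: upsample -> conv -> ReLU
            dec += [nn.Upsample(scale_factor=2),
                    nn.Conv1d(prev, ch, kernel_size, padding=kernel_size // 2),
                    nn.ReLU()]
            prev = ch
        self.encoder = nn.Sequential(*enc)
        self.decoder = nn.Sequential(*dec)
        self.classifier = nn.Conv1d(prev, num_classes, 1)  # per-frame logits

    def forward(self, x):                         # x: (batch, in_dim, T), T divisible by 4
        return self.classifier(self.decoder(self.encoder(x)))


class DilatedTCN(nn.Module):
    """Dilated TCN: stacked 1-D convolutions whose dilation doubles at each layer,
    growing the receptive field without any pooling."""

    def __init__(self, in_dim, num_classes, hidden=64, num_layers=4, kernel_size=3):
        super().__init__()
        layers, prev = [], in_dim
        for i in range(num_layers):
            d = 2 ** i                            # dilation 1, 2, 4, 8, ...
            layers += [nn.Conv1d(prev, hidden, kernel_size,
                                 dilation=d, padding=d * (kernel_size - 1) // 2),
                       nn.ReLU()]
            prev = hidden
        self.net = nn.Sequential(*layers)
        self.classifier = nn.Conv1d(hidden, num_classes, 1)

    def forward(self, x):                         # x: (batch, in_dim, T)
        return self.classifier(self.net(x))


if __name__ == "__main__":
    feats = torch.randn(2, 128, 64)                  # 2 clips, 128-dim frame features, 64 frames
    print(EncoderDecoderTCN(128, 11)(feats).shape)   # torch.Size([2, 11, 64])
    print(DilatedTCN(128, 11)(feats).shape)          # torch.Size([2, 11, 64])
```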

Results

Action Segmentation on GTEA (model: ED-TCN)
  F1@10%: 72.2 (global rank #24)
  F1@25%: 69.3 (global rank #24)
  F1@50%: 56.0 (global rank #24)
  Acc: 64.0 (global rank #24)
  Edit: not reported (global rank #24)

Skeleton Based Action Recognition on Varying-view RGB-D Action-Skeleton (model: TCN)
  Accuracy (CS): 56% (global rank #6)
  Accuracy (CV I): 16% (global rank #4)
  Accuracy (CV II): 43% (global rank #5)
  Accuracy (AV I): 43% (global rank #4)
  Accuracy (AV II): 64% (global rank #5)

Methods


No methods listed for this paper.