TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Segmentation	50 Salads	MS-TCN++(sh)	F1@10%	78.7	# 23
Action Segmentation	50 Salads	MS-TCN++(sh)	Edit	70.7	# 23
Action Segmentation	50 Salads	MS-TCN++(sh)	Acc	82.2	# 20
Action Segmentation	50 Salads	MS-TCN++(sh)	F1@25%	76.6	# 22
Action Segmentation	50 Salads	MS-TCN++(sh)	F1@50%	68.3	# 22
Action Segmentation	50 Salads	MS-TCN++	F1@10%	80.7	# 20
Action Segmentation	50 Salads	MS-TCN++	Edit	74.3	# 19
Action Segmentation	50 Salads	MS-TCN++	Acc	83.7	# 17
Action Segmentation	50 Salads	MS-TCN++	F1@25%	78.5	# 20
Action Segmentation	50 Salads	MS-TCN++	F1@50%	70.1	# 20
Action Segmentation	Assembly101	MS-TCN++	MoF	37.1	# 5
Action Segmentation	Assembly101	MS-TCN++	F1@10%	31.6	# 5
Action Segmentation	Assembly101	MS-TCN++	F1@25%	27.8	# 5
Action Segmentation	Assembly101	MS-TCN++	F1@50%	20.6	# 5
Action Segmentation	Assembly101	MS-TCN++	Edit	30.7	# 3
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	F1@10%	63.3	# 25
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	F1@50%	44.5	# 25
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	Acc	67.3	# 25
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	Edit	64.9	# 25
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	F1@25%	57.7	# 25
Action Segmentation	Breakfast	MS-TCN++ (I3D)	F1@10%	64.1	# 24
Action Segmentation	Breakfast	MS-TCN++ (I3D)	F1@50%	45.9	# 24
Action Segmentation	Breakfast	MS-TCN++ (I3D)	Acc	67.6	# 23
Action Segmentation	Breakfast	MS-TCN++ (I3D)	Edit	65.6	# 24
Action Segmentation	Breakfast	MS-TCN++ (I3D)	F1@25%	58.6	# 24
Action Segmentation	GTEA	MS-TCN++	F1@10%	88.8	# 17
Action Segmentation	GTEA	MS-TCN++	F1@50%	76.0	# 17
Action Segmentation	GTEA	MS-TCN++	Acc	80.1	# 10
Action Segmentation	GTEA	MS-TCN++	Edit	83.5	# 20
Action Segmentation	GTEA	MS-TCN++	F1@25%	85.7	# 20
Action Segmentation	GTEA	MS-TCN++(sh)	F1@10%	88.2	# 20
Action Segmentation	GTEA	MS-TCN++(sh)	F1@50%	75.9	# 18
Action Segmentation	GTEA	MS-TCN++(sh)	Acc	79.7	# 14
Action Segmentation	GTEA	MS-TCN++(sh)	Edit	83.0	# 21
Action Segmentation	GTEA	MS-TCN++(sh)	F1@25%	86.2	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ms-tcn-multi-stage-temporal-convolutional-2/action-segmentation-on-assembly101)](https://paperswithcode.com/sota/action-segmentation-on-assembly101?p=ms-tcn-multi-stage-temporal-convolutional-2)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ms-tcn-multi-stage-temporal-convolutional-2/action-segmentation-on-gtea-1)](https://paperswithcode.com/sota/action-segmentation-on-gtea-1?p=ms-tcn-multi-stage-temporal-convolutional-2)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ms-tcn-multi-stage-temporal-convolutional-2/action-segmentation-on-50-salads-1)](https://paperswithcode.com/sota/action-segmentation-on-50-salads-1?p=ms-tcn-multi-stage-temporal-convolutional-2)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ms-tcn-multi-stage-temporal-convolutional-2/action-segmentation-on-breakfast-1)](https://paperswithcode.com/sota/action-segmentation-on-breakfast-1?p=ms-tcn-multi-stage-temporal-convolutional-2)`

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation

16 Jun 2020 · Shijie Li, Yazan Abu Farha, Yun Liu, Ming-Ming Cheng, Juergen Gall ·

With the success of deep learning in classifying short trimmed videos, more attention has been focused on temporally segmenting and classifying activities in long untrimmed videos. State-of-the-art approaches for action segmentation utilize several layers of temporal convolution and temporal pooling. Despite the capabilities of these approaches in capturing temporal dependencies, their predictions suffer from over-segmentation errors. In this paper, we propose a multi-stage architecture for the temporal action segmentation task that overcomes the limitations of the previous approaches. The first stage generates an initial prediction that is refined by the next ones. In each stage we stack several layers of dilated temporal convolutions covering a large receptive field with few parameters. While this architecture already performs well, lower layers still suffer from a small receptive field. To address this limitation, we propose a dual dilated layer that combines both large and small receptive fields. We further decouple the design of the first stage from the refining stages to address the different requirements of these stages. Extensive evaluation shows the effectiveness of the proposed model in capturing long-range dependencies and recognizing action segments. Our models achieve state-of-the-art results on three datasets: 50Salads, Georgia Tech Egocentric Activities (GTEA), and the Breakfast dataset.

PDF Abstract

Code

Add Remove Mark official

sj-li/MS-TCN2

133

Tasks

Add Remove

Action Segmentation

Segmentation

Datasets

Breakfast

GTEA

Assembly101 50 Salads

Results from the Paper

Edit

Ranked #5 on Action Segmentation on Assembly101

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Segmentation	50 Salads	MS-TCN++(sh)	F1@10%	78.7	# 23	Compare
			Edit	70.7	# 23	Compare
			Acc	82.2	# 20	Compare
			F1@25%	76.6	# 22	Compare
			F1@50%	68.3	# 22	Compare
Action Segmentation	50 Salads	MS-TCN++	F1@10%	80.7	# 20	Compare
			Edit	74.3	# 19	Compare
			Acc	83.7	# 17	Compare
			F1@25%	78.5	# 20	Compare
			F1@50%	70.1	# 20	Compare
Action Segmentation	Assembly101	MS-TCN++	MoF	37.1	# 5	Compare
			F1@10%	31.6	# 5	Compare
			F1@25%	27.8	# 5	Compare
			F1@50%	20.6	# 5	Compare
			Edit	30.7	# 3	Compare
Action Segmentation	Breakfast	MS-TCN++(I3D) (sh)	F1@10%	63.3	# 25	Compare
			F1@50%	44.5	# 25	Compare
			Acc	67.3	# 25	Compare
			Edit	64.9	# 25	Compare
			F1@25%	57.7	# 25	Compare
Action Segmentation	Breakfast	MS-TCN++ (I3D)	F1@10%	64.1	# 24	Compare
			F1@50%	45.9	# 24	Compare
			Acc	67.6	# 23	Compare
			Edit	65.6	# 24	Compare
			F1@25%	58.6	# 24	Compare
Action Segmentation	GTEA	MS-TCN++	F1@10%	88.8	# 17	Compare
			F1@50%	76.0	# 17	Compare
			Acc	80.1	# 10	Compare
			Edit	83.5	# 20	Compare
			F1@25%	85.7	# 20	Compare
Action Segmentation	GTEA	MS-TCN++(sh)	F1@10%	88.2	# 20	Compare
			F1@50%	75.9	# 18	Compare
			Acc	79.7	# 14	Compare
			Edit	83.0	# 21	Compare
			F1@25%	86.2	# 19	Compare

Methods

Add Remove

Convolution

Edit Social Preview

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove