Cross-Enhancement Transformer for Action Segmentation

19 May 2022  ·  Jiahui Wang, Zhenyou Wang, Shanna Zhuang, Hui Wang ·

Temporal convolutions have been the paradigm of choice in action segmentation, enlarging the long-term receptive field by stacking convolution layers. However, deep stacks lose the local information necessary for frame recognition. To address this problem, this paper proposes a novel encoder-decoder structure called the Cross-Enhancement Transformer. Our approach learns an effective temporal structure representation with an interactive self-attention mechanism: the convolutional feature maps from each encoder layer are concatenated with a set of decoder features produced via self-attention, so that local and global information are used simultaneously across a series of frame actions. In addition, a new loss function is proposed to enhance training by penalizing over-segmentation errors. Experiments show that our framework achieves state-of-the-art performance on three challenging datasets: 50Salads, Georgia Tech Egocentric Activities (GTEA), and the Breakfast dataset.
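The core idea above, fusing a convolution layer's local frame features with globally attended features by concatenation, can be sketched as follows. This is a minimal illustrative sketch in NumPy, not the paper's implementation; the function names, shapes, and single-head attention are assumptions for clarity.

```python
import numpy as np

def temporal_conv1d(x, w):
    """Same-padded 1D temporal convolution over frames (local context).
    x: (T, D) frame features; w: (k, D, D) kernel (k taps, D->D channels)."""
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    T, D = x.shape
    out = np.zeros((T, D))
    for t in range(T):
        for i in range(k):
            out[t] += xp[t + i] @ w[i]
    return np.maximum(out, 0.0)  # ReLU

def self_attention(x):
    """Scaled dot-product self-attention over all frames (global context)."""
    d = x.shape[1]
    scores = x @ x.T / np.sqrt(d)
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)      # softmax rows sum to 1
    return attn @ x

rng = np.random.default_rng(0)
T, D, k = 8, 4, 3                       # 8 frames, 4-dim features, kernel 3
x = rng.standard_normal((T, D))
w = rng.standard_normal((k, D, D)) * 0.1

local_feats = temporal_conv1d(x, w)     # local temporal patterns (encoder side)
global_feats = self_attention(x)        # long-range dependencies (decoder side)
fused = np.concatenate([local_feats, global_feats], axis=1)  # (T, 2D)
print(fused.shape)
```

Each frame in `fused` now carries both its local convolutional context and a globally attended summary, which is the combination the cross-enhancement design exploits.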


Results from the Paper


Task: Action Segmentation · Model: CETNet (metric value, global rank)

50 Salads:  F1@10% 87.6 (#10) · F1@25% 86.5 (#9) · F1@50% 80.1 (#9) · Edit 81.7 (#11) · Acc 86.9 (#10)
Breakfast:  F1@10% 79.3 (#4) · F1@25% 74.3 (#4) · F1@50% 61.9 (#5) · Edit 77.8 (#5) · Acc 74.9 (#8)
GTEA:       F1@10% 91.8 (#8) · F1@25% 91.2 (#7) · F1@50% 81.3 (#7) · Edit 87.9 (#8) · Acc 80.3 (#8)