TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Segmentation	50 Salads	UVAST	F1@10%	89.1	# 7
Action Segmentation	50 Salads	UVAST	Edit	83.9	# 5
Action Segmentation	50 Salads	UVAST	Acc	87.4	# 7
Action Segmentation	50 Salads	UVAST	F1@25%	87.6	# 7
Action Segmentation	50 Salads	UVAST	F1@50%	81.7	# 6
Action Segmentation	Assembly101	UVAST	MoF	37.4	# 4
Action Segmentation	Assembly101	UVAST	F1@10%	32.1	# 4
Action Segmentation	Assembly101	UVAST	F1@25%	28.3	# 4
Action Segmentation	Assembly101	UVAST	F1@50%	20.8	# 4
Action Segmentation	Assembly101	UVAST	Edit	31.5	# 2
Action Segmentation	Breakfast	UVAST	F1@10%	76.9	# 8
Action Segmentation	Breakfast	UVAST	F1@50%	58	# 9
Action Segmentation	Breakfast	UVAST	Acc	69.7	# 18
Action Segmentation	Breakfast	UVAST	Edit	77.1	# 6
Action Segmentation	Breakfast	UVAST	F1@25%	71.5	# 9
Action Segmentation	GTEA	UVAST	F1@10%	92.7	# 5
Action Segmentation	GTEA	UVAST	F1@50%	81	# 8
Action Segmentation	GTEA	UVAST	Acc	80.2	# 9
Action Segmentation	GTEA	UVAST	Edit	92.1	# 2
Action Segmentation	GTEA	UVAST	F1@25%	91.3	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unified-fully-and-timestamp-supervised/action-segmentation-on-assembly101)](https://paperswithcode.com/sota/action-segmentation-on-assembly101?p=unified-fully-and-timestamp-supervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unified-fully-and-timestamp-supervised/action-segmentation-on-50-salads-1)](https://paperswithcode.com/sota/action-segmentation-on-50-salads-1?p=unified-fully-and-timestamp-supervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unified-fully-and-timestamp-supervised/action-segmentation-on-gtea-1)](https://paperswithcode.com/sota/action-segmentation-on-gtea-1?p=unified-fully-and-timestamp-supervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unified-fully-and-timestamp-supervised/action-segmentation-on-breakfast-1)](https://paperswithcode.com/sota/action-segmentation-on-breakfast-1?p=unified-fully-and-timestamp-supervised)`

Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

1 Sep 2022 · Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Juergen Gall, Mehdi Noroozi ·

This paper introduces a unified framework for video action segmentation via sequence to sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to current state-of-the-art frame-level prediction methods, we view action segmentation as a seq2seq translation task, i.e., mapping a sequence of video frames to a sequence of action segments. Our proposed method involves a series of modifications and auxiliary loss functions on the standard Transformer seq2seq translation model to cope with long input sequences opposed to short output sequences and relatively few videos. We incorporate an auxiliary supervision signal for the encoder via a frame-wise loss and propose a separate alignment decoder for an implicit duration prediction. Finally, we extend our framework to the timestamp supervised setting via our proposed constrained k-medoids algorithm to generate pseudo-segmentations. Our proposed framework performs consistently on both fully and timestamp supervised settings, outperforming or competing state-of-the-art on several datasets. Our code is publicly available at https://github.com/boschresearch/UVAST.

PDF Abstract

Code

Add Remove Mark official

boschresearch/uvast official

boschresearch/UVAST

Tasks

Add Remove

Action Segmentation

Decoder

Translation

Datasets

Breakfast

GTEA

Assembly101 50 Salads

Results from the Paper

Edit

Ranked #4 on Action Segmentation on Assembly101

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Segmentation	50 Salads	UVAST	F1@10%	89.1	# 7	Compare
			Edit	83.9	# 5	Compare
			Acc	87.4	# 7	Compare
			F1@25%	87.6	# 7	Compare
			F1@50%	81.7	# 6	Compare
Action Segmentation	Assembly101	UVAST	MoF	37.4	# 4	Compare
			F1@10%	32.1	# 4	Compare
			F1@25%	28.3	# 4	Compare
			F1@50%	20.8	# 4	Compare
			Edit	31.5	# 2	Compare
Action Segmentation	Breakfast	UVAST	F1@10%	76.9	# 8	Compare
			F1@50%	58	# 9	Compare
			Acc	69.7	# 18	Compare
			Edit	77.1	# 6	Compare
			F1@25%	71.5	# 9	Compare
Action Segmentation	GTEA	UVAST	F1@10%	92.7	# 5	Compare
			F1@50%	81	# 8	Compare
			Acc	80.2	# 9	Compare
			Edit	92.1	# 2	Compare
			F1@25%	91.3	# 6	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • LSTM • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Seq2Seq • Sigmoid Activation • Softmax • Tanh Activation • Transformer

Edit Social Preview

Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove