Action Segmentation
73 papers with code • 9 benchmarks • 16 datasets
Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to partition a temporally untrimmed video along the time axis and label each segment with one of a set of pre-defined action labels. The results of Action Segmentation can then serve as input to downstream applications such as video-to-text generation and action localization.
Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation
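The task definition above reduces, in its simplest form, to assigning a label to every frame and then grouping consecutive frames with the same label into segments. A minimal sketch of that grouping step (the label names and the helper function are illustrative, not taken from any of the papers below):

```python
def frames_to_segments(frame_labels):
    """Collapse a per-frame label sequence into (start, end, label) segments,
    with `end` exclusive."""
    segments = []
    start = 0
    for i in range(1, len(frame_labels) + 1):
        # Close the current segment at the end of the sequence or on a label change.
        if i == len(frame_labels) or frame_labels[i] != frame_labels[start]:
            segments.append((start, i, frame_labels[start]))
            start = i
    return segments

# Hypothetical frame-wise predictions for a short cooking clip:
labels = ["pour", "pour", "stir", "stir", "stir", "pour"]
print(frames_to_segments(labels))
# → [(0, 2, 'pour'), (2, 5, 'stir'), (5, 6, 'pour')]
```

Evaluation metrics for the task (segmental F1, edit score) are typically computed on exactly this segment representation rather than on raw frame labels.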
Latest papers
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
This paper introduces a unified framework for video action segmentation via sequence-to-sequence (seq2seq) translation in a fully and timestamp supervised setup.
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
Our search scheme exploits both global search, to find coarse combinations, and local search, to further refine the receptive field combinations.
Do we really need temporal convolutions in action segmentation?
Most state-of-the-art methods focus on designing temporal convolution-based models, but the inflexibility of temporal convolutions and the difficulties in modeling long-term temporal dependencies restrict the potential of these models.
Cross-Enhancement Transformer for Action Segmentation
Temporal convolutions have been the paradigm of choice in action segmentation; they enlarge long-term receptive fields by stacking more convolution layers.
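Several of the papers here rest on the same observation: stacking temporal convolution layers with exponentially increasing dilation makes the receptive field grow exponentially with depth. A minimal sketch of that arithmetic (kernel size and dilation schedule are generic assumptions, not tied to a particular model):

```python
def receptive_field(num_layers, kernel_size=3):
    """Receptive field of a stack of 1-D convolutions where layer l
    uses dilation 2**l (a common schedule in temporal conv models)."""
    rf = 1
    for layer in range(num_layers):
        dilation = 2 ** layer
        # Each layer adds (kernel_size - 1) * dilation frames of context.
        rf += (kernel_size - 1) * dilation
    return rf

for depth in (1, 5, 10):
    print(depth, receptive_field(depth))
# → 1 3
#   5 63
#   10 2047
```

With kernel size 3 the receptive field is 2^(L+1) - 1 frames after L layers, which is why covering minute-long videos pushes these models toward many stacked layers, the inflexibility the papers above criticize.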
Temporal Alignment Networks for Long-term Video
The objective of this paper is a temporal alignment network that ingests long-term video sequences and associated text sentences in order to: (1) determine if a sentence is alignable with the video; and (2) if it is alignable, determine its alignment.
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles.
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
The generated text prompts are paired with corresponding video clips, and together co-train the text encoder and the video encoder via a contrastive approach.
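Co-training two encoders contrastively typically means scoring every text embedding against every video embedding in a batch and treating the matching pairs as positives. A hedged sketch of a symmetric InfoNCE-style loss of that kind (the shapes, temperature, and function name are illustrative assumptions, not Bridge-Prompt's actual implementation):

```python
import numpy as np

def contrastive_loss(text_emb, video_emb, temperature=0.07):
    """Symmetric cross-entropy over the text-video similarity matrix.
    Row i of each input is the embedding of pair i, so matching
    text/video pairs sit on the diagonal."""
    # L2-normalise so dot products are cosine similarities.
    t = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    v = video_emb / np.linalg.norm(video_emb, axis=1, keepdims=True)
    logits = t @ v.T / temperature            # (N, N) similarity matrix
    targets = np.arange(len(logits))          # positives on the diagonal

    def xent(l):
        # Numerically stable log-softmax per row.
        l = l - l.max(axis=1, keepdims=True)
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -logp[targets, targets].mean()

    # Average the text-to-video and video-to-text directions.
    return 0.5 * (xent(logits) + xent(logits.T))
```

When the paired embeddings already agree (e.g. identical orthonormal rows), the loss is near zero; mismatched pairs push it up, which is the gradient signal that co-trains both encoders.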
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the research of category-level human-object interaction.
Skeleton-Based Action Segmentation with Multi-Stage Spatial-Temporal Graph Convolutional Neural Networks
State-of-the-art action segmentation approaches use multiple stages of temporal convolutions.
Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order Consistency
We address the problem of set-supervised action learning, whose goal is to learn an action segmentation model using weak supervision in the form of sets of actions occurring in training videos.