Action Segmentation

72 papers with code • 9 benchmarks • 16 datasets

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video by time and label each segmented part with one of pre-defined action labels. The results of Action Segmentation can be further used as input to various applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation

Benchmarks

Add a Result

These leaderboards are used to track progress in Action Segmentation

Dataset	Best Model	Compare
Breakfast	AdaFocus (newly extracted I3D-features, LT-Context model)	See all
50 Salads	Br-Prompt+ASPnet (RGB, flow, accelerometer)	See all
GTEA	Semantic2Graph	See all
COIN	UnLoc-L	See all
JIGSAWS	MRG-Net	See all
Assembly101	LTContext	See all
Youtube INRIA Instructional	TSA (FINCH)	See all
50Salads	EUT	See all
MPII Cooking 2 Dataset	Unsup. TW-FINCH (K=avg/activity)	See all

Libraries

Use these libraries to find Action Segmentation models and implementations

pytorch/fairseq

2 papers

29,192

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

LOGO: A Long-Form Video Dataset for Group Action Quality Assessment

faceonlive/ai-research • CVPR 2023

Action quality assessment (AQA) has become an emerging topic since it can be extensively applied in numerous scenarios.

124

07 Apr 2024

Paper
Code

Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation

mingu6/action_seg_ot • • 1 Apr 2024

We evaluate our segmentation approach and unsupervised learning pipeline on the Breakfast, 50-Salads, YouTube Instructions and Desktop Assembly datasets, yielding state-of-the-art results for the unsupervised video action segmentation task.

01 Apr 2024

Paper
Code

Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment

isee-laboratory/cvpr24_atba • • 28 Mar 2024

Weakly-supervised action segmentation is a task of learning to partition a long video into several action segments, where training videos are only accompanied by transcripts (ordered list of actions).

28 Mar 2024

Paper
Code

Multi-granularity Correspondence Learning from Long-term Noisy Videos

XLearning-SCU/2024-ICLR-Norton • • 30 Jan 2024

Existing video-language studies mainly focus on learning short video clips, leaving long-term temporal dependencies rarely explored due to over-high computational cost of modeling long videos.

30 Jan 2024

Paper
Code

A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation

lyhisme/dest • • 10 Dec 2023

Effectively modeling discriminative spatio-temporal information is essential for segmenting activities in long action sequences.

10 Dec 2023

Paper
Code

Activity Grammars for Temporal Action Segmentation

gongda0e/kari • NeurIPS 2023

Sequence prediction on temporal data requires the ability to understand compositional structures of multi-level semantics beyond individual and contextual properties.

07 Dec 2023

Paper
Code

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs

fpv-iplab/synchronization-is-all-you-need • 5 Dec 2023

Instead, we propose a novel methodology which performs the adaptation leveraging existing labeled exocentric videos and a new set of unlabeled, synchronized exocentric-egocentric video pairs, for which temporal action segmentation annotations do not need to be collected.

05 Dec 2023

Paper
Code

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Human Action Segmentation

liuqi-creat/sigformer • • 29 Nov 2023

Nowadays, the majority of approaches concentrate on the fusion of dense signals (i. e., RGB, optical flow, and depth maps).

29 Nov 2023

Paper
Code

Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union Learning

fandulu/DD-Net • • IROS 2023 2023

To alleviate this issue, we proposed a novel learning pattern in our training stage, which maximizes the probability of action union of surrounding timestamps for unlabeled frames.

253

22 Oct 2023

Paper
Code

End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning

Thinksky5124/SVTAS • • 27 Sep 2023

The end-to-end SVTAS which regard TAS as an action segment clustering task can expand the application scenarios of TAS; and RL is used to alleviate the problem of inconsistent optimization objective and direction.

27 Sep 2023

Paper
Code

Action Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result