Action Segmentation

72 papers with code • 9 benchmarks • 16 datasets

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video by time and label each segmented part with one of pre-defined action labels. The results of Action Segmentation can be further used as input to various applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation

Benchmarks

Add a Result

These leaderboards are used to track progress in Action Segmentation

Dataset	Best Model	Compare
Breakfast	AdaFocus (newly extracted I3D-features, LT-Context model)	See all
50 Salads	Br-Prompt+ASPnet (RGB, flow, accelerometer)	See all
GTEA	Semantic2Graph	See all
COIN	UnLoc-L	See all
JIGSAWS	MRG-Net	See all
Assembly101	LTContext	See all
Youtube INRIA Instructional	TSA (FINCH)	See all
50Salads	EUT	See all
MPII Cooking 2 Dataset	Unsup. TW-FINCH (K=avg/activity)	See all

Libraries

Use these libraries to find Action Segmentation models and implementations

pytorch/fairseq

2 papers

29,273

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Temporal Action Segmentation

no code yet • 19 Dec 2023

However, learning the representation of each frame by unsupervised contrastive learning for action segmentation remains an open and challenging problem.

Paper
Add Code

X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer

no code yet • 12 Dec 2023

The field of 4D point cloud understanding is rapidly developing with the goal of analyzing dynamic 3D point cloud sequences.

Paper
Add Code

AdaFocus: Towards End-to-end Weakly Supervised Learning for Long-Video Action Understanding

no code yet • 28 Nov 2023

Under the weak supervision setting, action labels are provided for the whole video without precise start and end times of the action clip.

Paper
Add Code

CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships

no code yet • 21 Nov 2023

CASR works out by reducing the difference in the causal adjacency matrix between we constructed and pre-segmentation results of backbone models.

Paper
Add Code

NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding

no code yet • 12 Oct 2023

We integrate NSM4D with state-of-the-art 4D perception backbones, demonstrating significant improvements on various online perception benchmarks in indoor and outdoor settings.

Paper
Add Code

Action Segmentation Using 2D Skeleton Heatmaps and Multi-Modality Fusion

no code yet • 12 Sep 2023

This paper presents a 2D skeleton-based action segmentation method with applications in fine-grained human activity recognition.

Paper
Add Code

Prompt-enhanced Hierarchical Transformer Elevating Cardiopulmonary Resuscitation Instruction via Temporal Action Segmentation

no code yet • 31 Aug 2023

The vast majority of people who suffer unexpected cardiac arrest are performed cardiopulmonary resuscitation (CPR) by passersby in a desperate attempt to restore life, but endeavors turn out to be fruitless on account of disqualification.

Paper
Add Code

LAC: Latent Action Composition for Skeleton-based Action Segmentation

no code yet • 28 Aug 2023

In this context, we propose Latent Action Composition (LAC), a novel self-supervised framework aiming at learning from synthesized composable motions for skeleton-based action segmentation.

Paper
Add Code

BIT: Bi-Level Temporal Modeling for Efficient Supervised Action Segmentation

no code yet • 28 Aug 2023

We address the task of supervised action segmentation which aims to partition a video into non-overlapping segments, each representing a different action.

Paper
Add Code

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation

no code yet • 31 Jul 2023

The proposed method, named Mixture of Depth and Point cloud video experts (DPMix), achieved the first place in the 4D Action Segmentation Track of the HOI4D Challenge 2023.

Paper
Add Code

Action Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result