Action Segmentation

72 papers with code • 9 benchmarks • 16 datasets

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video by time and label each segmented part with one of pre-defined action labels. The results of Action Segmentation can be further used as input to various applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation

Benchmarks

Add a Result

These leaderboards are used to track progress in Action Segmentation

Dataset	Best Model	Compare
Breakfast	AdaFocus (newly extracted I3D-features, LT-Context model)	See all
50 Salads	Br-Prompt+ASPnet (RGB, flow, accelerometer)	See all
GTEA	Semantic2Graph	See all
COIN	UnLoc-L	See all
JIGSAWS	MRG-Net	See all
Assembly101	LTContext	See all
Youtube INRIA Instructional	TSA (FINCH)	See all
50Salads	EUT	See all
MPII Cooking 2 Dataset	Unsup. TW-FINCH (K=avg/activity)	See all

Libraries

Use these libraries to find Action Segmentation models and implementations

pytorch/fairseq

2 papers

29,251

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Temporal Action Segmentation: An Analysis of Modern Techniques

atlas-eccv22/awesome-temporal-action-segmentation • 19 Oct 2022

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in minutes-long videos with multiple action classes.

Paper
Code

Temporal Convolutional Networks: A Unified Approach to Action Segmentation

Around-30/Kaggle • 29 Aug 2016

The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally, and second, input these features into a classifier that captures high-level temporal relationships, such as a Recurrent Neural Network (RNN).

Paper
Code

Weakly Supervised Action Learning with RNN based Fine-to-coarse Modeling

alexanderrichard/squirrel • CVPR 2017

We present an approach for weakly supervised learning of human actions.

Paper
Code

Action Sets: Weakly Supervised Action Segmentation without Ordering Constraints

alexanderrichard/action-sets • • CVPR 2018

Action detection and temporal segmentation of actions in videos are topics of increasing interest.

Paper
Code

Temporal Human Action Segmentation via Dynamic Clustering

yz-cnsdqz/dynamic_clustering • 15 Mar 2018

We present an effective dynamic clustering algorithm for the task of temporal human action segmentation, which has comprehensive applications such as robotics, motion analysis, and patient monitoring.

Paper
Code

Actor and Action Video Segmentation from a Sentence

JerryX1110/awesome-rvos • CVPR 2018

This paper strives for pixel-level segmentation of actors and their actions in video content.

Paper
Code

Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment

ld-ing/TCFPN-ISBA • • CVPR 2018

In this work, we address the task of weakly-supervised human action segmentation in long, untrimmed videos.

Paper
Code

Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification

Finspire13/RL-Surgical-Gesture-Segmentation • • 21 Jun 2018

Recognition of surgical gesture is crucial for surgical skill assessment and efficient surgery training.

Paper
Code

Toward Ergonomic Risk Prediction via Segmentation of Indoor Object Manipulation Actions Using Spatiotemporal Convolutional Networks

BehnooshParsa/HumanActionRecognition_with_ErgonomicRisk • • 14 Feb 2019

Automated real-time prediction of the ergonomic risks of manipulating objects is a key unsolved challenge in developing effective human-robot collaboration systems for logistics and manufacturing applications.

Paper
Code

Fast Weakly Supervised Action Segmentation Using Mutual Consistency

yassersouri/MuCon • • 5 Apr 2019

Action segmentation is the task of predicting the actions for each frame of a video.

Paper
Code

Action Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result