Semi-Supervised Temporal Action Detection with Proposal-Free Masking

14 Jul 2022 · Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Existing temporal action detection (TAD) methods rely on a large amount of training data with segment-level annotations. Collecting and annotating such a training set is thus highly expensive and unscalable. Semi-supervised TAD (SS-TAD) alleviates this problem by leveraging unlabeled videos that are freely available at scale. However, SS-TAD is also a much more challenging problem than supervised TAD, and consequently remains much under-studied. Prior SS-TAD methods directly combine an existing proposal-based TAD method with an SSL method. Due to their sequential localization (e.g., proposal generation) and classification design, they are prone to proposal error propagation. To overcome this limitation, in this work we propose a novel Semi-supervised Temporal action detection model based on PropOsal-free Temporal mask (SPOT) with a parallel localization (mask generation) and classification architecture. This design effectively eliminates the dependence between localization and classification by cutting off the error-propagation route between them. We further introduce an interaction mechanism between classification and localization for prediction refinement, and a new pretext task for self-supervised model pre-training. Extensive experiments on two standard benchmarks show that SPOT outperforms state-of-the-art alternatives, often by a large margin. The PyTorch implementation of SPOT is available at https://github.com/sauradip/SPOT
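To make the proposal-free, parallel design described in the abstract concrete, below is a minimal PyTorch sketch of the idea: a shared snippet encoder feeds a classification branch and a temporal-mask branch that run in parallel, so neither branch consumes the other's output. The module names, feature dimensions, and 1D-convolutional heads here are illustrative assumptions, not the official SPOT implementation (see the linked repository for that).

import torch
import torch.nn as nn


class ParallelMaskTAD(nn.Module):
    """Illustrative proposal-free detector: parallel classification and mask heads."""

    def __init__(self, feat_dim: int = 2048, embed_dim: int = 256, num_classes: int = 200):
        super().__init__()
        # Shared temporal encoder over pre-extracted snippet features of shape (B, feat_dim, T)
        self.encoder = nn.Sequential(
            nn.Conv1d(feat_dim, embed_dim, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv1d(embed_dim, embed_dim, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )
        # Classification branch: per-snippet class logits
        self.cls_head = nn.Conv1d(embed_dim, num_classes, kernel_size=1)
        # Localization branch: per-snippet foreground mask (no proposal generation stage)
        self.mask_head = nn.Conv1d(embed_dim, 1, kernel_size=1)

    def forward(self, snippet_feats: torch.Tensor):
        # snippet_feats: (B, feat_dim, T)
        z = self.encoder(snippet_feats)
        cls_logits = self.cls_head(z)               # (B, num_classes, T)
        fg_mask = torch.sigmoid(self.mask_head(z))  # (B, 1, T); 1 = inside an action
        return cls_logits, fg_mask


if __name__ == "__main__":
    model = ParallelMaskTAD()
    feats = torch.randn(2, 2048, 100)  # 2 videos, 100 snippets each
    cls_logits, fg_mask = model(feats)
    print(cls_logits.shape, fg_mask.shape)  # (2, 200, 100) and (2, 1, 100)

At inference, action segments could be recovered by thresholding the foreground mask and assigning each segment its top snippet-level class; because the two heads are trained side by side rather than sequentially, a localization error cannot propagate into classification, which is the property the abstract attributes to the parallel design.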

Task                               Dataset          Model                                Metric   Value   Global Rank
Semi-Supervised Action Detection   ActivityNet-1.3  SPOT (60% labeled, IoU thresh=0.5)   mAP      52.8    #1
Semi-Supervised Action Detection   THUMOS'14        SPOT (10% labeled, IoU thresh=0.3)   mAP      58.9    #1
