STEP: Spatio-Temporal Progressive Learning for Video Action Detection

CVPR 2019 Xitong YangXiaodong YangMing-Yu LiuFanyi XiaoLarry DavisJan Kautz

In this paper, we propose Spatio-TEmporal Progressive (STEP) action detector---a progressive learning framework for spatio-temporal action detection in videos. Starting from a handful of coarse-scale proposal cuboids, our approach progressively refines the proposals towards actions over a few steps... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Action Detection UCF101-24 STEP Video-mAP 0.2 76.6 # 2
Video-mAP 0.1 83.1 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet