A Multigrid Method for Efficiently Training Video Models

CVPR 2020 Chao-Yuan WuRoss GirshickKaiming HeChristoph FeichtenhoferPhilipp Krähenbühl

Training competitive deep video models is an order of magnitude slower than training their counterpart image models. Slow training causes long research cycles, which hinders progress in video understanding research... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Video Classification Charades Multigrid mAP 38.2 # 1
Video Classification Kinetics Multigrid Top-1 77.6 # 1
Video Classification Kinetics-400 Multigrid Top-1 78.1 # 1
Action Recognition Something-Something V2 Multigrid Top-1 Accuracy 61.7 # 6

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet