Browse SoTA > Computer Vision > Object Detection > Video Object Detection

Video Object Detection

20 papers with code · Computer Vision
Subtask of Object Detection

Video object detection is the task of detecting objects from a video as opposed to images.

( Image credit: Learning Motion Priors for Efficient Video Object Detection )

Benchmarks

Greatest papers with code

Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection

CVPR 2020 tensorflow/models

In this paper we propose a method that leverages temporal context from the unlabeled frames of a novel camera to improve performance at that camera.

VIDEO OBJECT DETECTION VIDEO UNDERSTANDING

Mobile Video Object Detection with Temporally-Aware Feature Maps

CVPR 2018 tensorflow/models

This paper introduces an online model for object detection in videos designed to run in real-time on low-powered mobile and embedded devices.

VIDEO OBJECT DETECTION

TSM: Temporal Shift Module for Efficient Video Understanding

ICCV 2019 MIT-HAN-LAB/temporal-shift-module

The explosive growth in video streaming gives rise to challenges on performing video understanding at high accuracy and low computation cost.

 Ranked #1 on Action Recognition on Something-Something V1 (Top-1 Accuracy metric)

ACTION RECOGNITION VIDEO OBJECT DETECTION VIDEO RECOGNITION VIDEO UNDERSTANDING

Flow-Guided Feature Aggregation for Video Object Detection

ICCV 2017 msracver/Flow-Guided-Feature-Aggregation

The accuracy of detection suffers from degenerated object appearances in videos, e. g., motion blur, video defocus, rare poses, etc.

VIDEO OBJECT DETECTION VIDEO RECOGNITION

Optimizing Video Object Detection via a Scale-Time Lattice

CVPR 2018 guanfuchen/video_obj

High-performance object detection relies on expensive convolutional networks to compute features, often leading to significant challenges in applications, e. g. those that require detecting objects from video streams in real time.

VIDEO OBJECT DETECTION

Memory Enhanced Global-Local Aggregation for Video Object Detection

CVPR 2020 Scalsol/mega.pytorch

We argue that there are two important cues for humans to recognize objects in videos: the global semantic information and the local localization information.

VIDEO OBJECT DETECTION

Relation Distillation Networks for Video Object Detection

ICCV 2019 Scalsol/mega.pytorch

In this paper, we introduce a new design to capture the interactions across the objects in spatio-temporal context.

REGION PROPOSAL VIDEO OBJECT DETECTION

Vision Meets Drones: Past, Present and Future

16 Jan 2020VisDrone/VisDrone-Dataset

We provide a large-scale drone captured dataset, VisDrone, which includes four tracks, i. e., (1) image object detection, (2) video object detection, (3) single object tracking, and (4) multi-object tracking.

MULTI-OBJECT TRACKING VIDEO OBJECT DETECTION