Video Object Detection

66 papers with code • 7 benchmarks • 10 datasets

Video object detection is the task of detecting objects in video rather than in still images.

(Image credit: Learning Motion Priors for Efficient Video Object Detection)

Benchmarks

These leaderboards are used to track progress in Video Object Detection.

Dataset	Best Model
ImageNet VID	DiffusionVID (Swin-B)
EPIC KITCHENS-seen splits	Temporal ROI Align
EPIC KITCHENS-unseen splits	Temporal ROI Align
USC-GRAD-STDdb	SLTnet FPN-X101
EPIC-KITCHENS-55	Ours (Faster RCNN)
YT-BB	(none listed)
Waymo Open Dataset	(none listed)

Libraries

Use these libraries to find Video Object Detection models and implementations

guanxiongsun/vfe.pytorch

4 papers

open-mmlab/mmtracking

3 papers

3,384

lingyunwu14/STFT

2 papers

Datasets

Latest papers with no code

Memory Maps for Video Object Detection and Tracking on UAVs

no code yet • 6 Mar 2023

This paper introduces a novel approach to video object detection and tracking on Unmanned Aerial Vehicles (UAVs).

Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection

no code yet • 20 Dec 2022

First, no tracking supervisions are in LVIS, which leads to inconsistent learning of detection (with LVIS and TAO) and tracking (only with TAO).

Unifying Tracking and Image-Video Object Detection

no code yet • 20 Nov 2022

We propose TrIVD (Tracking and Image-Video Detection), the first framework that unifies image OD, video OD, and MOT within one end-to-end model.

Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance

no code yet • 10 Nov 2022

Then, the semantic features of the motion representation are obtained through the local attention mechanism in the motion guidance module to obtain the high-level semantic features of the appearance representation.

BoxMask: Revisiting Bounding Box Supervision for Video Object Detection

no code yet • 12 Oct 2022

We present a new, simple yet effective approach to uplift video object detection.

Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection

no code yet • 5 Oct 2022

Second, motivated by sequence-level semantic aggregation, we incorporate the attention-guided Semantic Proposal Feature Aggregation module to enhance object feature representation before detection.

DFA: Dynamic Feature Aggregation for Efficient Video Object Detection

no code yet • 2 Oct 2022

Video object detection is a fundamental yet challenging task in computer vision.

DAFA: Diversity-Aware Feature Aggregation for Attention-Based Video Object Detection

no code yet • IEEE Access 2022

Our method with global and local attention stages obtains 84.5 and 85.9 mAP on ResNet-101 and ResNeXt-101, respectively, thus achieving state-of-the-art performance without requiring additional post-processing methods.

TemporalNet: Real-time 2D-3D Video Object Detection

no code yet • Conference on Robots and Vision 2022

Our TemporalNet is a plug-and-play block that can be added to a multi-scale single-image detection network without any adjustments in the network architecture.

Real-Time Robust Video Object Detection System Against Physical-World Adversarial Attacks

no code yet • 19 Aug 2022

This work proposes Themis, a software/hardware system to defend against adversarial patches for real-time robust video object detection.
