Video Object Detection

66 papers with code • 7 benchmarks • 10 datasets

Video object detection is the task of detecting objects from a video as opposed to images.

( Image credit: Learning Motion Priors for Efficient Video Object Detection )

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Object Detection

Dataset	Best Model	Compare
ImageNet VID	DiffusionVID (Swin-B)	See all
EPIC KITCHENS-seen splits	Temporal ROI Align	See all
EPIC KITCHENS-unseen splits	Temporal ROI Align	See all
USC-GRAD-STDdb	SLTnet FPN-X101	See all
EPIC-KITCHENS-55	Ours (Faster RCNN)	See all
YT-BB		See all
Waymo Open Dataset		See all

Libraries

Use these libraries to find Video Object Detection models and implementations

guanxiongsun/vfe.pytorch

4 papers

open-mmlab/mmtracking

3 papers

3,384

lingyunwu14/STFT

2 papers

Datasets

Latest papers

Most implemented Social Latest No code

Video Sparse Transformer With Attention-Guided Memory for Video Object Detection

Malik1998/VSTAM • • IEEE Access 2022

In this paper, we enhance features element-wisely before the object candidate region detection, proposing Video Sparse Transformer with Attention-guided Memory (VSTAM).

17 Jun 2022

Paper
Code

Representation Recycling for Streaming Video Analysis

ufukertenli/streamdeq-code • • 28 Apr 2022

Our experiments on video semantic segmentation, video object detection, and human pose estimation in videos show that StreamDEQ achieves on-par accuracy with the baseline while being more than 2-4x faster.

28 Apr 2022

Paper
Code

Delta Distillation for Efficient Video Processing

Qualcomm-AI-research/delta-distillation • • 17 Mar 2022

By extensive experiments on a wide range of architectures, including the most efficient ones, we demonstrate that delta distillation sets a new state of the art in terms of accuracy vs. efficiency trade-off for semantic segmentation and object detection in videos.

17 Mar 2022

Paper
Code

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

SJTU-LuHe/TransVOD • • 13 Jan 2022

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

197

13 Jan 2022

Paper
Code

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

marioduran/tyolov5 • • 17 Nov 2021

Much of the previous research on handgun detection is based on static image detectors, leaving aside valuable temporal information that could be used to improve object detection in videos.

17 Nov 2021

Paper
Code

AI Accelerator Survey and Trends

areuther/ai-accelerators • 18 Sep 2021

Over the past several years, new machine learning accelerators were being announced and released every month for a variety of applications from speech recognition, video object detection, assisted driving, and many data center applications.

136

18 Sep 2021

Paper
Code

FFAVOD: Feature Fusion Architecture for Video Object Detection

hu64/ffavod • • 15 Sep 2021

We propose FFAVOD, standing for feature fusion architecture for video object detection.

15 Sep 2021

Paper
Code

Temporal RoI Align for Video Object Recognition

open-mmlab/mmtracking • • 8 Sep 2021

In this work, considering the features of the same object instance are highly similar among frames in a video, a novel Temporal RoI Align operator is proposed to extract features from other frames feature maps for current frame proposals by utilizing feature similarity.

3,384

08 Sep 2021

Paper
Code

TF-Blender: Temporal Feature Blender for Video Object Detection

goodproj13/tf-blender • • ICCV 2021

One of the popular solutions is to exploit the temporal information and enhance per-frame representation through aggregating features from neighboring frames.

12 Aug 2021

Paper
Code

End-to-End Video Object Detection with Spatial-Temporal Transformers

SJTU-LuHe/TransVOD • • 23 May 2021

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

197

23 May 2021

Paper
Code

Video Object Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result