Video Object Detection

66 papers with code • 7 benchmarks • 10 datasets

Video object detection is the task of detecting objects in video, using the frame sequence as input rather than individual still images.

(Image credit: Learning Motion Priors for Efficient Video Object Detection)

Benchmarks

These leaderboards are used to track progress in Video Object Detection.

Dataset	Best Model
ImageNet VID	DiffusionVID (Swin-B)
EPIC KITCHENS-seen splits	Temporal ROI Align
EPIC KITCHENS-unseen splits	Temporal ROI Align
USC-GRAD-STDdb	SLTnet FPN-X101
EPIC-KITCHENS-55	Ours (Faster RCNN)
YT-BB	
Waymo Open Dataset	

Libraries

Use these libraries to find Video Object Detection models and implementations.

guanxiongsun/vfe.pytorch

4 papers

open-mmlab/mmtracking

3 papers

lingyunwu14/STFT

2 papers

Latest papers

Objects do not disappear: Video object detection by single-frame object location anticipation

l-kid/video-object-detection-by-location-anticipation • ICCV 2023

The method improves efficiency by performing the expensive feature computations on only a small subset of all frames.

09 Aug 2023

Video object detection for privacy-preserving patient monitoring in intensive care

raember/yolov5r_autodidact • 26 Jun 2023

In this paper, we propose a new method for exploiting information in the temporal succession of video frames.

3D Video Object Detection with Learnable Object-Centric Global Optimization

jiaweihe1996/ba-det • CVPR 2023

We explore long-term temporal visual correspondence-based optimization for 3D video object detection in this work.

27 Mar 2023

FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors

yimingcuicuicui/faq • 15 Mar 2023

With Transformer-based object detectors getting a better performance on the image domain tasks, recent works began to extend those methods to video object detection.

Feature Aggregated Queries for Transformer-Based Video Object Detectors

yimingcuicuicui/faq • CVPR 2023

With Transformer-based object detectors getting a better performance on the image domain tasks, recent works began to extend those methods to video object detection.

01 Jan 2023

Fewer is More: Efficient Object Detection in Large Aerial Images

ranchosky/oan • 26 Dec 2022

Current mainstream object detection methods for large aerial images usually divide large images into patches and then exhaustively detect the objects of interest on all patches, regardless of whether a patch contains any objects.

Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark

roboflow-ai/roboflow-100-benchmark • 24 Nov 2022

The evaluation of object detection models is usually performed by optimizing a single metric, e.g., mAP, on a fixed set of datasets, e.g., Microsoft COCO and Pascal VOC.

Deep-Learning-Based Computer Vision Approach For The Segmentation Of Ball Deliveries And Tracking In Cricket

theimad/cricket-image-segmentation • 22 Nov 2022

Our research tries to solve one of these problems by segmenting ball deliveries in a cricket broadcast using deep learning models, MobileNet and YOLO, thus enabling researchers to use our work as a dataset for their research.

PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

hon-wong/ptseformer • 6 Sep 2022

The temporal information is introduced by the temporal feature aggregation model (TFAM), by conducting an attention mechanism between the context frames and the target frame (i.e., the frame to be detected).

YOLOV: Making Still Image Object Detectors Great at Video Object Detection

yuhengsss/yolov • 20 Aug 2022

On the positive side, the detection in a certain frame of a video, compared with that in a still image, can draw support from other frames.
