Video Object Detection

66 papers with code • 7 benchmarks • 10 datasets

Video object detection is the task of detecting objects from a video as opposed to images.

( Image credit: Learning Motion Priors for Efficient Video Object Detection )

Libraries

Use these libraries to find Video Object Detection models and implementations

Video Sparse Transformer With Attention-Guided Memory for Video Object Detection

Malik1998/VSTAM IEEE Access 2022

In this paper, we enhance features element-wisely before the object candidate region detection, proposing Video Sparse Transformer with Attention-guided Memory (VSTAM).

29
17 Jun 2022

Representation Recycling for Streaming Video Analysis

ufukertenli/streamdeq-code 28 Apr 2022

Our experiments on video semantic segmentation, video object detection, and human pose estimation in videos show that StreamDEQ achieves on-par accuracy with the baseline while being more than 2-4x faster.

8
28 Apr 2022

Delta Distillation for Efficient Video Processing

Qualcomm-AI-research/delta-distillation 17 Mar 2022

By extensive experiments on a wide range of architectures, including the most efficient ones, we demonstrate that delta distillation sets a new state of the art in terms of accuracy vs. efficiency trade-off for semantic segmentation and object detection in videos.

4
17 Mar 2022

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

SJTU-LuHe/TransVOD 13 Jan 2022

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

197
13 Jan 2022

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

marioduran/tyolov5 17 Nov 2021

Much of the previous research on handgun detection is based on static image detectors, leaving aside valuable temporal information that could be used to improve object detection in videos.

20
17 Nov 2021

AI Accelerator Survey and Trends

areuther/ai-accelerators 18 Sep 2021

Over the past several years, new machine learning accelerators were being announced and released every month for a variety of applications from speech recognition, video object detection, assisted driving, and many data center applications.

136
18 Sep 2021

FFAVOD: Feature Fusion Architecture for Video Object Detection

hu64/ffavod 15 Sep 2021

We propose FFAVOD, standing for feature fusion architecture for video object detection.

23
15 Sep 2021

Temporal RoI Align for Video Object Recognition

open-mmlab/mmtracking 8 Sep 2021

In this work, considering the features of the same object instance are highly similar among frames in a video, a novel Temporal RoI Align operator is proposed to extract features from other frames feature maps for current frame proposals by utilizing feature similarity.

3,384
08 Sep 2021

TF-Blender: Temporal Feature Blender for Video Object Detection

goodproj13/tf-blender ICCV 2021

One of the popular solutions is to exploit the temporal information and enhance per-frame representation through aggregating features from neighboring frames.

13
12 Aug 2021

End-to-End Video Object Detection with Spatial-Temporal Transformers

SJTU-LuHe/TransVOD 23 May 2021

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

197
23 May 2021