Object Tracking

584 papers with code • 7 benchmarks • 61 datasets

Object tracking is the task of taking an initial set of object detections, creating a unique ID for each of the initial detections, and then tracking each of the objects as they move around frames in a video, maintaining the ID assignment. State-of-the-art methods involve fusing data from RGB and event-based cameras to produce more reliable object tracking. CNN-based models using only RGB images as input are also effective. The most popular benchmark is OTB. There are several evaluation metrics specific to object tracking, including HOTA, MOTA, IDF1, and Track-mAP.

( Image credit: Towards-Realtime-MOT )

Benchmarks

Add a Result

These leaderboards are used to track progress in Object Tracking

Dataset	Best Model	Compare
COESOT	HR-CEUTrack-Large	See all
FE108	HR-MonTrack-Base	See all
SeaDronesSee	DiMP50	See all
KITTI	M2-Track	See all
MMPTRACK	UMMT	See all
VisEvent	RT-MDNet	See all
BIRDSAI - ICVGIP 2020	final	See all

Libraries

Use these libraries to find Object Tracking models and implementations

visionml/pytracking

9 papers

3,088

PaddlePaddle/PaddleDetection

8 papers

12,082

open-mmlab/mmtracking

6 papers

3,383

mikel-brostrom/yolo_tracking

5 papers

6,113

See all 6 libraries.

Datasets

Subtasks

Cell Tracking

Video Object Tracking

Online Multi-Object Tracking

Thermal Infrared Object Tracking

Sports Ball Detection and Tracking

Pupil Tracking

Amodal Tracking

Most implemented papers

Most implemented Social Latest No code

Lucid Data Dreaming for Video Object Segmentation

omkar13/MaskTrack • • 28 Mar 2017

Our approach is suitable for both single and multiple object segmentation.

Paper
Code

Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data

BerkeleyAutomation/sd-maskrcnn • 16 Sep 2018

We train a variant of Mask R-CNN with domain randomization on the generated dataset to perform category-agnostic instance segmentation without any hand-labeled data and we evaluate the trained network, which we refer to as Synthetic Depth (SD) Mask R-CNN, on a set of real, high-resolution depth images of challenging, densely-cluttered bins containing objects with highly-varied geometry.

Paper
Code

SiamVGG: Visual Tracking using Deeper Siamese Networks

leeyeehoo/SiamVGG • • 7 Feb 2019

It combines a Convolutional Neural Network (CNN) backbone and a cross-correlation operator, and takes advantage of the features from exemplary images for more accurate object tracking.

Paper
Code

LiDARTag: A Real-Time Fiducial Tag System for Point Clouds

UMich-BipedLab/LiDARTag • 23 Aug 2019

Because of the LiDAR sensors' nature, rapidly changing ambient lighting will not affect the detection of a LiDARTag; hence, the proposed fiducial marker can operate in a completely dark environment.

Paper
Code

Probabilistic 3D Multi-Object Tracking for Autonomous Driving

eddyhkchiu/mahalanobis_3d_multi_object_tracking • 16 Jan 2020

Our method estimates the object states by adopting a Kalman Filter.

Paper
Code

Rethinking the competition between detection and ReID in Multi-Object Tracking

JudasDie/SOTS • • 23 Oct 2020

However, the inherent differences and relations between detection and re-identification (ReID) are unconsciously overlooked because of treating them as two isolated tasks in the one-shot tracking paradigm.

Paper
Code

HarDNet-MSEG: A Simple Encoder-Decoder Polyp Segmentation Neural Network that Achieves over 0.9 Mean Dice and 86 FPS

james128333/HarDNet-MSEG • • 18 Jan 2021

The decoder part is inspired by the Cascaded Partial Decoder, known for fast and accurate salient object detection.

Paper
Code

Model-free Vehicle Tracking and State Estimation in Point Cloud Sequences

TuSimple/LiDAR_SOT • 10 Mar 2021

The code and protocols for our benchmark and algorithm are available at https://github. com/TuSimple/LiDAR_SOT/.

Paper
Code

Improving Object Detection, Multi-object Tracking, and Re-Identification for Disaster Response Drones

mlvlab/drone_ai_challenge • • 5 Jan 2022

In the second approach, although DeepSORT only processes a quarter of all frames due to hardware and time limitations, our model with DeepSORT (42. 9%) outperforms FairMOT (71. 4%) in terms of recall.

Paper
Code

MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

DanceTrack/DanceTrack • • CVPR 2023

In this paper, we propose MOTRv2, a simple yet effective pipeline to bootstrap end-to-end multi-object tracking with a pretrained object detector.

Paper
Code

Object Tracking

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result