Visual Object Tracking
150 papers with code • 21 benchmarks • 26 datasets
Visual Object Tracking is an important research topic in computer vision, image understanding and pattern recognition. Given the initial state (centre location and scale) of a target in the first frame of a video sequence, the aim of Visual Object Tracking is to automatically estimate the state of that target in each subsequent frame.
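The problem formulation above can be sketched with a toy tracker. The code below is a minimal, hypothetical illustration (not any of the listed methods): given the target's initial bounding box in the first frame, it searches a small neighbourhood in each subsequent frame for the patch that best matches the initial template under a sum-of-squared-differences cost. All function names and the synthetic moving-blob sequence are invented for this sketch.

```python
def crop(frame, x, y, w, h):
    # Extract the w x h patch whose top-left corner is (x, y).
    return [row[x:x + w] for row in frame[y:y + h]]

def ssd(a, b):
    # Sum of squared differences between two equal-size patches.
    return sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def track(frames, init_box, radius=2):
    # Return the estimated (x, y, w, h) state for every frame,
    # starting from the given initial state in frames[0].
    x, y, w, h = init_box
    template = crop(frames[0], x, y, w, h)
    states = [init_box]
    for frame in frames[1:]:
        best = None
        for dy in range(-radius, radius + 1):
            for dx in range(-radius, radius + 1):
                nx, ny = x + dx, y + dy
                if nx < 0 or ny < 0 or ny + h > len(frame) or nx + w > len(frame[0]):
                    continue  # candidate box falls outside the frame
                cost = ssd(template, crop(frame, nx, ny, w, h))
                if best is None or cost < best[0]:
                    best = (cost, nx, ny)
        _, x, y = best
        states.append((x, y, w, h))
    return states

# Synthetic sequence: a 2x2 bright blob drifts one pixel right per frame.
def make_frame(bx, by, size=8):
    frame = [[0] * size for _ in range(size)]
    for dy in range(2):
        for dx in range(2):
            frame[by + dy][bx + dx] = 255
    return frame

frames = [make_frame(1 + t, 3) for t in range(4)]
print(track(frames, (1, 3, 2, 2)))
# → [(1, 3, 2, 2), (2, 3, 2, 2), (3, 3, 2, 2), (4, 3, 2, 2)]
```

Real trackers replace the exhaustive template search with learned appearance models (e.g. Siamese networks or transformers, as in the papers below), but the input/output contract is the same: an initial state in, a state per frame out.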
Libraries
Use these libraries to find Visual Object Tracking models and implementations.
Latest papers
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline
Tracking using bio-inspired event cameras has drawn increasing attention in recent years.
Mobile Vision Transformer-based Visual Object Tracking
We propose a lightweight, accurate, and fast tracking algorithm using Mobile Vision Transformers (MobileViT) as the backbone for the first time.
Separable Self and Mixed Attention Transformers for Efficient Object Tracking
Our ablation study testifies to the effectiveness of the proposed combination of backbone and head modules.
Improving Underwater Visual Tracking With a Large Scale Dataset and Image Enhancement
The method has resulted in a significant performance improvement of up to 5.0% AUC for state-of-the-art (SOTA) visual trackers.
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
Tracking any given object(s) spatially and temporally is a common purpose in Visual Object Tracking (VOT) and Video Object Segmentation (VOS).
360VOT: A New Benchmark Dataset for Omnidirectional Visual Object Tracking
360° images can provide an omnidirectional field of view which is important for stable and long-term scene perception.
Tracking Anything in High Quality
To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.
Cross-Drone Transformer Network for Robust Single Object Tracking
During tracking, a cross-drone mapping mechanism uses the surrounding information from a drone with a promising tracking status as a reference, helping drones that have lost their targets to re-calibrate and enabling real-time cross-drone information interaction.
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
In this paper, we introduce a new sequence-to-sequence learning framework for RGB-based and multi-modal object tracking.
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks
However, we find that this simple baseline heavily relies on spatial cues while ignoring temporal relations for frame reconstruction, thus leading to sub-optimal temporal matching representations for VOT and VOS.