Visual Object Tracking

150 papers with code • 21 benchmarks • 26 datasets

Visual Object Tracking is an important research topic in computer vision, image understanding and pattern recognition. Given the initial state (centre location and scale) of a target in the first frame of a video sequence, the aim of Visual Object Tracking is to automatically obtain the states of the object in the subsequent video frames.

Source: Learning Adaptive Discriminative Correlation Filters via Temporal Consistency Preserving Spatial Feature Selection for Robust Visual Object Tracking

Benchmarks

Add a Result

These leaderboards are used to track progress in Visual Object Tracking

Dataset	Best Model	Compare
LaSOT	ODTrack-L	See all
TrackingNet	ARTrackV2-L	See all
GOT-10k	ARTrackV2-L	See all
VOT2017/18	SiamMask_E	See all
OTB-2015	ODTrack-L	See all
UAV123	NeighborTrack-OSTrack	See all
LaSOT-ext	UNINEXT-H	See all
YouTube-VOS 2018	OSVOS	See all
OTB-2013	SE-SiamFC	See all
TNL2K	ODTrack-L	See all
VOT2017	GFS-DCF	See all
VOT2016	SiamMask_E	See all
OTB-50	SiamVGG	See all
NeedForSpeed	ARTrackV2-L	See all
VOT2019	TREG	See all
VOT2018	TREG	See all
OTB-100	DiMP-NCE+	See all
YouTube-VOS	AOC-MF	See all
TempleColor128	AAA	See all
ITB	DropTrack	See all
VOT2014		See all

Show all 21 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Visual Object Tracking models and implementations

visionml/pytracking

7 papers

3,080

fengyang95/pyCFTrackers

4 papers

506

martin-danelljan/ECO

3 papers

610

researchmm/SiamDW

2 papers

748

See all 5 libraries.

Datasets

Subtasks

Zero-Shot Single Object Tracking

Latest papers

Most implemented Social Latest No code

LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks

faceonlive/ai-research • 9 Apr 2024

To achieve high accuracy on both clean and adversarial data, we propose building a spatial-temporal continuous representation using the semantic text guidance of the object of interest.

131

09 Apr 2024

Paper
Code

OmniVid: A Generative Framework for Universal Video Understanding

wangjk666/omnivid • • 26 Mar 2024

The core of video understanding tasks, such as recognition, captioning, and tracking, is to automatically detect objects or actions in a video and analyze their temporal evolution.

26 Mar 2024

Paper
Code

Elysium: Exploring Object-level Perception in Videos via MLLM

hon-wong/elysium • 25 Mar 2024

Multi-modal Large Language Models (MLLMs) have demonstrated their ability to perceive objects in still images, but their application in video-related tasks, such as object tracking, remains understudied.

25 Mar 2024

Paper
Code

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

hoqolo/sdstrack • • 24 Mar 2024

Multimodal Visual Object Tracking (VOT) has recently gained significant attention due to its robustness.

24 Mar 2024

Paper
Code

VastTrack: Vast Category Visual Object Tracking

henglan/vasttrack • 6 Mar 2024

The rich annotations of VastTrack enables development of both the vision-only and the vision-language tracking.

06 Mar 2024

Paper
Code

Spatio-temporal Prompting Network for Robust Video Feature Extraction

guanxiongsun/vfe.pytorch • • ICCV 2023

Then, these video prompts are prepended to the patch embeddings of the current frame as the updated input for video feature extraction.

04 Feb 2024

Paper
Code

Correlation-Embedded Transformer Tracking: A Single-Branch Framework

phiphiphi31/SBT • • 23 Jan 2024

Thus, we reformulate the two-branch Siamese tracking as a conceptually simple, fully transformer-based Single-Branch Tracking pipeline, dubbed SBT.

23 Jan 2024

Paper
Code

Explicit Visual Prompts for Visual Object Tracking

GXNU-ZhongLab/EVPTrack • • 6 Jan 2024

Specifically, we utilize spatio-temporal tokens to propagate information between consecutive frames without focusing on updating templates.

06 Jan 2024

Paper
Code

ODTrack: Online Dense Temporal Token Learning for Visual Tracking

gxnu-zhonglab/odtrack • • 3 Jan 2024

To alleviate the above problem, we propose a simple, flexible and effective video-level tracking pipeline, named \textbf{ODTrack}, which densely associates the contextual relationships of video frames in an online token propagation manner.

03 Jan 2024

Paper
Code

ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe

miv-xjtu/artrack • • 28 Dec 2023

We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where to look (localization) and how to describe (appearance analysis) the target object across video frames.

185

28 Dec 2023

Paper
Code

Visual Object Tracking

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result