TrackFormer: Multi-Object Tracking with Transformers

The challenging task of multi-object tracking (MOT) requires simultaneous reasoning about track initialization, identity, and spatio-temporal trajectories. We formulate this task as a frame-to-frame set prediction problem and introduce TrackFormer, an end-to-end trainable MOT approach based on an encoder-decoder Transformer architecture. Our model achieves data association between frames via attention by evolving a set of track predictions through a video sequence. The Transformer decoder initializes new tracks from static object queries and autoregressively follows existing tracks in space and time with the conceptually new, identity-preserving track queries. Both query types benefit from self- and encoder-decoder attention over global frame-level features, thereby removing the need for any additional graph optimization or explicit modeling of motion and appearance. TrackFormer introduces a new tracking-by-attention paradigm and, while simple in its design, achieves state-of-the-art performance on multi-object tracking (MOT17 and MOT20) and segmentation (MOTS20). The code is available at https://github.com/timmeinhardt/trackformer.
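
To make the tracking-by-attention loop concrete, below is a minimal, untrained PyTorch sketch of the idea described in the abstract: static object queries and identity-preserving track queries are concatenated and decoded jointly against frame features, and surviving output embeddings become the next frame's track queries. This is not the paper's implementation (the linked repository contains that); all names here (`TrackingByAttention`, `hidden_dim`, `score_thresh`, etc.) are illustrative assumptions.

```python
# Hypothetical sketch of tracking-by-attention, NOT the authors' code.
import torch
import torch.nn as nn


class TrackingByAttention(nn.Module):
    def __init__(self, hidden_dim=256, num_object_queries=100, num_classes=1):
        super().__init__()
        decoder_layer = nn.TransformerDecoderLayer(
            d_model=hidden_dim, nhead=8, batch_first=True)
        self.decoder = nn.TransformerDecoder(decoder_layer, num_layers=6)
        # Static, learned object queries initialize new tracks in every frame.
        self.object_queries = nn.Parameter(
            torch.randn(num_object_queries, hidden_dim))
        self.class_head = nn.Linear(hidden_dim, num_classes + 1)  # +1: background
        self.box_head = nn.Linear(hidden_dim, 4)  # (cx, cy, w, h)

    def forward(self, frame_features, track_queries):
        """frame_features: (1, HW, hidden_dim) encoder output for one frame.
        track_queries: (1, T, hidden_dim) embeddings of tracks alive so far."""
        # Concatenate identity-preserving track queries with the static object
        # queries; self-attention among them handles data association, while
        # encoder-decoder attention attends to global frame-level features.
        queries = torch.cat(
            [track_queries, self.object_queries.unsqueeze(0)], dim=1)
        embeddings = self.decoder(queries, frame_features)
        return self.class_head(embeddings), self.box_head(embeddings), embeddings


def track_video(model, encoder_features_per_frame, score_thresh=0.5):
    """Autoregressively evolve track queries through a video sequence."""
    track_queries = torch.zeros(1, 0, 256)  # no tracks before the first frame
    for feats in encoder_features_per_frame:
        logits, boxes, embeddings = model(feats, track_queries)
        # Confidence of the best non-background class per query.
        scores = logits.softmax(-1)[..., :-1].max(-1).values
        keep = scores[0] > score_thresh
        # Surviving output embeddings become the next frame's track queries,
        # so each query carries a single identity forward in time.
        track_queries = embeddings[:, keep].detach()
    return track_queries
```

In this sketch, track termination falls out of thresholding (a query that drops below `score_thresh` is no longer propagated), and new tracks appear whenever a static object query fires on an unmatched detection, which is the frame-to-frame set prediction view taken in the abstract.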

Published at CVPR 2022.

Results from the Paper


Ranked #1 on Multi-Object Tracking on MOT17 (e2e-MOT metric)

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Multi-Object Tracking | MOT17 | TrackFormer | MOTA | 74.1 | #18 |
| Multi-Object Tracking | MOT17 | TrackFormer | IDF1 | 68.0 | #21 |
| Multi-Object Tracking | MOT17 | TrackFormer | e2e-MOT | Yes | #1 |
| Multi-Object Tracking | MOTS20 | TrackFormer | sMOTSA | 54.9 | #4 |
