The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge
We present our 1st-place solution to the Group Dance Multiple People Tracking Challenge. Building on MOTR (End-to-End Multiple-Object Tracking with Transformer), we explore four techniques: 1) treating detect queries as anchors, 2) formulating tracking as query denoising, 3) joint training on pseudo video clips generated from the CrowdHuman dataset, and 4) using YOLOX detection proposals to initialize the anchors of the detect queries. Our method achieves 73.4% HOTA on the DanceTrack test set, surpassing the second-place solution by 6.8% HOTA.
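To make item 3 concrete, here is a minimal sketch of one way a pseudo video clip could be built from a single still image such as a CrowdHuman sample. The abstract does not say how the clips are generated, so the drifting-crop scheme below is an assumption, and the function name `make_pseudo_clip` and all of its parameters are hypothetical. Only the box geometry is shown; cropping the pixels themselves would use the same windows.

```python
import random
from typing import Dict, List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def make_pseudo_clip(
    img_w: int,
    img_h: int,
    boxes: List[Box],
    num_frames: int = 5,
    crop_frac: float = 0.8,
    max_shift: int = 20,
    seed: Optional[int] = None,
) -> List[Dict]:
    """Simulate camera motion over one still image (assumed scheme).

    A fixed-size crop window drifts randomly across the image. Each box
    keeps its list index as a track ID, so identities stay consistent
    across the generated frames.
    """
    rng = random.Random(seed)
    crop_w, crop_h = int(img_w * crop_frac), int(img_h * crop_frac)
    # Start the crop window at a random valid position.
    x0 = rng.randint(0, img_w - crop_w)
    y0 = rng.randint(0, img_h - crop_h)

    clip = []
    for _ in range(num_frames):
        tracks = []
        for track_id, (x1, y1, x2, y2) in enumerate(boxes):
            # Translate the box into crop coordinates and clamp to the window.
            nx1 = min(max(x1 - x0, 0), crop_w)
            ny1 = min(max(y1 - y0, 0), crop_h)
            nx2 = min(max(x2 - x0, 0), crop_w)
            ny2 = min(max(y2 - y0, 0), crop_h)
            # Drop boxes that fall entirely outside the window.
            if nx2 > nx1 and ny2 > ny1:
                tracks.append((track_id, (nx1, ny1, nx2, ny2)))
        clip.append({"crop": (x0, y0, x0 + crop_w, y0 + crop_h), "tracks": tracks})
        # Drift the window by a small random offset for the next frame.
        x0 = min(max(x0 + rng.randint(-max_shift, max_shift), 0), img_w - crop_w)
        y0 = min(max(y0 + rng.randint(-max_shift, max_shift), 0), img_h - crop_h)
    return clip
```

Calling `make_pseudo_clip(640, 480, [(100, 100, 200, 300)], num_frames=4, seed=0)` returns four frames, each with its crop window and the visible boxes tagged by track ID. A tracker trained on such clips sees the same identity move between frames even though the source is a static detection dataset.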
Methods
1x1 Convolution
Absolute Position Encodings
Adam
Average Pooling
Batch Normalization
BPE
Convolution
CSPDarknet53
Darknet-53
Dense Connections
Dropout
Global Average Pooling
Label Smoothing
Layer Normalization
Linear Layer
Multi-Head Attention
Position-Wise Feed-Forward Layer
Residual Connection
Scaled Dot-Product Attention
Softmax
Test
Transformer
YOLOX