no code implementations • 3 Jan 2024 • Dengdi Sun, Yajie Pan, Andong Lu, Chenglong Li, Bin Luo
We introduce independent dynamic template tokens that interact with the search region and embed temporal information to handle appearance changes. The initial static template tokens remain involved in joint feature extraction, which preserves the original, reliable target appearance information and prevents the drift away from the target appearance that traditional temporal updates can cause.
Ranked #4 on RGB-T Tracking on RGBT210
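
As a rough illustration of the token layout described in the abstract, the sketch below concatenates static template tokens, dynamic template tokens, and search-region tokens, then mixes them with one self-attention pass. It is only a toy under stated assumptions, not the authors' implementation: the function names (`self_attention`, `joint_extract`, `random_tokens`), the token counts, the single head, and the identity query/key/value projections are all hypothetical, and the RGB and thermal modalities are not modeled.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity projections are used for queries, keys, and values;
    a real tracker would use learned projections and several layers.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def joint_extract(static_tpl, dynamic_tpl, search):
    """Mix all three token groups jointly and return the updated search tokens.

    The static template is only read, never overwritten, so the original
    target appearance stays available on every frame.
    """
    tokens = static_tpl + dynamic_tpl + search
    mixed = self_attention(tokens)
    return mixed[len(static_tpl) + len(dynamic_tpl):]


def random_tokens(n, d, rng):
    """Hypothetical stand-in for patch embeddings: n tokens of dimension d."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(n)]


rng = random.Random(0)
static_tpl = random_tokens(4, 8, rng)    # from the first frame, kept fixed
dynamic_tpl = random_tokens(4, 8, rng)   # refreshed from later frames
search = random_tokens(6, 8, rng)        # current search region

search_features = joint_extract(static_tpl, dynamic_tpl, search)

# A temporal update replaces only the dynamic tokens; static_tpl is untouched.
dynamic_tpl = random_tokens(4, 8, rng)
search_features_next = joint_extract(static_tpl, dynamic_tpl, search)
```

Keeping the two template groups as separate inputs is the point of the sketch: a temporal update swaps `dynamic_tpl` alone, so errors in the update cannot erase the first-frame appearance held in `static_tpl`.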