1 code implementation • CVPR 2023 • Hoyoung Choi, Seungwan Jin, Kyungsik Han
Vision transformers use [CLS] tokens to predict image classes.
Object Discovery Weakly-Supervised Object Localization