1 code implementation • 8 Dec 2023 • Hanjung Kim, Jaehyun Kang, Miran Heo, Sukjun Hwang, Seoung Wug Oh, Seon Joo Kim
By effectively resolving the over-reliance on location information, we achieve state-of-the-art results on YouTube-VIS 2019/2021 and Occluded VIS (OVIS).
1 code implementation • CVPR 2023 • Miran Heo, Sukjun Hwang, Jeongseok Hyun, Hanjung Kim, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim
Notably, we greatly outperform the state-of-the-art on the long VIS benchmark (OVIS), improving 5.6 AP with ResNet-50 backbone.
Ranked #6 on Video Instance Segmentation on YouTube-VIS 2021 (using extra training data)
1 code implementation • CVPR 2023 • Hyolim Kang, Hanjung Kim, Joungbin An, Minsu Cho, Seon Joo Kim
Temporal Action Localization (TAL) methods typically operate on top of feature sequences from a frozen snippet encoder that is pretrained on Trimmed Action Classification (TAC) tasks, resulting in a task discrepancy problem.