3 code implementations • CVPR 2022 • Junhyeong Cho, Youngseok Yoon, Suha Kwak
To implement this idea, we propose Collaborative Glance-Gaze TransFormer (CoFormer) that consists of two modules: Glance transformer for activity classification and Gaze transformer for entity estimation.
Ranked #2 on Situation Recognition on imSitu
1 code implementation • 19 Nov 2021 • Junhyeong Cho, Youngseok Yoon, Hyeonjun Lee, Suha Kwak
Grounded Situation Recognition (GSR) is the task that not only classifies a salient action (verb), but also predicts entities (nouns) associated with semantic roles and their locations in the given image.
Ranked #5 on Situation Recognition on imSitu