1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson
Deep video models, for example, 3D CNNs or video transformers, have achieved promising performance on sparse video tasks, i.e., predicting one result per video.
1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson
Based on the analysis, we present a simple yet efficient framework to address the computational bottlenecks and achieve efficient one-stage VOD by exploiting the temporal consistency in video frames.
2 code implementations • ICCV 2023 • Guanxiong Sun, Chi Wang, Zhaoyu Zhang, Jiankang Deng, Stefanos Zafeiriou, Yang Hua
Then, these video prompts are prepended to the patch embeddings of the current frame as the updated input for video feature extraction.
1 code implementation • 18 Jan 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson
However, we argue that these memory structures are not efficient or sufficient because of two implied operations: (1) concatenating all features in memory for enhancement, leading to a heavy computational cost; (2) frame-wise memory updating, preventing the memory from capturing more temporal information.
no code implementations • 5 Dec 2023 • Vasileios Baltatzis, Rolandos Alexandros Potamias, Evangelos Ververas, Guanxiong Sun, Jiankang Deng, Stefanos Zafeiriou
Sign Languages (SL) serve as the primary mode of communication for the Deaf and Hard of Hearing communities.
no code implementations • 22 Sep 2018 • Guanxiong Sun, Chengqin Ye, Kuanquan Wang
In this work, we propose a convolutional network architecture combined with a novel attention model.