no code implementations • 31 Mar 2024 • Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, LiMin Wang
To address this issue, we propose a new dual-level query-based TAD framework, namely DualDETR, to detect actions at both the instance level and the boundary level.
no code implementations • 19 Aug 2023 • Chen Xu, Yuhan Zhu, Guozhen Zhang, Haocheng Shen, Yixuan Liao, Xiaoxin Chen, Gangshan Wu, LiMin Wang
Prompt learning has emerged as an efficient and effective approach for transferring foundational Vision-Language Models (e.g., CLIP) to downstream tasks.
1 code implementation • CVPR 2023 • Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, LiMin Wang
In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation.
Ranked #1 on Video Frame Interpolation on MSU Video Frame Interpolation (PSNR metric)