no code implementations • 10 Apr 2024 • Chunxu Liu, Guozhen Zhang, Rui Zhao, LiMin Wang
Large motion poses a critical challenge in the Video Frame Interpolation (VFI) task.
no code implementations • 31 Mar 2024 • Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, LiMin Wang
To address this issue, we propose a new dual-level query-based temporal action detection (TAD) framework, namely DualDETR, to detect actions at both the instance level and the boundary level.
no code implementations • 7 Mar 2024 • Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, LiMin Wang
Point-based image editing has attracted remarkable attention since the emergence of DragGAN.
1 code implementation • ICCV 2023 • Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, LiMin Wang
Based on this masking volume, we can track the unmasked tokens in time and sample a set of temporally consistent cubes from videos.
no code implementations • 19 Aug 2023 • Chen Xu, Yuhan Zhu, Guozhen Zhang, Haocheng Shen, Yixuan Liao, Xiaoxin Chen, Gangshan Wu, LiMin Wang
Prompt learning has emerged as an efficient and effective approach for transferring foundational Vision-Language Models (e.g., CLIP) to downstream tasks.
1 code implementation • CVPR 2023 • Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, LiMin Wang
In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation.
Ranked #1 on Video Frame Interpolation on MSU Video Frame Interpolation (PSNR metric)