Search Results for author: Guozhen Zhang

Found 6 papers, 2 papers with code

Sparse Global Matching for Video Frame Interpolation with Large Motion

no code implementations • 10 Apr 2024 • Chunxu Liu, Guozhen Zhang, Rui Zhao, LiMin Wang

Large motion poses a critical challenge in Video Frame Interpolation (VFI) task.

Paper
Add Code

Dual DETRs for Multi-Label Temporal Action Detection

no code implementations • 31 Mar 2024 • Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, LiMin Wang

To address this issue, we propose a new Dual-level query-based TAD framework, namely DualDETR, to detect actions from both instance-level and boundary-level.

Action Detection object-detection +1

Paper
Add Code

StableDrag: Stable Dragging for Point-based Image Editing

no code implementations • 7 Mar 2024 • Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, LiMin Wang

Point-based image editing has attracted remarkable attention since the emergence of DragGAN.

Point Tracking

Paper
Add Code

MGMAE: Motion Guided Masking for Video Masked Autoencoding

1 code implementation • ICCV 2023 • Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, LiMin Wang

Based on this masking volume, we can track the unmasked tokens in time and sample a set of temporal consistent cubes from videos.

Optical Flow Estimation Representation Learning

Paper
Code

DPL: Decoupled Prompt Learning for Vision-Language Models

no code implementations • 19 Aug 2023 • Chen Xu, Yuhan Zhu, Guozhen Zhang, Haocheng Shen, Yixuan Liao, Xiaoxin Chen, Gangshan Wu, LiMin Wang

Prompt learning has emerged as an efficient and effective approach for transferring foundational Vision-Language Models (e. g., CLIP) to downstream tasks.

Paper
Add Code

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

1 code implementation • CVPR 2023 • Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, LiMin Wang

In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation.

Ranked #1 on Video Frame Interpolation on MSU Video Frame Interpolation (PSNR metric)

Video Frame Interpolation

316

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.