no code implementations • 27 Mar 2024 • Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao
Two key techniques are introduced into InstructBrush, Attention-based Instruction Optimization and Transformation-oriented Instruction Initialization, to address the limitations of the previous method in terms of inversion effects and instruction generalization.
no code implementations • 29 Jan 2024 • Shiyin Dong, Mingrui Zhu, Kun Cheng, Nannan Wang, Xinbo Gao
Our purpose is to establish a unified visual perception framework, capitalizing on the potential synergies between generative and discriminative models.
no code implementations • 24 Nov 2023 • Ruoyu Zhao, Mingrui Zhu, Shiyin Dong, Nannan Wang, Xinbo Gao
We propose CatVersion, an inversion-based method that learns the personalized concept through a handful of examples.
no code implementations • 15 Nov 2023 • Dongxin Chen, Mingrui Zhu, Nannan Wang, Xinbo Gao
To disentangle the latent codes in the GAN inversion space, we introduce an Identity Disentanglement Module (IDM).
no code implementations • 11 Sep 2023 • Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao
In this paper, we unify the task of anonymization and visual identity information hiding and propose a novel face privacy protection method based on diffusion models, dubbed Diff-Privacy.
no code implementations • 9 May 2023 • Shiyin Dong, Mingrui Zhu, Nannan Wang, Xinbo Gao
Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions.
no code implementations • 28 Jan 2023 • Ruoyu Zhao, Mingrui Zhu, Xiaoyu Wang, Nannan Wang
GPD contains two models: a teacher network with GAN Prior and a student network that fulfills end-to-end translation.
no code implementations • 24 Jan 2023 • Xiao He, Mingrui Zhu, Nannan Wang, Xinbo Gao, Heng Yang
To address this issue, we propose a novel font generation approach by learning the Difference between different styles and the Similarity of the same style (DS-Font).
no code implementations • ICCV 2023 • Mingrui Zhu, Xiao He, Nannan Wang, Xiaoyu Wang, Xinbo Gao
In this paper, we propose a novel all-to-key attention mechanism -- each position of content features is matched to stable key positions of style features -- that is more in line with the characteristics of style transfer.
1 code implementation • 27 Nov 2022 • Kun Cheng, Xiaodong Cun, Yong Zhang, Menghan Xia, Fei Yin, Mingrui Zhu, Xuan Wang, Jue Wang, Nannan Wang
Our system disentangles this objective into three sequential tasks: (1) face video generation with a canonical expression; (2) audio-driven lip-sync; and (3) face enhancement for improving photo-realism.
1 code implementation • 4 Mar 2022 • Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao
The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer.
no code implementations • 26 May 2017 • Huahui Liu, Mingrui Zhu, Xiaonan Meng, Yi Hu, Hao Wang
In recent years, RTB (Real-Time Bidding) has become a popular online advertisement trading method.