Search Results for author: Mingdeng Cao

Found 17 papers, 13 papers with code

Rolling Shutter Correction with Intermediate Distortion Flow Estimation

2 code implementations • 9 Apr 2024 • Mingdeng Cao, Sidi Yang, Yujiu Yang, Yinqiang Zheng

Additionally, a multi-distortion flow prediction strategy is integrated to mitigate the issue of inaccurate flow estimation further.

Rolling Shutter Correction

156


Taming Lookup Tables for Efficient Image Retouching

1 code implementation • 28 Mar 2024 • Sidi Yang, Binxiao Huang, Mingdeng Cao, Yatai Ji, Hanzhong Guo, Ngai Wong, Yujiu Yang

Existing enhancement models often optimize for high performance while falling short of reducing hardware inference time and power consumption, especially on edge devices with constrained computing and storage resources.

Image Enhancement • Image Retouching


PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

1 code implementation • 7 Dec 2023 • Zhen Li, Mingdeng Cao, Xintao Wang, Zhongang Qi, Ming-Ming Cheng, Ying Shan

Recent advances in text-to-image generation have made remarkable progress in synthesizing realistic human photos conditioned on given text prompts.

Ranked #6 on Diffusion Personalization Tuning Free on AgeDB

Diffusion Personalization Tuning Free • Text-to-Image Generation

8,253


CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models

no code implementations • 30 Oct 2023 • Ziyang Yuan, Mingdeng Cao, Xintao Wang, Zhongang Qi, Chun Yuan, Ying Shan

As a result, our CustomNet ensures enhanced identity preservation and generates diverse, harmonious outputs.

Novel View Synthesis • Object • +1


MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

3 code implementations • ICCV 2023 • Mingdeng Cao, Xintao Wang, Zhongang Qi, Ying Shan, XiaoHu Qie, Yinqiang Zheng

Despite the success in large-scale text-to-image generation and text-conditioned image editing, existing methods still struggle to produce consistent generation and editing results.

Ranked #11 on Text-based Image Editing on PIE-Bench

Text-based Image Editing

631


OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer

1 code implementation • CVPR 2023 • Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong

Omnidirectional images (ODIs) have obtained lots of research interest for immersive experiences.

Data Augmentation • ERP • +1


Polarized Color Image Denoising

no code implementations • CVPR 2023 • Zhuoxiao Li, Haiyang Jiang, Mingdeng Cao, Yinqiang Zheng

Single-chip polarized color photography provides both visual textures and object surface information in one snapshot.

Color Image Denoising • Image Denoising


Blur Interpolation Transformer for Real-World Motion from Blur

1 code implementation • CVPR 2023 • Zhihang Zhong, Mingdeng Cao, Xiang Ji, Yinqiang Zheng, Imari Sato

This paper studies the challenging problem of recovering motion from blur, also known as joint deblurring and interpolation or blur temporal super-resolution.

Deblurring • Super-Resolution

191


Towards Real-World Video Deblurring by Exploring Blur Formation Process

1 code implementation • 28 Aug 2022 • Mingdeng Cao, Zhihang Zhong, Yanbo Fan, Jiahao Wang, Yong Zhang, Jue Wang, Yujiu Yang, Yinqiang Zheng

We believe the novel realistic synthesis pipeline and the corresponding RAW video dataset can help the community to easily construct customized blur datasets to improve real-world video deblurring performance largely, instead of laboriously collecting real data pairs.

Deblurring


Learning Adaptive Warping for Real-World Rolling Shutter Correction

1 code implementation • CVPR 2022 • Mingdeng Cao, Zhihang Zhong, Jiahao Wang, Yinqiang Zheng, Yujiu Yang

This paper proposes the first real-world rolling shutter (RS) correction dataset, BS-RSC, and a corresponding model to correct the RS frames in a distorted video.

Rolling Shutter Correction


MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

2 code implementations • 19 Apr 2022 • Sidi Yang, Tianhe Wu, Shuwei Shi, Shanshan Lao, Yuan Gong, Mingdeng Cao, Jiahao Wang, Yujiu Yang

No-Reference Image Quality Assessment (NR-IQA) aims to assess the perceptual quality of images in accordance with human subjective perception.

Ranked #8 on Video Quality Assessment on MSU SR-QA Dataset

No-Reference Image Quality Assessment • NR-IQA • +1

255


VDTR: Video Deblurring with Transformer

1 code implementation • 17 Apr 2022 • Mingdeng Cao, Yanbo Fan, Yong Zhang, Jue Wang, Yujiu Yang

For multi-frame temporal modeling, we adapt Transformer to fuse multiple spatial features efficiently.

Deblurring • Video Restoration


Bringing Rolling Shutter Images Alive with Dual Reversed Distortion

1 code implementation • 12 Mar 2022 • Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato

In this paper, instead of two consecutive frames, we propose to exploit a pair of images captured by dual RS cameras with reversed RS directions for this highly challenging task.

Optical Flow Estimation


StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

1 code implementation • 8 Mar 2022 • Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang

Our framework elevates the resolution of the synthesized talking face to 1024*1024 for the first time, even though the training dataset has a lower resolution.

Facial Editing • Talking Face Generation • +1

598


Accelerating Neural Network Optimization Through an Automated Control Theory Lens

no code implementations • CVPR 2022 • Jiahao Wang, Baoyuan Wu, Rui Su, Mingdeng Cao, Shuwei Shi, Wanli Ouyang, Yujiu Yang

We conduct experiments both from a control theory lens through a phase locus verification and from a network training lens on several models, including CNNs, Transformers, MLPs, and on benchmark datasets.

Math


NTIRE 2021 Challenge on Perceptual Image Quality Assessment

no code implementations • 7 May 2021 • Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, SungJun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, Jingyu Guo, ZiRui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang, Yifan Chen, Yujiu Yang, Yang Li, Tao Zhang, Longtao Feng, Yiting Liao, Junlin Li, William Thong, Jose Costa Pereira, Ales Leonardis, Steven McDonagh, Kele Xu, Lehan Yang, Hengxing Cai, Pengfei Sun, Seyed Mehdi Ayyoubzadeh, Ali Royat, Sid Ahmed Fezza, Dounia Hammou, Wassim Hamidouche, Sewoong Ahn, Gwangjin Yoon, Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa

This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement (NTIRE) workshop at CVPR 2021.

Image Quality Assessment • Image Restoration


Region-Adaptive Deformable Network for Image Quality Assessment

3 code implementations • 23 Apr 2021 • Shuwei Shi, Qingyan Bai, Mingdeng Cao, Weihao Xia, Jiahao Wang, Yifan Chen, Yujiu Yang

Image quality assessment (IQA) aims to assess the perceptual quality of images.

Image Quality Assessment • Image Restoration

255

