Search Results for author: Yuanzhi Zhu

Found 11 papers, 7 papers with code

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

no code implementations • 20 Mar 2024 • Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin

Text recognition, especially for complex scripts like Chinese, faces unique challenges due to its intricate character structures and vast vocabulary.

Zero-Shot Learning


Conditional Text Image Generation with Diffusion Models

no code implementations • CVPR 2023 • Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao

Current text recognition systems, including those for handwritten scripts and scene text, have relied heavily on image synthesis and augmentation, since it is difficult to realize real-world complexity and diversity through collecting and annotating enough real text images.

Domain Adaptation, Image Generation


Denoising Diffusion Models for Plug-and-Play Image Restoration

2 code implementations • 15 May 2023 • Yuanzhi Zhu, Kai Zhang, Jingyun Liang, JieZhang Cao, Bihan Wen, Radu Timofte, Luc van Gool

Although diffusion models have shown impressive performance for high-quality image synthesis, their potential to serve as a generative denoiser prior for plug-and-play image restoration (IR) methods remains to be further explored.

Deblurring, Denoising, +4 more

310 stars


DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

3 code implementations • ICCV 2023 • Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc van Gool

To leverage strong generative priors and address challenges such as unstable training and lack of interpretability for GAN-based generative methods, we propose a novel fusion algorithm based on the denoising diffusion probabilistic model (DDPM).

Denoising

320 stars


SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text

no code implementations • 23 Feb 2022 • Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Zhe Li, Dezhi Peng

We propose a style bank that parameterizes specific handwriting styles as latent vectors, which are fed to a generator as style priors to produce the corresponding handwriting styles.

Attribute, Generative Adversarial Network


Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter

1 code implementation • CVPR 2021 • Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo

We integrate implicit feature alignment (IFA) into the two most prevalent text recognition streams (attention-based and CTC-based) and propose attention-guided dense prediction (ADP) and Extended CTC (ExCTC).

Optical Character Recognition (OCR), +1 more


Text Recognition in the Wild: A Survey

1 code implementation • 7 May 2020 • Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang

This paper aims to (1) summarize the fundamental problems and the state of the art in scene text recognition; (2) introduce new insights and ideas; (3) provide a comprehensive review of publicly available resources; and (4) point out directions for future work.

Scene Text Recognition

596 stars


Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

3 code implementations • CVPR 2020 • Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

An agent network learns from the output of the recognition network and controls the fiducial points to generate more suitable training samples for the recognition network.

Image Augmentation