1 code implementation • 15 Apr 2024 • Bozhi Luan, Hao Feng, Hong Chen, Yonghui Wang, Wengang Zhou, Houqiang Li
The image overview stage provides a comprehensive understanding of the global scene information, and the coarse localization stage approximates the image area containing the answer based on the question asked.
no code implementations • 18 Mar 2024 • Yue Ding, Hongqiao Shi, Shuang Song, Yonghui Wang, Ya Li
The integration of local elements into shape contours is critical for target detection and identification in cluttered scenes.
1 code implementation • 22 Nov 2023 • Yonghui Wang, Wengang Zhou, Hao Feng, Keyi Zhou, Houqiang Li
Moreover, we curate a collection of text-rich images and prompt the text-only GPT-4 to generate 12K high-quality conversations, featuring textual locations within text-rich scenarios.
no code implementations • 1 Nov 2023 • Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li
To handle this issue, we consider removing the shadow in a coarse-to-fine fashion and propose a simple but effective Progressive Recurrent Network (PRNet).
1 code implementation • 26 May 2023 • Yonghui Wang, Wengang Zhou, Yunyao Mao, Houqiang Li
Segment anything model (SAM) has achieved great success in the field of natural image segmentation.
1 code implementation • 15 Oct 2022 • Yonghui Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
To this end, we propose UDoc-GAN, the first framework to address the problem of document illumination correction under the unpaired setting.
no code implementations • Applied Computational Intelligence and Soft Computing 2020 • Suxia Cui, Yu Zhou, Yonghui Wang, Lujun Zhai
An advanced system with more computing power can facilitate deep learning feature, which exploit many neural network algorithms to simulate human brains.