Search Results for author: Chuhui Xue

Found 20 papers, 4 papers with code

Debiasing Text-to-Image Diffusion Models

no code implementations • 22 Feb 2024 • Ruifei He, Chuhui Xue, Haoru Tan, Wenqing Zhang, Yingchen Yu, Song Bai, Xiaojuan Qi

Despite its simplicity, we show that IDA is efficient and converges quickly in resolving the social bias in TTI diffusion models.


Dataset Condensation via Generative Model

no code implementations • 14 Sep 2023 • David Junhao Zhang, Heng Wang, Chuhui Xue, Rui Yan, Wenqing Zhang, Song Bai, Mike Zheng Shou

Dataset condensation aims to condense a large dataset with a lot of training samples into a small set.

Dataset Condensation


Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

no code implementations • 13 Aug 2023 • David Junhao Zhang, Mutian Xu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou

Despite the rapid advancement of unsupervised learning in visual representation, it requires training on large-scale datasets that demand costly data collection, and pose additional challenges due to concerns regarding data privacy.

Contrastive Learning Image Classification +2


Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding

no code implementations • 1 Aug 2023 • Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

To address this challenge, we propose to harness pre-trained vision-language (VL) foundation models that encode extensive knowledge from image-text pairs to generate captions for multi-view images of 3D scenes.

Ranked #3 on 3D Open-Vocabulary Instance Segmentation on S3DIS

3D Open-Vocabulary Instance Segmentation Instance Segmentation +4


DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

3 code implementations • 26 Jun 2023 • Yujun Shi, Chuhui Xue, Jun Hao Liew, Jiachun Pan, Hanshu Yan, Wenqing Zhang, Vincent Y. F. Tan, Song Bai

In this work, we extend this editing framework to diffusion models and propose a novel approach DragDiffusion.

1,052


Domain Adaptive Scene Text Detection via Subcategorization

no code implementations • 1 Dec 2022 • Zichen Tian, Chuhui Xue, Jingyi Zhang, Shijian Lu

We study domain adaptive scene text detection, a largely neglected yet very meaningful task that aims for optimal transfer of labelled scene text images while handling unlabelled images in various new domains.

Scene Text Detection Text Detection


PLA: Language-Driven Open-Vocabulary 3D Scene Understanding

1 code implementation • CVPR 2023 • Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

Open-vocabulary scene understanding aims to localize and recognize unseen categories beyond the annotated label space.

Ranked #2 on 3D Open-Vocabulary Instance Segmentation on S3DIS

3D Open-Vocabulary Instance Segmentation Contrastive Learning +4

216


Is synthetic data from generative models ready for image recognition?

1 code implementation • 14 Oct 2022 • Ruifei He, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, Xiaojuan Qi

Recent text-to-image generation models have shown promising results in generating high-fidelity photo-realistic images.

Text-to-Image Generation Transfer Learning

165


1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words

no code implementations • 1 Sep 2022 • Zhangzi Zhu, Chuhui Xue, Yu Hao, Wenqing Zhang, Song Bai

Our oCLIP-based model achieves 28.59% in h-mean, which ranks 1st in the end-to-end OOV word recognition track of the OOV Challenge at the ECCV 2022 TiE Workshop.

Autonomous Driving Scene Text Recognition +1


Runner-Up Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition

no code implementations • 4 Aug 2022 • Zhangzi Zhu, Yu Hao, Wenqing Zhang, Chuhui Xue, Song Bai

This report presents our 2nd place solution to the ECCV 2022 challenge on Out-of-Vocabulary Scene Text Understanding (OOV-ST): Cropped Word Recognition.


Contextual Text Block Detection towards Scene Text Understanding

no code implementations • 26 Jul 2022 • Chuhui Xue, Jiaxing Huang, Shijian Lu, Changhu Wang, Song Bai

We formulate the new setup by a dual detection task which first detects integral text units and then groups them into a CTB.

Text Classification +2


Fourier Document Restoration for Robust Document Dewarping and Recognition

1 code implementation • CVPR 2022 • Chuhui Xue, Zichen Tian, Fangneng Zhan, Shijian Lu, Song Bai

State-of-the-art document dewarping techniques learn to predict 3-dimensional information of documents which are prone to errors while dealing with documents with irregular distortions or large variations in depth.


Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting

no code implementations • 8 Mar 2022 • Chuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip Torr, Song Bai

Our network consists of an image encoder and a character-aware text encoder that extract visual and textual features, respectively, as well as a visual-textual decoder that models the interaction among textual and visual features for learning effective scene text representations.

Optical Character Recognition (OCR) +2


Contextual Text Detection

no code implementations • 29 Sep 2021 • Chuhui Xue, Jiaxing Huang, Wenqing Zhang, Shijian Lu, Song Bai, Changhu Wang

This paper presents Contextual Text Detection, a new setup that detects contextual text blocks for better understanding of texts in scenes.

Text Detection


I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition

no code implementations • 18 May 2021 • Chuhui Xue, Jiaxing Huang, Wenqing Zhang, Shijian Lu, Changhu Wang, Song Bai

The first task focuses on image-to-character (I2C) mapping, which detects a set of character candidates from images based on different alignments of visual features in a non-sequential way.

Decoder Scene Text Recognition


Detection and Rectification of Arbitrary Shaped Scene Texts by using Text Keypoints and Links

no code implementations • 1 Mar 2021 • Chuhui Xue, Shijian Lu, Steven Hoi

Detection and recognition of scene texts of arbitrary shapes remain a grand challenge due to the super-rich text shape variation in text line orientations, lengths, curvatures, etc.

Scene Text Detection Text Detection


GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition

no code implementations • ICCV 2019 • Fangneng Zhan, Chuhui Xue, Shijian Lu

Recent adversarial learning research has achieved very impressive progress for modelling cross-domain data shifts in appearance space but its counterpart in modelling cross-domain shifts in geometry space lags far behind.

Domain Adaptation Scene Text Detection +1


MSR: Multi-Scale Shape Regression for Scene Text Detection

no code implementations • 9 Jan 2019 • Chuhui Xue, Shijian Lu, Wei Zhang

State-of-the-art scene text detection techniques predict quadrilateral boxes that are prone to localization errors while dealing with straight or curved text lines of different orientations and lengths in scenes.

regression Scene Text Detection +1


Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping

no code implementations • ECCV 2018 • Chuhui Xue, Shijian Lu, Fangneng Zhan

This paper presents a scene text detection technique that exploits bootstrapping and text border semantics for accurate localization of texts in scenes.

Scene Text Detection Text Detection


Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes

no code implementations • ECCV 2018 • Fangneng Zhan, Shijian Lu, Chuhui Xue

This paper presents a novel image synthesis technique that aims to generate a large amount of annotated scene text images for training accurate and robust scene text detection and recognition models.

Image Generation Scene Text Detection +2

