1 code implementation • 15 Apr 2024 • Yaohui Li, Qifeng Zhou, Haoxing Chen, Jianbing Zhang, Xinyu Dai, Hao Zhou
Few-shot learning aims to further enhance the transfer capability of CLIP by providing a few labeled images per class, known as 'few shots'.
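A minimal sketch of the few-shot setting on top of frozen image embeddings (random vectors stand in for CLIP features here; the prototype-based classifier is an illustrative baseline, not this paper's method): average the few shots of each class into a prototype and classify queries by cosine similarity.

```python
import numpy as np

def build_prototypes(support_feats, support_labels, num_classes):
    """Average the few-shot embeddings of each class into one prototype."""
    protos = np.stack([support_feats[support_labels == c].mean(axis=0)
                       for c in range(num_classes)])
    # L2-normalise so cosine similarity reduces to a dot product
    return protos / np.linalg.norm(protos, axis=1, keepdims=True)

def classify(query_feat, prototypes):
    """Assign the query to the class with the most similar prototype."""
    q = query_feat / np.linalg.norm(query_feat)
    return int(np.argmax(prototypes @ q))

# toy example: 2 classes, 2 shots each, 4-d stand-in features
rng = np.random.default_rng(0)
support = np.vstack([rng.normal(1.0, 0.1, (2, 4)),    # class 0 clusters near +1
                     rng.normal(-1.0, 0.1, (2, 4))])  # class 1 clusters near -1
labels = np.array([0, 0, 1, 1])
protos = build_prototypes(support, labels, num_classes=2)
print(classify(rng.normal(1.0, 0.1, 4), protos))  # → 0
```

With real CLIP features the same prototype step is a common training-free baseline that few-shot methods then improve upon.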
1 code implementation • 15 Apr 2024 • Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang
Recent advancements in efficient transfer learning (ETL) have shown remarkable success in fine-tuning VLMs with limited data, introducing only a few parameters to harness task-specific insights from VLMs.
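One common way to add "only a few parameters" on top of a frozen backbone is a bottleneck adapter with a residual connection; the sketch below (dimensions and zero-initialisation are illustrative assumptions, not this paper's design) shows why the trainable budget stays tiny.

```python
import numpy as np

class Adapter:
    """Tiny bottleneck adapter applied to frozen backbone features.
    Only w_down / w_up would be trained; the backbone stays untouched."""
    def __init__(self, dim=512, bottleneck=16, seed=0):
        rng = np.random.default_rng(seed)
        self.w_down = rng.normal(0, 0.02, (dim, bottleneck))
        self.w_up = np.zeros((bottleneck, dim))  # zero-init: starts as identity

    def __call__(self, x):
        # residual connection keeps the pretrained representation intact
        return x + np.maximum(x @ self.w_down, 0.0) @ self.w_up

    def num_params(self):
        return self.w_down.size + self.w_up.size

adapter = Adapter()
feats = np.ones((1, 512))
out = adapter(feats)
print(adapter.num_params())  # 2 * 512 * 16 = 16384 trainable parameters
```

Zero-initialising the up-projection makes the adapter an identity at the start of training, so fine-tuning begins from the pretrained model's behaviour.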
no code implementations • 20 Dec 2023 • Haoxing Chen, Yaohui Li, Zhangxuan Gu, Zhuoer Xu, Jun Lan, Huaxiong Li
Image harmonization is a crucial technique in image composition that adjusts the foreground of a composite image so that it blends seamlessly with the background.
1 code implementation • 21 Nov 2023 • Haoxing Chen, Yaohui Li, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang
Recent methods mainly focus on learning multi-modal features aligned with class names to enhance the generalization ability to unseen categories.
Ranked #1 on GZSL Video Classification on ActivityNet-GZSL (cls)
1 code implementation • NeurIPS 2023 • Haoxing Chen, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Xing Zheng, Yaohui Li, Changhua Meng, Huijia Zhu, Weiqiang Wang
Specifically, we build our model on a diffusion model and carefully modify the network structure to enable it to draw multilingual characters with the help of glyph and position information.
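A simple way to inject glyph and position guidance into a denoiser is to concatenate them with the noisy latent along the channel axis; the sketch below illustrates that conditioning pattern only and is not the paper's exact network modification.

```python
import numpy as np

def prepare_denoiser_input(noisy_latent, glyph_map, position_mask):
    """Concatenate glyph and position conditions with the noisy latent
    along the channel axis, one simple way to inject layout guidance.
    (An illustrative sketch, not the paper's exact architecture.)"""
    assert noisy_latent.shape[1:] == glyph_map.shape[1:] == position_mask.shape[1:]
    return np.concatenate([noisy_latent, glyph_map, position_mask], axis=0)

latent = np.zeros((4, 32, 32))    # noisy image latent, 4 channels
glyph = np.ones((1, 32, 32))      # rendered glyph of the target characters
position = np.zeros((1, 32, 32))  # binary mask marking where the text goes
x = prepare_denoiser_input(latent, glyph, position)
print(x.shape)  # (6, 32, 32)
```

The denoiser's first convolution then only needs extra input channels; the rest of the pretrained network can be reused.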
1 code implementation • CVPR 2023 • Zhangxuan Gu, Zhuoer Xu, Haoxing Chen, Jun Lan, Changhua Meng, Weiqiang Wang
Recent object detection approaches rely on pretrained vision-language models for image-text alignment.
2 code implementations • 6 Dec 2022 • Zhangxuan Gu, Haoxing Chen, Zhuoer Xu, Jun Lan, Changhua Meng, Weiqiang Wang
Extensive experimental results on COCO and LVIS show that DiffusionInst achieves competitive performance compared to existing instance segmentation models with various backbones, such as ResNet and Swin Transformers.
Ranked #8 on Instance Segmentation on LVIS v1.0 val
1 code implementation • 16 Nov 2022 • Haoxing Chen, Zhangxuan Gu, Yaohui Li, Jun Lan, Changhua Meng, Weiqiang Wang, Huaxiong Li
The MGD applies distinct convolutions to the foreground and background, learning representations of both regions as well as their correlations with the global harmonization, thereby achieving local visual consistency far more efficiently.
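The core idea of applying distinct convolutions to foreground and background can be sketched as a per-pixel kernel switch driven by the composite mask (a naive illustrative loop, not the MGD's actual efficient implementation):

```python
import numpy as np

def masked_dual_conv(image, mask, k_fg, k_bg):
    """Apply one 3x3 kernel to foreground pixels and another to background
    pixels, selected per location by the composite mask (True = foreground)."""
    h, w = image.shape
    pad = np.pad(image, 1)
    out = np.zeros_like(image)
    for i in range(h):
        for j in range(w):
            patch = pad[i:i + 3, j:j + 3]
            k = k_fg if mask[i, j] else k_bg
            out[i, j] = (patch * k).sum()
    return out

img = np.arange(16, dtype=float).reshape(4, 4)
msk = np.zeros((4, 4), dtype=bool)
msk[1:3, 1:3] = True                               # central foreground region
identity = np.zeros((3, 3)); identity[1, 1] = 1.0  # pass-through kernel
box = np.full((3, 3), 1 / 9.0)                     # blur kernel
out = masked_dual_conv(img, msk, k_fg=identity, k_bg=box)
```

Foreground pixels here pass through unchanged while background pixels are blurred, demonstrating how the two regions receive different filtering in a single pass.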
Ranked #2 on Image Harmonization on HAdobe5k (1024×1024)
1 code implementation • 16 Jul 2022 • Zizheng Huang, Haoxing Chen, Ziqi Wen, Chao Zhang, Huaxiong Li, Bo Wang, Chunlin Chen
Contrastive learning (CL) continues to achieve significant breakthroughs across multiple domains.
1 code implementation • 13 Dec 2021 • Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen
Under the guidance of attribute modality, our method can learn enhanced semantic-aware representation for classification.
1 code implementation • 27 Sep 2021 • Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen
Finally, we propose an image patch-matching module that calculates the distance between dense local representations, thus determining which support-set category the query image belongs to.
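A patch-matching score between dense local representations can be sketched as follows (a best-match cosine aggregation, illustrative only; the paper's module may aggregate differently):

```python
import numpy as np

def patch_match_score(query_patches, support_patches):
    """For every local descriptor of the query, find its best-matching
    support descriptor and sum the cosine similarities (higher = closer)."""
    q = query_patches / np.linalg.norm(query_patches, axis=1, keepdims=True)
    s = support_patches / np.linalg.norm(support_patches, axis=1, keepdims=True)
    sim = q @ s.T                 # (num_query_patches, num_support_patches)
    return sim.max(axis=1).sum()  # best match per query patch

rng = np.random.default_rng(1)
query = rng.normal(size=(9, 8))               # 9 local patches, 8-d each
same_class = query + rng.normal(0, 0.05, (9, 8))
other_class = rng.normal(size=(9, 8))
print(patch_match_score(query, same_class) >
      patch_match_score(query, other_class))  # True
```

Because matching is per patch rather than per image, the score stays robust when the object appears at a different position in the query.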
Ranked #16 on Few-Shot Image Classification on FC100 5-way (1-shot)
no code implementations • 21 Mar 2021 • Yaohui Li, Huaxiong Li, Haoxing Chen, Chunlin Chen
Few-shot image classification aims at recognizing unseen categories with only a small number of labeled training samples.
no code implementations • 21 Mar 2021 • Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen
Moreover, a Multi-level Metric Learning (MML) method is proposed, which not only calculates the pixel-level similarity but also considers the similarity of part-level features and global-level features.
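The three granularities can be combined as a weighted similarity over one feature map; the sketch below (2×2 part pooling and equal-ish weights are illustrative assumptions, not the paper's exact MML formulation) shows the pixel-, part-, and global-level terms.

```python
import numpy as np

def cos(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def multi_level_similarity(q_map, s_map, weights=(0.4, 0.3, 0.3)):
    """Compare two (C, H, W) feature maps at three granularities:
    pixel level (per location), part level (2x2 regions), global level."""
    c, h, w = q_map.shape
    # pixel level: mean cosine similarity over all spatial locations
    pix = np.mean([cos(q_map[:, i, j], s_map[:, i, j])
                   for i in range(h) for j in range(w)])
    # part level: average-pool into 2x2 parts, then compare part vectors
    qp = q_map.reshape(c, 2, h // 2, 2, w // 2).mean(axis=(2, 4)).reshape(c, -1)
    sp = s_map.reshape(c, 2, h // 2, 2, w // 2).mean(axis=(2, 4)).reshape(c, -1)
    part = np.mean([cos(qp[:, k], sp[:, k]) for k in range(qp.shape[1])])
    # global level: compare globally pooled vectors
    glob = cos(q_map.mean(axis=(1, 2)), s_map.mean(axis=(1, 2)))
    w1, w2, w3 = weights
    return w1 * pix + w2 * part + w3 * glob

fmap = np.random.default_rng(2).normal(size=(16, 4, 4))
print(round(multi_level_similarity(fmap, fmap), 6))  # identical maps → 1.0
```

Pixel-level terms capture fine local evidence, part-level terms tolerate small misalignments, and the global term keeps holistic appearance in play.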
no code implementations • 30 Nov 2020 • Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen
Then, an adaptive task attention module is proposed to select the most important local representations (LRs) across the entire task.
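One way to realise task-conditioned attention over local representations is to score each LR by its closest match among all support features of the task and softmax the scores; the sketch below illustrates that idea only and is not the paper's exact module.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def task_attention_weights(local_reps, task_reps, tau=0.1):
    """Score each query local representation (LR) by its closest match
    among all support features of the task, then softmax the scores into
    attention weights so task-irrelevant local regions are down-weighted."""
    l = local_reps / np.linalg.norm(local_reps, axis=1, keepdims=True)
    t = task_reps / np.linalg.norm(task_reps, axis=1, keepdims=True)
    scores = (l @ t.T).max(axis=1)  # best task match per local descriptor
    return softmax(scores / tau)

rng = np.random.default_rng(3)
task = rng.normal(size=(20, 8))              # pooled support features of the task
relevant = task[0] + rng.normal(0, 0.01, 8)  # an LR close to the task
clutter = rng.normal(size=(3, 8))            # background LRs
weights = task_attention_weights(np.vstack([relevant[None], clutter]), task)
print(weights.argmax())  # the task-relevant LR receives the largest weight
```

The temperature `tau` controls how sharply attention concentrates on the best-matching local representations.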