Search Results for author: Xin-Shun Xu

Found 12 papers, 2 papers with code

Effective and Robust Adversarial Training against Data and Label Corruptions

no code implementations • 7 May 2024 • Peng-Fei Zhang, Zi Huang, Xin-Shun Xu, Guangdong Bai

We propose a hybrid adversarial training surrounding multiple potential adversarial perturbations, alongside a semi-supervised learning based on class- rebalancing sample selection to enhance the resilience of the model for dual corruption.

Paper
Add Code

Bias Mitigating Few-Shot Class-Incremental Learning

no code implementations • 1 Feb 2024 • Li-Jun Zhao, Zhen-Duo Chen, Zi-Chao Zhang, Xin Luo, Xin-Shun Xu

Some recent methods somewhat alleviate the accuracy imbalance between base and incremental classes by fine-tuning the feature extractor in the incremental sessions, but they further cause the accuracy imbalance between past and current incremental classes.

Few-Shot Class-Incremental Learning Incremental Learning

Paper
Add Code

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models

no code implementations • 26 Nov 2023 • Yu-Wei Zhan, Fan Liu, Xin Luo, Liqiang Nie, Xin-Shun Xu, Mohan Kankanhalli

To capitalize on these rich Human-Centric Visual Cues, we propose a novel approach named HCVC for HOI detection.

Human-Object Interaction Detection

Paper
Add Code

Federated Class-Incremental Learning with Prompting

no code implementations • 13 Oct 2023 • Jiale Liu, Yu-Wei Zhan, Chong-Yu Zhang, Xin Luo, Zhen-Duo Chen, Yinwei Wei, Xin-Shun Xu

For FCIL, the local and global models may suffer from catastrophic forgetting on old classes caused by the arrival of new classes and the data distributions of clients are non-independent and identically distributed (non-iid).

Class Incremental Learning Federated Learning +1

Paper
Add Code

FedVMR: A New Federated Learning method for Video Moment Retrieval

no code implementations • 28 Oct 2022 • Yan Wang, Xin Luo, Zhen-Duo Chen, Peng-Fei Zhang, Meng Liu, Xin-Shun Xu

As the first that is explored in VMR field, the new task is defined as video moment retrieval with distributed data.

Federated Learning Moment Retrieval +1

Paper
Add Code

Prototype-Based Layered Federated Cross-Modal Hashing

no code implementations • 27 Oct 2022 • Jiale Liu, Yu-Wei Zhan, Xin Luo, Zhen-Duo Chen, Yongxin Wang, Xin-Shun Xu

And due to the problems of statistical heterogeneity, model heterogeneity, and forcing each client to accept the same parameters, applying federated learning to cross-modal hash learning becomes very tricky.

Personalized Federated Learning

Paper
Add Code

Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval

no code implementations • 12 Apr 2022 • Yu-Wei Zhan, Xin Luo, Yongxin Wang, Zhen-Duo Chen, Xin-Shun Xu

To narrow the domain differences between sketches and images, we extract edge maps for natural images and treat them as a bridge between images and sketches, which have similar content to images and similar style to sketches.

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator

no code implementations • 24 Mar 2022 • Zi-Chao Zhang, Zhen-Duo Chen, Yongxin Wang, Xin Luo, Xin-Shun Xu

Recently, several Vision Transformer (ViT) based methods have been proposed for Fine-Grained Visual Classification (FGVC). These methods significantly surpass existing CNN-based ones, demonstrating the effectiveness of ViT in FGVC tasks. However, there are some limitations when applying ViT directly to FGVC. First, ViT needs to split images into patches and calculate the attention of every pair, which may result in heavy redundant calculation and unsatisfying performance when handling fine-grained images with complex background and small objects. Second, a standard ViT only utilizes the class token in the final layer for classification, which is not enough to extract comprehensive fine-grained information.

Fine-Grained Image Classification

Paper
Add Code

Online Enhanced Semantic Hashing: Towards Effective and Efficient Retrieval for Streaming Multi-Modal Data

1 code implementation • 9 Sep 2021 • Xiao-Ming Wu, Xin Luo, Yu-Wei Zhan, Chen-Lu Ding, Zhen-Duo Chen, Xin-Shun Xu

With the vigorous development of multimedia equipment and applications, efficient retrieval of large-scale multi-modal data has become a trendy research topic.

Retrieval

Paper
Code

Weakly-Supervised Online Hashing

no code implementations • 16 Sep 2020 • Yu-Wei Zhan, Xin Luo, Yu Sun, Yongxin Wang, Zhen-Duo Chen, Xin-Shun Xu

However, existing hashing methods for social image retrieval are based on batch mode which violates the nature of social images, i. e., social images are usually generated periodically or collected in a stream fashion.

Image Retrieval Retrieval

Paper
Add Code

Deep Recurrent Quantization for Generating Sequential Binary Codes

1 code implementation • 16 Jun 2019 • Jingkuan Song, Xiaosu Zhu, Lianli Gao, Xin-Shun Xu, Wu Liu, Heng Tao Shen

To the end, when the model is trained, a sequence of binary codes can be generated and the code length can be easily controlled by adjusting the number of recurrent iterations.

Image Retrieval Quantization +1

Paper
Code

Optimized Cartesian $K$-Means

no code implementations • 16 May 2014 • Jianfeng Wang, Jingdong Wang, Jingkuan Song, Xin-Shun Xu, Heng Tao Shen, Shipeng Li

In OCKM, multiple sub codewords are used to encode the subvector of a data point in a subspace.

Quantization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.