no code implementations • 16 Apr 2024 • Zexin Li, Yiyang Lin, Zijie Fang, Shuyan Li, Xiu Li
In this paper, we propose the Attention-Based Varifocal Generative Adversarial Network (AV-GAN), which solves multiple problems in pathologic image translation tasks, such as uneven translation difficulty in different regions, mutual interference of multiple resolution information, and nuclear deformation.
no code implementations • 13 Mar 2024 • Long Lan, Fengxiang Wang, Shuyan Li, Xiangtao Zheng, Zengmao Wang, Xinwang Liu
Directly fine-tuning VLMs for RS-FGSC often encounters the challenge of overfitting to the seen classes, resulting in suboptimal generalization to unseen classes, which highlights the difficulty in differentiating complex backgrounds and capturing distinct ship features.
no code implementations • 12 Oct 2023 • Qiang Li, Dan Zhang, Shengzhao Lei, Xun Zhao, Porawit Kamnoedboon, Weiwei Li, Junhao Dong, Shuyan Li
Despite the promising performance of existing visual models on public benchmarks, the critical assessment of their robustness for real-world applications remains an ongoing challenge.
1 code implementation • NeurIPS 2023 • Zhuoyan Luo, Yicheng Xiao, Yong Liu, Shuyan Li, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yang
To address this issue, we propose Semantic-assisted Object Cluster (SOC), which aggregates video content and textual guidance for unified temporal modeling and cross-modal alignment.
Ranked #2 on Referring Expression Segmentation on A2D Sentences (using extra training data)
no code implementations • 21 Apr 2023 • Mengqun Jin, Kai Li, Shuyan Li, Chunming He, Xiu Li
We further propose a consistency learning based mean teacher model to effectively adapt the learned UDA model using labeled and unlabeled target samples.
Semi-supervised Domain Adaptation • Unsupervised Domain Adaptation
1 code implementation • 12 Mar 2023 • Haonan Han, Rui Yang, Shuyan Li, Runze Hu, Xiu Li
Interactive devices with touch screens have become commonly used in various aspects of daily life, which raises the demand for high production quality of touch screen glass.
no code implementations • 12 Feb 2023 • Yicheng Xiao, Yue Ma, Shuyan Li, Hantao Zhou, Ran Liao, Xiu Li
In this paper, we propose SemanticAC, a semantics-assisted framework for Audio Classification to better leverage the semantic information.
no code implementations • 11 Jan 2023 • Qiaosong Chu, Shuyan Li, Guangyi Chen, Kai Li, Xiu Li
Source-free object detection (SFOD) aims to transfer a detector pre-trained on a label-rich source domain to an unlabeled target domain without seeing source data.
1 code implementation • CVPR 2021 • Shuyan Li, Xiu Li, Jiwen Lu, Jie Zhou
Most existing unsupervised video hashing methods are built on unidirectional models with less reliable training objectives, which underuse the correlations among frames and the similarity structure between videos.
no code implementations • ICCV 2019 • Shuyan Li, Zhixiang Chen, Jiwen Lu, Xiu Li, Jie Zhou
We then integrate the neighborhood attention mechanism into an RNN-based reconstruction scheme to encourage the binary codes to capture the spatial-temporal structure in a video which is consistent with that in the neighborhood.