no code implementations • 18 Apr 2024 • Zeliang Ma, Song Yang, Zhe Cui, Zhicheng Zhao, Fei Su, Delong Liu, Jingyu Wang
The new trend in multi-object tracking task is to track objects of interest using natural language.
no code implementations • 13 Apr 2024 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Hongying Meng
Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods are default to the setting that data from source and target domains share identical categories.
1 code implementation • 7 Mar 2024 • Yunhao Du, Zhicheng Zhao, Fei Su
To this end, we present the Refer-VI-ReID settings, which aims to match target visible images from both infrared images and coarse language descriptions (e. g., "a man with red top and black pants") to complement the missing color information.
no code implementations • IEEE Transactions on Circuits and Systems for Video Technology 2024 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Member, IEEE, Aidong Men, and Yuan Dong
In this paper, we propose an instance paradigm contrastive learning framework, introducing contrast between original features and novel paradigms to alleviate domain-specific distractions.
1 code implementation • 25 Dec 2023 • Yunhao Du, Cheng Lei, Zhicheng Zhao, Fei Su
Referring multi-object tracking (RMOT) aims to track multiple objects based on input textual descriptions.
1 code implementation • 15 Dec 2023 • Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
To address this issue, we propose a novel vehicle-centric pre-training framework called VehicleMAE, which incorporates the structural information including the spatial structure from vehicle profile information and the semantic structure from informative high-level natural language descriptions for effective masked vehicle appearance reconstruction.
1 code implementation • 27 Nov 2023 • Yunhao Du, Cheng Lei, Zhicheng Zhao, Yuan Dong, Fei Su
Previous methods focus on learning from cross-modality person images in different cameras.
1 code implementation • 25 Nov 2023 • Delong Liu, Haiwen Li, Zhicheng Zhao, Fei Su, Yuan Dong
Searching for specific person has great social benefits and security value, and it often involves a combination of visual and textual information.
Ranked #1 on Zero-shot Composed Person Retrieval on ITCPR dataset (using extra training data)
no code implementations • 16 Nov 2023 • Zhu Meng, Junhao Dong, Limei Guo, Fei Su, Guangxi Wang, Zhicheng Zhao
Since signet ring cells (SRCs) are associated with high peripheral metastasis rate and dismal survival, they play an important role in determining surgical approaches and prognosis, while they are easily missed by even experienced pathologists.
1 code implementation • 27 Sep 2023 • Zhongling Huang, Chong Wu, Xiwen Yao, Zhicheng Zhao, Xiankai Huang, Junwei Han
There has been a recent emphasis on integrating physical models and deep neural networks (DNNs) for SAR target recognition, to improve performance and achieve a higher level of physical interpretability.
no code implementations • 19 Jul 2023 • Junhao Dong, Zhu Meng, Delong Liu, Zhicheng Zhao, Fei Su
Prototype-based classification is a classical method in machine learning, and recently it has achieved remarkable success in semi-supervised semantic segmentation.
1 code implementation • 13 Mar 2023 • Ziqi He, Mengjia Xue, Yunhao Du, Zhicheng Zhao, Fei Su
To address this problem, we propose a dynamic clustering and cluster contrastive learning (DCCC) method.
no code implementations • 22 Jan 2023 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Aidong Men
In this paper, we propose a Dual-Contrastive Learning (DCL) module on feature and prototype contrast.
no code implementations • 23 Aug 2022 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Aidong Men, Hong Chen
Recently, out-of-distribution (OOD) generalization has attracted attention to the robustness and generalization ability of deep learning based models, and accordingly, many strategies have been made to address different aspects related to this issue.
1 code implementation • 18 Apr 2022 • Yunhao Du, Binyu Zhang, Xiangning Ruan, Fei Su, Zhicheng Zhao, Hong Chen
For the textual representation, one global embedding, three local embeddings and a color-type prompt embedding are extracted to represent various granularities of semantic features.
14 code implementations • 28 Feb 2022 • Yunhao Du, Zhicheng Zhao, Yang song, Yanyun Zhao, Fei Su, Tao Gong, Hongying Meng
As a result, the construction of a good baseline for a fair comparison is essential.
Ranked #7 on Multi-Object Tracking on MOT17 (using extra training data)
no code implementations • 5 Aug 2017 • Wenhui Jiang, Thuyen Ngo, B. S. Manjunath, Zhicheng Zhao, Fei Su
This region selection procedure is further integrated into a CNN-based weakly supervised detection (WSD) framework, and can be performed in each stochastic gradient descent mini-batch during training.