1 code implementation • 6 Dec 2023 • Xiaobo Yang, Xiaojin Gong
This work aims to leverage pre-trained foundation models, such as contrastive language-image pre-training (CLIP) and segment anything model (SAM), to address weakly supervised semantic segmentation (WSSS) using image-level labels.
1 code implementation • 2 Nov 2023 • Menglin Wang, Xiaojin Gong
To eliminate the confounding effect of camera bias, we propose to learn both intra- and inter-camera invariance under a unified framework.
1 code implementation • 26 Oct 2023 • Jiachen Li, Xiaojin Gong
Although prompt learning has enabled a recent work named CLIP-ReID to achieve promising performance, the underlying mechanisms and the necessity of prompt learning remain unclear due to the absence of semantic labels in ReID tasks.
Ranked #1 on Unsupervised Vehicle Re-Identification on VeRi-776
Contrastive Learning Unsupervised Person Re-Identification +1
1 code implementation • 31 Mar 2023 • Shengyang Sun, Xiaojin Gong
That is, clip-level pseudo labels generated from each network are used to supervise the other one at the next training round, and the two networks are learned alternatively and iteratively.
no code implementations • CVPR 2023 • Shengyang Sun, Xiaojin Gong
In this work, we propose a hierarchical semantic contrast (HSC) method to learn a scene-aware VAD model from normal videos.
1 code implementation • 22 Nov 2022 • Jiachen Li, Menglin Wang, Xiaojin Gong
To this end, we build a dual-branch network architecture based upon a modified Vision Transformer (ViT).
1 code implementation • CVPR 2022 • Mu Hu, Junyi Feng, Jiashen Hua, Baisheng Lai, Jianqiang Huang, Xiaojin Gong, Xiansheng Hua
Structural re-parameterization has drawn increasing attention in various computer vision tasks.
1 code implementation • 15 Jan 2022 • Menglin Wang, Jiachen Li, Baisheng Lai, Xiaojin Gong, Xian-Sheng Hua
Assisted with the camera-aware proxies, we design two proxy-level contrastive learning losses that are, respectively, based on offline and online association results.
3 code implementations • 1 Mar 2021 • Mu Hu, Shuling Wang, Bin Li, Shiyu Ning, Li Fan, Xiaojin Gong
More specifically, one branch inputs a color image and a sparse depth map to predict a dense depth map.
Ranked #3 on Depth Completion on KITTI Depth Completion
no code implementations • 5 Jan 2021 • Bin Li, Mu Hu, Shuling Wang, Lianghao Wang, Xiaojin Gong
To this end, we design a self-supervised visual-lidar odometry (Self-VLO) framework.
1 code implementation • 19 Dec 2020 • Menglin Wang, Baisheng Lai, Jianqiang Huang, Xiaojin Gong, Xian-Sheng Hua
These camera-aware proxies enable us to deal with large intra-ID variance and generate more reliable pseudo labels for learning.
no code implementations • 12 Feb 2020 • Menglin Wang, Baisheng Lai, Haokun Chen, Jianqiang Huang, Xiaojin Gong, Xian-Sheng Hua
Our approach performs even comparable to state-of-the-art fully supervised methods in two of the datasets.
1 code implementation • 12 Oct 2019 • Qi Yao, Xiaojin Gong
Weakly supervised semantic segmentation (WSSS) using only image-level labels can greatly reduce the annotation cost and therefore has attracted considerable research interest.
Ranked #35 on Weakly-Supervised Semantic Segmentation on COCO 2014 val (using extra training data)
1 code implementation • 5 Jan 2019 • Ningning Wang, Xiaojin Gong
RGB-D salient object detection aims to identify the most visually distinctive objects in a pair of color and depth images.
no code implementations • 14 Dec 2018 • Menglin Wang, Baisheng Lai, Zhongming Jin, Xiaojin Gong, Jianqiang Huang, Xian-Sheng Hua
With the gained annotations of the actively selected candidates, the tracklets' pesudo labels are updated by label merging and further used to re-train our re-ID model.
no code implementations • 21 Jun 2017 • Baisheng Lai, Xiaojin Gong
To address this issue, this paper integrates saliency into a deep architecture, in which the location in- formation is explored both explicitly and implicitly.
no code implementations • CVPR 2016 • Baisheng Lai, Xiaojin Gong
In this paper, we propose a novel method to perform weakly-supervised image parsing based on the dictionary learning framework.
no code implementations • 29 Jun 2014 • Wenqi Huang, Xiaojin Gong
This paper addresses the problem of holistic road scene understanding based on the integration of visual and range data.