1 code implementation • 19 Mar 2024 • Qiangguo Jin, Hui Cui, Changming Sun, Yang song, Jiangbin Zheng, Leilei Cao, Leyi Wei, Ran Su
To address these issues, we first propose a novel inter- and intra-uncertainty regularization method to measure and constrain both inter- and intra-inconsistencies in the teacher-student architecture.
1 code implementation • 27 Nov 2022 • Leilei Cao, Yibo Guo, Ye Yuan, Qiangguo Jin
In this way, the spatial details can be better captured and the semantic features of target class in the query image can be focused.
Ranked #11 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
no code implementations • 28 Jun 2022 • Bo Yan, Leilei Cao, Zhuang Li, Hongbin Wang
Finally, our approach achieves 63. 008\%AP@0. 50:0. 95 on the test set of CVPR2022 AVA Challenge.
no code implementations • 24 Jun 2022 • Bo Yan, Leilei Cao, Fengliang Qi, Hongbin Wang
Firstly, we designed a context branch based on channel splitting network with transformer to obtain sufficient context information.
no code implementations • 24 Jun 2022 • Leilei Cao, Zhuang Li, Bo Yan, Feng Zhang, Fengliang Qi, Yuchen Hu, Hongbin Wang
The referring video object segmentation task (RVOS) aims to segment object instances in a given video referred by a language expression in all video frames.
no code implementations • 2 Dec 2021 • Bo Yan, Fengliang Qi, Leilei Cao, Hongbin Wang
Finally, our approach can achieve 40. 2\%AP@0. 50:0. 95 on the test set of ICCV2021 VIPriors instance segmentation challenge.
no code implementations • 2 Dec 2021 • Fengliang Qi, Bo Yan, Leilei Cao, Hongbin Wang
Person re-identification (re-ID) aims to identify the same person of interest across non-overlapping capturing cameras, which plays an important role in visual surveillance applications and computer vision research areas.
no code implementations • 2 Dec 2021 • Bo Yan, Leilei Cao, Hongbin Wang
Based on VSPW, we design a Temporal Bilateral Network with Vision Transformer.
no code implementations • 21 May 2021 • Leilei Cao, Yao Xiao, Lin Xu
Modern face detectors employ feature pyramids to deal with scale variation.
no code implementations • 4 Dec 2020 • Leilei Cao, Tong Yang, Yixu Wang, Bo Yan, Yandong Guo
Thus, our model consists of a pyramid of fully convolutional GANs, wherein the content GAN is responsible for completing contents in the lowest-resolution masked image, and each texture GAN is responsible for synthesizing textures in a higher-resolution image.