Semi-Supervised Object Detection with Adaptive Class-Rebalancing Self-Training

11 Jul 2021  ·  Fangyuan Zhang, Tianxiang Pan, Bin Wang ·

This study delves into semi-supervised object detection (SSOD) to improve detector performance with additional unlabeled data. State-of-the-art SSOD performance has been achieved recently by self-training, in which training supervision consists of ground truths and pseudo-labels. In current studies, we observe that class imbalance in SSOD severely impedes the effectiveness of self-training. To address the class imbalance, we propose adaptive class-rebalancing self-training (ACRST) with a novel memory module called CropBank. ACRST adaptively rebalances the training data with foreground instances extracted from the CropBank, thereby alleviating the class imbalance. Owing to the high complexity of detection tasks, we observe that both self-training and data-rebalancing suffer from noisy pseudo-labels in SSOD. Therefore, we propose a novel two-stage filtering algorithm to generate accurate pseudo-labels. Our method achieves satisfactory improvements on MS-COCO and VOC benchmarks. When using only 1\% labeled data in MS-COCO, our method achieves 17.02 mAP improvement over supervised baselines, and 5.32 mAP improvement compared with state-of-the-art methods.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Semi-Supervised Object Detection COCO 0.5% labeled data Adaptive Rebalancing mAP 19.62±0.37 # 2
Semi-Supervised Object Detection COCO 100% labeled data Adaptive Class-Rebalancing mAP 42.79 # 9
Semi-Supervised Object Detection COCO 10% labeled data Adaptive Class-Rebalancing mAP 34.92±0.22 # 13
Semi-Supervised Object Detection COCO 1% labeled data Adaptive Class-Rebalancing mAP 26.07±0.46 # 3
Semi-Supervised Object Detection COCO 2% labeled data Adaptive Class-Rebalancing mAP 28.69±0.17 # 6
Semi-Supervised Object Detection COCO 5% labeled data Adaptive Class-Rebalancing mAP 31.35±0.13 # 13

Methods


No methods listed for this paper. Add relevant methods here