Scale-Aware Trident Networks for Object Detection

Scale variation is one of the key challenges in object detection. In this work, we first present a controlled experiment to investigate the effect of receptive fields for scale variation in object detection. Based on the findings from the exploration experiments, we propose a novel Trident Network (TridentNet) aiming to generate scale-specific feature maps with a uniform representational power. We construct a parallel multi-branch architecture in which each branch shares the same transformation parameters but with different receptive fields. Then, we adopt a scale-aware training scheme to specialize each branch by sampling object instances of proper scales for training. As a bonus, a fast approximation version of TridentNet could achieve significant improvements without any additional parameters and computational cost compared with the vanilla detector. On the COCO dataset, our TridentNet with ResNet-101 backbone achieves state-of-the-art single-model results of 48.4 mAP. Codes are available at https://git.io/fj5vR.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Object Detection COCO minival TridentNet (ResNet-101) box AP 42 # 144
AP50 63.5 # 49
AP75 45.5 # 60
APS 24.9 # 52
APM 47 # 37
APL 56.9 # 51
Object Detection COCO test-dev TridentNet (ResNet-101) box mAP 42.7 # 166
AP50 63.6 # 97
AP75 46.5 # 108
APS 23.9 # 106
APM 46.6 # 92
APL 56.6 # 85
Hardware Burden None # 1
Operations per network pass None # 1
Object Detection COCO test-dev TridentNet (ResNet-101-Deformable, Image Pyramid) box mAP 48.4 # 101
AP50 69.7 # 38
AP75 53.5 # 47
APS 31.8 # 37
APM 51.3 # 52
APL 60.3 # 55
Hardware Burden None # 1
Operations per network pass None # 1

Methods