ICNet for Real-Time Semantic Segmentation on High-Resolution Images

We focus on the challenging task of real-time semantic segmentation in this paper. It finds many practical applications and yet is with fundamental difficulty of reducing a large portion of computation for pixel-wise label inference. We propose an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to address this challenge. We provide in-depth analysis of our framework and introduce the cascade feature fusion unit to quickly achieve high-quality segmentation. Our system yields real-time inference on a single GPU card with decent quality results evaluated on challenging datasets like Cityscapes, CamVid and COCO-Stuff.

PDF Abstract ECCV 2018 PDF ECCV 2018 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Real-Time Semantic Segmentation CamVid ICNet mIoU 67.1% # 21
Time (ms) 36 # 12
Frame (fps) 27.8 # 12
Semantic Segmentation Cityscapes test ICNet Mean IoU (class) 70.6% # 77
Real-Time Semantic Segmentation Cityscapes test ICNet mIoU 70.6% # 29
Time (ms) 33 # 18
Frame (fps) 30.3 # 19
Dichotomous Image Segmentation DIS-TE1 ICNet max F-Measure 0.631 # 16
weighted F-measure 0.535 # 15
MAE 0.095 # 14
S-Measure 0.716 # 16
E-measure 0.784 # 14
HCE 234 # 10
Dichotomous Image Segmentation DIS-TE2 ICNet max F-Measure 0.716 # 14
weighted F-measure 0.627 # 13
MAE 0.095 # 12
S-Measure 0.759 # 13
E-measure 0.826 # 13
HCE 512 # 10
Dichotomous Image Segmentation DIS-TE3 ICNet max F-Measure 0.752 # 11
weighted F-measure 0.664 # 11
MAE 0.091 # 12
S-Measure 0.780 # 10
E-measure 0.852 # 13
HCE 1001 # 11
Dichotomous Image Segmentation DIS-TE4 ICNet max F-Measure 0.749 # 11
weighted F-measure 0.663 # 11
MAE 0.099 # 11
S-Measure 0.776 # 10
E-measure 0.837 # 13
HCE 3690 # 12
Dichotomous Image Segmentation DIS-VD ICNet max F-Measure 0.697 # 11
weighted F-measure 0.609 # 11
MAE 0.102 # 10
S-Measure 0.747 # 11
E-measure 0.811 # 11
HCE 1503 # 10
Semantic Segmentation Trans10K ICNet mIoU 23.39% # 15
GFLOPs 10.64 # 2

Methods