AdaptIS: Adaptive Instance Selection Network

We present Adaptive Instance Selection network architecture for class-agnostic instance segmentation. Given an input image and a point $(x, y)$, it generates a mask for the object located at $(x, y)$. The network adapts to the input point with a help of AdaIN layers, thus producing different masks for different objects on the same image. AdaptIS generates pixel-accurate object masks, therefore it accurately segments objects of complex shape or severely occluded ones. AdaptIS can be easily combined with standard semantic segmentation pipeline to perform panoptic segmentation. To illustrate the idea, we perform experiments on a challenging toy problem with difficult occlusions. Then we extensively evaluate the method on panoptic segmentation benchmarks. We obtain state-of-the-art results on Cityscapes and Mapillary even without pretraining on COCO, and show competitive results on a challenging COCO dataset. The source code of the method and the trained models are available at https://github.com/saic-vul/adaptis.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Panoptic Segmentation Cityscapes val AdaptIS (ResNet-50) PQ 59.0 # 29
PQst 61.3 # 21
PQth 55.8 # 13
mIoU 75.3 # 28
AP 32.3 # 28
Panoptic Segmentation Cityscapes val AdaptIS (ResNet-101) PQ 60.6 # 24
PQst 62.9 # 15
PQth 57.5 # 9
mIoU 77.2 # 25
AP 33.9 # 25
Panoptic Segmentation Cityscapes val AdaptIS (ResNeXt-101) PQ 62.0 # 20
PQst 64.4 # 12
PQth 58.7 # 7
mIoU 79.2 # 20
AP 36.3 # 22
Panoptic Segmentation COCO test-dev AdaptIS (ResNeXt-101) PQ 42.8 # 30
PQst 31.8 # 28
PQth 50.1 # 25
Panoptic Segmentation Mapillary val AdaptIS (ResNeXt-101) PQ 40.3 # 9
mIoU 56.8 # 5

Methods