Toward unsupervised, multi-object discovery in large-scale image collections

ECCV 2020  ยท  Huy V. Vo, Patrick Pรฉrez, Jean Ponce ยท

This paper addresses the problem of discovering the objects present in a collection of images without any supervision. We build on the optimization approach of Vo et al. (CVPR'19) with several key novelties: (1) We propose a novel saliency-based region proposal algorithm that achieves significantly higher overlap with ground-truth objects than other competitive methods. This procedure leverages off-the-shelf CNN features trained on classification tasks without any bounding box information, but is otherwise unsupervised. (2) We exploit the inherent hierarchical structure of proposals as an effective regularizer for the approach to object discovery of Vo et al., boosting its performance to significantly improve over the state of the art on several standard benchmarks. (3) We adopt a two-stage strategy to select promising proposals using small random sets of images before using the whole image collection to discover the objects it depicts, allowing us to tackle, for the first time (to the best of our knowledge), the discovery of multiple objects in each one of the pictures making up datasets with up to 20,000 images, an over five-fold increase compared to existing methods, and a first step toward true large-scale unsupervised image interpretation.

PDF Abstract ECCV 2020 PDF ECCV 2020 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Single-object discovery COCO_20k rOSD CorLoc 48.5 # 9
Multi-object discovery COCO_20k Large-scale rOSD Detection Rate 12.0 # 1
Single-object discovery COCO_20k rOSD + CAD CorLoc 53.0 # 7
Single-object discovery Object Discovery rOSD CorLoc 89.2 # 1
Single-object colocalization Object Discovery rOSD CorLoc 90.2 # 1
Single-object colocalization VOC12 rOSD CorLoc 49.2 # 1
Multi-object discovery VOC12 Large-scale rOSD Detection Rate 41.2 # 1
Multi-object colocalization VOC12 rOSD Detection Rate 51.5 # 1
Single-object discovery VOC12 Large-scale rOSD CorLoc 51.9 # 1
Multi-object discovery VOC12 rOSD Detection Rate 40.4 # 2
Single-object discovery VOC12 rOSD CorLoc 51.2 # 2
Single-object discovery VOC_6x2 rOSD CorLoc 72.5 # 1
Single-object colocalization VOC_6x2 rOSD CorLoc 76.1 # 1
Single-object colocalization VOC_all rOSD CorLoc 46.7 # 1
Single-object discovery VOC_all Large-scale rOSD CorLoc 49.4 # 1
Single-object discovery VOC_all rOSD CorLoc 49.3 # 2
Multi-object discovery VOC_all rOSD Detection Rate 37.6 # 2
Multi-object discovery VOC_all Large-scale rOSD Detection Rate 38.3 # 1
Multi-object colocalization VOC_all rOSD Detection Rate 49.4 # 1

Methods


No methods listed for this paper. Add relevant methods here