Object Detection Models

FoveaBox

Introduced by Kong et al. in FoveaBox: Beyond Anchor-based Object Detector

FoveaBox is anchor-free framework for object detection. Instead of using predefined anchors to enumerate possible locations, scales and aspect ratios for the search of the objects, FoveaBox directly learns the object existing possibility and the bounding box coordinates without anchor reference. This is achieved by: (a) predicting category-sensitive semantic maps for the object existing possibility, and (b) producing category-agnostic bounding box for each position that potentially contains an object. The scales of target boxes are naturally associated with feature pyramid representations for each input image

It is a single, unified network composed of a backbone network and two task-specific subnetworks. The backbone is responsible for computing a convolutional feature map over an entire input image and is an off-the-shelf convolutional network. The first subnet performs per pixel classification on the backbone’s output; the second subnet performs bounding box prediction for the corresponding position.

Source: FoveaBox: Beyond Anchor-based Object Detector

Papers


Paper Code Results Date Stars

Tasks


Task Papers Share
Object Detection 3 100.00%

Components


Component Type
FPN
Feature Extractors

Categories