Object Localization

231 papers with code • 18 benchmarks • 17 datasets

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Benchmarks

Add a Result

These leaderboards are used to track progress in Object Localization

Dataset	Best Model	Compare
IllusionVQA	GPT4-Vision 4-shot+CoT	See all
KITTI Pedestrians Moderate	Frustrum-PointPillars	See all
KITTI Pedestrians Hard	Frustrum-PointPillars	See all
GRIT	Unified-IOXL	See all
KITTI Cars Easy	VoxelNet	See all
KITTI Cars Moderate	Frustum PointNets	See all
KITTI Cars Hard	VoxelNet	See all
KITTI Pedestrians Easy	Frustum PointNets	See all
KITTI Cyclists Easy	Frustum PointNets	See all
KITTI Cyclists Moderate	Frustum PointNets	See all
KITTI Cyclists Hard	Frustum PointNets	See all
Mall	Hausdorff Loss	See all
Pupil	Hausdorff Loss	See all
Plant	Hausdorff Loss	See all
PASCAL VOC 2007	DeepCut	See all
PASCAL VOC 2012	DeepCut	See all
KITTI Pedestrian Easy	Frustrum-PointPillars	See all
REVERIE	CoLabBUAA_MiNLP	See all

Show all 18 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Object Localization models and implementations

PaddlePaddle/PaddleDetection

3 papers

12,066

jacobgil/pytorch-grad-cam

3 papers

9,444

Westlake-AI/openmixup

2 papers

570

kargarisaac/PointNet-SemSeg-VKITTI3D

2 papers

See all 6 libraries.

Datasets

Subtasks

Monocular 3D Object Localization

Active Object Localization

Most implemented papers

Most implemented Social Latest No code

Bounding Box Regression with Uncertainty for Accurate Object Detection

yihui-he/KL-Loss • • CVPR 2019

Large-scale object detection datasets (e. g., MS-COCO) try to define the ground truth bounding boxes as clear as possible.

Paper
Code

Smooth Grad-CAM++: An Enhanced Inference Level Visualization Technique for Deep Convolutional Neural Network Models

frgfm/torch-cam • • 3 Aug 2019

With the intention to create an enhanced visual explanation in terms of visual sharpness, object localization and explaining multiple occurrences of objects in a single image, we present Smooth Grad-CAM++ \footnote{Simple demo: http://35. 238. 22. 135:5000/}, a technique that combines methods from two other recent techniques---SMOOTHGRAD and Grad-CAM++.

Paper
Code

BOP Challenge 2020 on 6D Object Localization

thodan/bop_toolkit • 15 Sep 2020

This paper presents the evaluation methodology, datasets, and results of the BOP Challenge 2020, the third in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB-D image.

Paper
Code

Active Object Localization with Deep Reinforcement Learning

otoofim/ObjLocalisation • • ICCV 2015

We present an active detection model for localizing objects in scenes.

Paper
Code

Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization

zhengshou/AutoLoc • ICCV 2017

We propose `Hide-and-Seek', a weakly-supervised framework that aims to improve object localization in images and action localization in videos.

Paper
Code

Dilated Residual Networks

fyu/drn • • CVPR 2017

Convolutional networks for image classification progressively reduce resolution until the image is represented by tiny feature maps in which the spatial structure of the scene is no longer discernible.

Paper
Code

Unsupervised Traffic Accident Detection in First-Person Videos

MoonBlvd/tad-IROS2019 • • 2 Mar 2019

Recognizing abnormal events such as traffic violations and accidents in natural driving scenes is essential for successful autonomous driving and advanced driver assistance systems.

Paper
Code

ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

daveredrum/ScanRefer • • ECCV 2020

We introduce the task of 3D object localization in RGB-D scans using natural language descriptions.

Paper
Code

Learning to Segment from Scribbles using Multi-scale Adversarial Attention Gates

gvalvano/multiscale-adversarial-attention-gates • • 2 Jul 2020

We evaluated our model on several medical (ACDC, LVSC, CHAOS) and non-medical (PPSS) datasets, and we report performance levels matching those achieved by models trained with fully annotated segmentation masks.

Paper
Code

Eigen-CAM: Class Activation Map using Principal Components

jacobgil/pytorch-grad-cam • • 1 Aug 2020

At the heart of this progress is convolutional neural networks (CNNs) that are capable of learning representations or features given a set of data.

Paper
Code

Object Localization

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result