Object Localization

231 papers with code • 18 benchmarks • 17 datasets

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Benchmarks

Add a Result

These leaderboards are used to track progress in Object Localization

Dataset	Best Model	Compare
IllusionVQA	GPT4-Vision 4-shot+CoT	See all
KITTI Pedestrians Moderate	Frustrum-PointPillars	See all
KITTI Pedestrians Hard	Frustrum-PointPillars	See all
GRIT	Unified-IOXL	See all
KITTI Cars Easy	VoxelNet	See all
KITTI Cars Moderate	Frustum PointNets	See all
KITTI Cars Hard	VoxelNet	See all
KITTI Pedestrians Easy	Frustum PointNets	See all
KITTI Cyclists Easy	Frustum PointNets	See all
KITTI Cyclists Moderate	Frustum PointNets	See all
KITTI Cyclists Hard	Frustum PointNets	See all
Mall	Hausdorff Loss	See all
Pupil	Hausdorff Loss	See all
Plant	Hausdorff Loss	See all
PASCAL VOC 2007	DeepCut	See all
PASCAL VOC 2012	DeepCut	See all
KITTI Pedestrian Easy	Frustrum-PointPillars	See all
REVERIE	CoLabBUAA_MiNLP	See all

Show all 18 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Object Localization models and implementations

PaddlePaddle/PaddleDetection

3 papers

12,022

jacobgil/pytorch-grad-cam

3 papers

9,370

Westlake-AI/openmixup

2 papers

567

kargarisaac/PointNet-SemSeg-VKITTI3D

2 papers

See all 6 libraries.

Datasets

Subtasks

Monocular 3D Object Localization

Active Object Localization

Latest papers with no code

Most implemented Social Latest No code

Weakly Supervised Monocular 3D Detection with a Single-View Image

no code yet • 29 Feb 2024

We propose SKD-WM3D, a weakly supervised monocular 3D detection framework that exploits depth information to achieve M3D with a single-view image exclusively without any 3D annotations or other training data.

Paper
Add Code

Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

no code yet • 7 Feb 2024

First, a depth estimation (DE) scheme leverages relative depth information to realize the effective feature lifting from 2D to 3D spaces.

Paper
Add Code

MsSVT++: Mixed-scale Sparse Voxel Transformer with Center Voting for 3D Object Detection

no code yet • 22 Jan 2024

To mitigate the computational complexity associated with applying a window-based transformer in 3D voxel space, we introduce a novel Chessboard Sampling strategy and implement voxel sampling and gathering operations sparsely using a hash map.

Paper
Add Code

Removal and Selection: Improving RGB-Infrared Object Detection via Coarse-to-Fine Fusion

no code yet • 19 Jan 2024

Specifically, following this perspective, we design a Redundant Spectrum Removal module to coarsely remove interfering information within each modality and a Dynamic Feature Selection module to finely select the desired features for feature fusion.

Paper
Add Code

Domain Adaptation for Large-Vocabulary Object Detectors

no code yet • 13 Jan 2024

Large-vocabulary object detectors (LVDs) aim to detect objects of many categories, which learn super objectness features and can locate objects accurately while applied to various downstream data.

Paper
Add Code

GTA: Guided Transfer of Spatial Attention from Object-Centric Representations

no code yet • 5 Jan 2024

Through experimental analysis using attention maps in ViT, we observe that the rich representations deteriorate when trained on a small dataset.

Paper
Add Code

FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection

no code yet • 22 Dec 2023

Compared to state-of-the-art methods, our proposed method delivers comparable performance on DOTA-v1. 0 and outperforms by 1. 5 mAP on DOTA-v1. 5, all while significantly reducing the model parameters to 16%.

Paper
Add Code

Weakly Supervised Open-Vocabulary Object Detection

no code yet • 19 Dec 2023

Despite weakly supervised object detection (WSOD) being a promising step toward evading strong instance-level annotations, its capability is confined to closed-set categories within a single training dataset.

Paper
Add Code

Multiscale Vision Transformer With Deep Clustering-Guided Refinement for Weakly Supervised Object Localization

no code yet • 15 Dec 2023

This work addresses the task of weakly-supervised object localization.

Paper
Add Code

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

no code yet • 4 Dec 2023

Recently, the rise of query-based Transformer decoders is reshaping camera-based 3D object detection.

Paper
Add Code

Object Localization

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result