Object localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and a proposal is said to be a correct localization if it sufficiently overlaps a human-labeled "ground-truth" bounding box for the given object. In the literature, "object localization" refers to locating one instance of an object category, whereas "object detection" refers to locating all instances of a category in a given image.
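The "sufficient overlap" criterion above is most commonly measured with Intersection over Union (IoU). As a minimal sketch (the 0.5 threshold is a widespread convention, not something this page specifies), a proposal could be scored against ground truth like this:

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Coordinates of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to zero when the boxes do not overlap.
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def is_correct_localization(proposal, ground_truth, threshold=0.5):
    """A proposal counts as correct if its IoU with the
    ground-truth box meets the chosen threshold."""
    return iou(proposal, ground_truth) >= threshold
```

A perfectly matching proposal yields an IoU of 1.0, while two boxes that merely touch a corner yield 0.0; the threshold controls how tight the localization must be to count as correct.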
Unfortunately, the network activates only the most discriminative features of the object rather than the whole object.
At the heart of this progress are convolutional neural networks (CNNs), which are capable of learning representations, or features, from a set of data.
The detector predicts the object location, defined by a set of coefficients describing a geometric shape (e.g., an ellipse or a rectangle), which is geometrically constrained by the mask produced by the generator.
Most existing works attempt post-hoc interpretation on a pre-trained model, while neglecting to reduce the entanglement underlying the model.
Monocular multi-object detection and localization in 3D space has proven to be a challenging task.
The proposed deep learning method consists of a two-stage object detection network that produces region-of-interest (RoI) features and a building-boundary extraction network that uses graph models to learn the geometric information of the polygon shapes.
To fulfill the direct evaluation, we annotate pixel-level object masks on the ILSVRC validation set.
In this paper, we argue that the WSOL task is ill-posed with only image-level labels, and propose a new evaluation protocol in which full supervision is limited to a small held-out set that does not overlap with the test set.
We propose a novel method that tracks fast-moving objects, mainly non-uniform spherical ones, in full 6 degrees of freedom, simultaneously estimating their 3D motion trajectory, 3D pose, and object appearance changes with a time step that is a fraction of the video frame exposure time.