Image Retrieval

666 papers with code • 54 benchmarks • 75 datasets

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a provided query from a large database. It's often considered as a form of fine-grained, instance-level classification. Not just integral to image recognition alongside classification and detection, it also holds substantial business value by helping users discover images aligning with their interests or requirements, guided by visual similarity or other parameters.

( Image credit: DELF )

Libraries

Use these libraries to find Image Retrieval models and implementations
2 papers
9,377
2 papers
8,724
See all 10 libraries.

Latest papers with no code

Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval

no code yet • 23 Apr 2024

Composed Image Retrieval (CIR) is a task that retrieves images similar to a query, based on a provided textual modification.

Collaborative Visual Place Recognition through Federated Learning

no code yet • 20 Apr 2024

Visual Place Recognition (VPR) aims to estimate the location of an image by treating it as a retrieval problem.

Shotit: compute-efficient image-to-video search engine for the cloud

no code yet • 18 Apr 2024

We present Shotit, a cloud-native image-to-video search engine that tailors this search scenario in a compute-efficient approach.

Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

no code yet • 17 Apr 2024

The Composed Image Retrieval (CIR) task aims to retrieve target images using a composed query consisting of a reference image and a modified text.

Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing

no code yet • 17 Apr 2024

Our work introduces a transformative image hashing framework enabling spatial-aware conditional retrieval.

Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping

no code yet • 9 Apr 2024

A crucial practical aspect for an object identification model is to be flexible in input size.

Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning

no code yet • 6 Apr 2024

It is a step-by-step linear reasoning process that adjusts the length of the chain to improve the performance of generated prompts.

On the Estimation of Image-matching Uncertainty in Visual Place Recognition

no code yet • 31 Mar 2024

In Visual Place Recognition (VPR) the pose of a query image is estimated by comparing the image to a map of reference images with known reference poses.

Do Vision-Language Models Understand Compound Nouns?

no code yet • 30 Mar 2024

We curate Compun, a novel benchmark with 400 unique and commonly used CNs, to evaluate the effectiveness of VLMs in interpreting CNs.

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

no code yet • 28 Mar 2024

Image retrieval, i. e., finding desired images given a reference image, inherently encompasses rich, multi-faceted search intents that are difficult to capture solely using image-based measures.