Image Retrieval
672 papers with code • 54 benchmarks • 75 datasets
Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a provided query from a large database. It's often considered as a form of fine-grained, instance-level classification. Not just integral to image recognition alongside classification and detection, it also holds substantial business value by helping users discover images aligning with their interests or requirements, guided by visual similarity or other parameters.
( Image credit: DELF )
Libraries
Use these libraries to find Image Retrieval models and implementationsDatasets
Subtasks
Most implemented papers
Kornia: an Open Source Differentiable Computer Vision Library for PyTorch
This work presents Kornia -- an open source computer vision library which consists of a set of differentiable routines and modules to solve generic computer vision problems.
Unifying Deep Local and Global Features for Image Search
Image retrieval is the problem of searching an image database for items that are similar to a query image.
Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
GLDv2 is the largest such dataset to date by a large margin, including over 5M images and 200k distinct instance labels.
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Large-scale pre-training methods of learning cross-modal representations on image-text pairs are becoming popular for vision-language tasks.
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval
In this paper, we propose Suppression-Enhancing Mask based attention and Interactive Channel transformatiON (SEMICON) to learn binary hash codes for dealing with large-scale fine-grained image retrieval tasks.
Deep Image Retrieval: Learning global representations for image search
We propose a novel approach for instance-level image retrieval.
Efficient Diffusion on Region Manifolds: Recovering Small Objects with Compact CNN Representations
The diffusion is carried out on descriptors of overlapping image regions rather than on a global image descriptor like in previous approaches.
Repeatability Is Not Enough: Learning Affine Regions via Discriminability
A method for learning local affine-covariant regions is presented.
Single Shot Scene Text Retrieval
In this way, the text based image retrieval task can be casted as a simple nearest neighbor search of the query text representation over the outputs of the CNN over the entire image database.
Detect-to-Retrieve: Efficient Regional Aggregation for Image Search
Then, we demonstrate how a trained landmark detector, using our new dataset, can be leveraged to index image regions and improve retrieval accuracy while being much more efficient than existing regional methods.