TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image-Based Localization	CVACT	SAIG-D	Recall@1	89.21	# 2
Image-Based Localization	CVACT	SAIG-D	Recall@5	96.07	# 2
Image-Based Localization	CVACT	SAIG-D	Recall@10	97.04	# 2
Image-Based Localization	CVACT	SAIG-D	Recall@1%	98.74	# 3
Image-Based Localization	CVUSA	SAIG-D	Recall@1	96.34	# 2
Image-Based Localization	CVUSA	SAIG-D	Recall@5	99.10	# 2
Image-Based Localization	CVUSA	SAIG-D	Recall@10	99.50	# 2
Image-Based Localization	CVUSA	SAIG-D	Recall@1%	99.86	# 2
Image-Based Localization	VIGOR Cross Area	SAIG-D	Recall@1	33.05	# 2
Image-Based Localization	VIGOR Cross Area	SAIG-D	Recall@5	55.94	# 2
Image-Based Localization	VIGOR Cross Area	SAIG-D	Recall@10	-	# 4
Image-Based Localization	VIGOR Cross Area	SAIG-D	Recall@1%	94.64	# 2
Image-Based Localization	VIGOR Cross Area	SAIG-D	Hit Rate	36.71	# 2
Image-Based Localization	VIGOR Same Area	SAIG-D	Recall@1	65.23	# 2
Image-Based Localization	VIGOR Same Area	SAIG-D	Recall@5	88.08	# 2
Image-Based Localization	VIGOR Same Area	SAIG-D	Recall@10	-	# 4
Image-Based Localization	VIGOR Same Area	SAIG-D	Recall@1%	99.68	# 1
Image-Based Localization	VIGOR Same Area	SAIG-D	Hit Rate	74.11	# 2
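
For readers unfamiliar with the Recall@K columns above, the following is a minimal sketch of how such a metric is commonly computed for cross-view retrieval. It assumes a one-to-one pairing in which reference `i` is the true match of query `i`, and it assumes that Recall@1% uses K equal to 1% of the reference set, rounded up. Both are assumptions for illustration; the exact evaluation protocol behind the numbers in the table is not stated on this page, and Hit Rate is not covered. The function names and the toy similarity matrix are hypothetical.

```python
import math


def recall_at_k(sim, k):
    """Percentage of queries whose true match ranks within the top k.

    sim[i][j] is the similarity between ground query i and aerial
    reference j. Assumption: reference i is the true match of query i.
    """
    hits = 0
    for i, row in enumerate(sim):
        # Count references scored strictly higher than the true match.
        better = sum(1 for s in row if s > row[i])
        if better < k:
            hits += 1
    return 100.0 * hits / len(sim)


def recall_at_top_percent(sim, percent=1.0):
    """Recall@K with K set to `percent` of the reference set (at least 1).

    Assumption: K is rounded up; other codebases may round differently.
    """
    n_refs = len(sim[0])
    k = max(1, math.ceil(n_refs * percent / 100.0))
    return recall_at_k(sim, k)


# Toy 4x4 similarity matrix (made-up numbers, not from the paper).
toy_sim = [
    [0.9, 0.1, 0.2, 0.3],  # true match ranked 1st
    [0.8, 0.7, 0.1, 0.2],  # true match ranked 2nd
    [0.1, 0.2, 0.9, 0.3],  # true match ranked 1st
    [0.9, 0.8, 0.7, 0.1],  # true match ranked 4th
]

print(recall_at_k(toy_sim, 1))  # 50.0
print(recall_at_k(toy_sim, 2))  # 75.0
```

Counting strictly better-scored references, rather than sorting each row, keeps the sketch short and treats ties in favour of the true match.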

Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization

3 Feb 2023 · Yingying Zhu, Hongji Yang, Yuxin Lu, Qiang Huang

In this work, we address an important but underexplored problem: designing a simple yet effective backbone specifically for the cross-view geo-localization task. Existing methods for cross-view geo-localization are frequently characterized by 1) complicated methodologies, 2) GPU-intensive computation, and 3) a stringent assumption that aerial and ground images are center- or orientation-aligned. To address these three challenges in cross-view image matching, we propose a new backbone network, the Simple Attention-based Image Geo-localization network (SAIG). SAIG effectively represents long-range interactions among patches, as well as cross-view correspondence, with multi-head self-attention layers. The "narrow-deep" architecture of SAIG improves feature richness without degrading performance, while its shallow and effective convolutional stem preserves locality, avoiding the loss of boundary information caused by patchification. SAIG achieves state-of-the-art results on cross-view geo-localization while being far simpler than previous works. Furthermore, with only 15.9% of the model parameters and half the output dimension of the state-of-the-art, SAIG adapts well across multiple cross-view datasets without employing any well-designed feature aggregation modules or feature alignment algorithms. In addition, SAIG attains competitive scores on image retrieval benchmarks, further demonstrating its generalizability. As a backbone network, SAIG is both easy to follow and computationally lightweight, which is meaningful in practical scenarios. Moreover, we propose a simple Spatial-Mixed feature aggregation moDule (SMD) that can mix and project spatial information into a low-dimensional space to generate feature descriptors... (The code is available at https://github.com/yanghongji2007/SAIG)
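
To illustrate what the abstract means by representing long-range interactions among patches with self-attention, here is a minimal sketch of generic scaled dot-product self-attention over a handful of patch tokens. It is not the SAIG implementation: it uses a single head and identity query/key/value projections for brevity, whereas the abstract describes multi-head layers, and the function names and toy tokens are hypothetical. The official code is in the repository linked above.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity projections, so each token serves as its own
    query, key, and value. A real layer would learn these projections
    and run several heads in parallel.
    """
    d = len(tokens[0])
    outputs = []
    for query in tokens:
        # Score this patch against every patch, near or far.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
            for key in tokens
        ]
        weights = softmax(scores)
        # The output is a weighted average of all patch tokens.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)]
        )
    return outputs


# Three toy 2-dimensional patch tokens (made-up values).
patch_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
for row in self_attention(patch_tokens):
    print(row)
```

Because every output row mixes all input tokens, a patch at one edge of the image can influence a patch at the opposite edge within a single layer, which is the sense of "long-range" here.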

Code

yanghongji2007/saig official

Tasks

Image-Based Localization

Image Retrieval

Retrieval

Datasets

CVUSA, CVACT, VIGOR

Results from the Paper

Ranked #2 on Image-Based Localization on VIGOR Cross Area

