Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization

NeurIPS 2019  ·  Yujiao Shi, Liu Liu, Xin Yu, Hongdong Li

In this paper, we develop a new deep network to explicitly address the inherent differences between ground and aerial views. We observe that there exist approximate domain correspondences between ground and aerial images: pixels lying on the same azimuth direction in an aerial image approximately correspond to a vertical image column in the ground-view image. Thus, we propose a two-step approach to exploit this prior knowledge. The first step applies a regular polar transform to warp an aerial image so that its domain is closer to that of a ground-view panorama. Note that the polar transform, being a pure geometric transformation, is agnostic to scene content and hence cannot bring the two domains into full alignment. We therefore add a subsequent spatial-attention mechanism that brings corresponding deep features closer in the embedding space. To improve the robustness of the feature representation, we further introduce a feature aggregation strategy based on learning multiple spatial embeddings. With this two-step approach, we obtain more discriminative deep representations, making cross-view geo-localization more accurate. Our experiments on standard benchmark datasets show significant performance gains, more than doubling the recall rate of the previous state of the art.
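As a concrete illustration of the first step, the sketch below implements a polar warp of the kind described above, in NumPy: each output column corresponds to an azimuth angle and each output row to a radius from the aerial image centre, so rays from the centre become vertical columns, as in a ground panorama. The output size, angle origin, and row orientation here are illustrative assumptions, not necessarily the paper's exact parameterization.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def polar_transform(aerial, out_h=128, out_w=512):
    """Warp a square aerial image (S, S, C) into a panorama-like polar view."""
    s = aerial.shape[0]
    rows = np.arange(out_h, dtype=np.float64)[:, None]   # output rows
    cols = np.arange(out_w, dtype=np.float64)[None, :]   # output columns

    # radius runs from s/2 at the top row (distant content) to 0 at the
    # bottom row; azimuth sweeps one full turn across the output width
    radius = (s / 2.0) * (out_h - rows) / out_h
    theta = 2.0 * np.pi * cols / out_w

    # source pixel for every output pixel, origin at the aerial image centre
    # (assumption: angle 0 points "up"/north in the aerial image)
    y = s / 2.0 - radius * np.cos(theta)
    x = s / 2.0 + radius * np.sin(theta)

    channels = [map_coordinates(aerial[..., c], [y, x], order=1)
                for c in range(aerial.shape[-1])]
    return np.stack(channels, axis=-1)
```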

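The second step, the spatial-attention aggregation, can be sketched in PyTorch as follows. Consistent with the description above, K small MLPs each predict a position-weighting mask from a channel-pooled view of the backbone feature map, and the K mask-weighted sums are concatenated into one L2-normalized descriptor. The number of embeddings, the max-pooling choice, and the MLP sizes are assumptions for illustration, not the authors' exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialAwareAggregation(nn.Module):
    """Sketch: K learned spatial embeddings aggregating a (B, C, H, W) feature map."""

    def __init__(self, height, width, num_embeddings=8, hidden_ratio=2):
        super().__init__()
        n = height * width
        # one two-layer MLP per embedding; each maps the channel-pooled
        # spatial map to a weighting over the H*W feature positions
        self.mlps = nn.ModuleList([
            nn.Sequential(nn.Linear(n, n // hidden_ratio),
                          nn.ReLU(inplace=True),
                          nn.Linear(n // hidden_ratio, n))
            for _ in range(num_embeddings)
        ])

    def forward(self, feat):                                # feat: (B, C, H, W)
        pooled = feat.max(dim=1).values.flatten(1)          # (B, H*W)
        feat = feat.flatten(2)                              # (B, C, H*W)
        descriptors = []
        for mlp in self.mlps:
            mask = mlp(pooled)                              # (B, H*W)
            descriptors.append((feat * mask.unsqueeze(1)).sum(dim=2))  # (B, C)
        out = torch.cat(descriptors, dim=1)                 # (B, K*C)
        return F.normalize(out, dim=1)                      # unit-length descriptor
```

One module of this form per branch (ground and polar-warped aerial) yields descriptors that can be compared directly with Euclidean distance for retrieval.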

Datasets

VIGOR

Task                      Dataset           Model  Metric Name  Metric Value  Global Rank
Image-Based Localization  VIGOR Cross Area  SAFA   Recall@1     8.20          #4
Image-Based Localization  VIGOR Cross Area  SAFA   Recall@5     19.59         #4
Image-Based Localization  VIGOR Cross Area  SAFA   Recall@10    26.36         #3
Image-Based Localization  VIGOR Cross Area  SAFA   Recall@1%    77.61         #4
Image-Based Localization  VIGOR Cross Area  SAFA   Hit Rate     8.85          #4
Image-Based Localization  VIGOR Same Area   SAFA   Recall@1     33.93         #4
Image-Based Localization  VIGOR Same Area   SAFA   Recall@5     58.42         #4
Image-Based Localization  VIGOR Same Area   SAFA   Recall@10    68.12         #3
Image-Based Localization  VIGOR Same Area   SAFA   Recall@1%    98.24         #4
Image-Based Localization  VIGOR Same Area   SAFA   Hit Rate     36.87         #4
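For reference, the Recall@K figures above can be computed from descriptor distances as in the NumPy sketch below. It assumes query (ground) and gallery (aerial) descriptors are index-aligned so that entry i matches entry i, and takes K equal to 1% of the gallery size for Recall@1%; VIGOR's Hit Rate depends on the benchmark's ground-truth coverage annotations and is not reproduced here.

```python
import numpy as np

def recall_at_k(ground_desc, aerial_desc, ks=(1, 5, 10)):
    """Recall@K for cross-view retrieval with index-aligned positive pairs."""
    n = len(ground_desc)
    # pairwise squared Euclidean distances: queries (ground) x gallery (aerial);
    # the full broadcast is O(n^2 d) memory, fine for a sketch
    dists = ((ground_desc[:, None, :] - aerial_desc[None, :, :]) ** 2).sum(-1)
    true_dist = dists[np.arange(n), np.arange(n)][:, None]  # distance to match
    ranks = (dists < true_dist).sum(axis=1)  # gallery items beating the match
    results = {f"Recall@{k}": float((ranks < k).mean()) for k in ks}
    results["Recall@1%"] = float((ranks < max(1, n // 100)).mean())
    return results
```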

Methods


No methods listed for this paper.