TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Graph Matching	PASCAL VOC	GMTR	matching accuracy	0.836	# 2
Graph Matching	PASCAL VOC	GMT-BBGM	matching accuracy	0.8411	# 1
Graph Matching	SPair-71k	GMTR	matching accuracy	0.832	# 2
Graph Matching	SPair-71k	GMT-BBGM	matching accuracy	0.8296	# 3
Graph Matching	Willow Object Class	GMT-BBGM	matching accuracy	0.9813	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gmtr-graph-matching-transformers/graph-matching-on-pascal-voc)](https://paperswithcode.com/sota/graph-matching-on-pascal-voc?p=gmtr-graph-matching-transformers)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gmtr-graph-matching-transformers/graph-matching-on-spair-71k)](https://paperswithcode.com/sota/graph-matching-on-spair-71k?p=gmtr-graph-matching-transformers)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gmtr-graph-matching-transformers/graph-matching-on-willow-object-class)](https://paperswithcode.com/sota/graph-matching-on-willow-object-class?p=gmtr-graph-matching-transformers)`

GMTR: Graph Matching Transformers

14 Nov 2023 · Jinpei Guo, Shaofeng Zhang, Runzhong Wang, Chang Liu, Junchi Yan ·

Vision transformers (ViTs) have recently been used for visual matching beyond object detection and segmentation. However, the original grid dividing strategy of ViTs neglects the spatial information of the keypoints, limiting the sensitivity to local information. Therefore, we propose QueryTrans (Query Transformer), which adopts a cross-attention module and keypoints-based center crop strategy for better spatial information extraction. We further integrate the graph attention module and devise a transformer-based graph matching approach GMTR (Graph Matching TRansformers) whereby the combinatorial nature of GM is addressed by a graph transformer neural GM solver. On standard GM benchmarks, GMTR shows competitive performance against the SOTA frameworks. Specifically, on Pascal VOC, GMTR achieves $\mathbf{83.6\%}$ accuracy, $\mathbf{0.9\%}$ higher than the SOTA framework. On Spair-71k, GMTR shows great potential and outperforms most of the previous works. Meanwhile, on Pascal VOC, QueryTrans improves the accuracy of NGMv2 from $80.1\%$ to $\mathbf{83.3\%}$, and BBGM from $79.0\%$ to $\mathbf{84.5\%}$. On Spair-71k, QueryTrans improves NGMv2 from $80.6\%$ to $\mathbf{82.5\%}$, and BBGM from $82.1\%$ to $\mathbf{83.9\%}$. Source code will be made publicly available.

PDF Abstract

Code

Add Remove Mark official

jp-guo/gm-transformer official

Tasks

Add Remove

Graph Attention

Graph Matching

object-detection

Object Detection

Datasets

PASCAL VOC

SPair-71k

Results from the Paper

Edit

Ranked #1 on Graph Matching on PASCAL VOC (matching accuracy metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Graph Matching	PASCAL VOC	GMTR	matching accuracy	0.836	# 2	Compare
Graph Matching	PASCAL VOC	GMT-BBGM	matching accuracy	0.8411	# 1	Compare
Graph Matching	SPair-71k	GMTR	matching accuracy	0.832	# 2	Compare
Graph Matching	SPair-71k	GMT-BBGM	matching accuracy	0.8296	# 3	Compare
Graph Matching	Willow Object Class	GMT-BBGM	matching accuracy	0.9813	# 5	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Concatenated Skip Connection • Cross-Attention Module • Dense Connections • Dropout • Graph Transformer • Label Smoothing • LapEigen • Laplacian PE • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

GMTR: Graph Matching Transformers

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove