TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Node Classification	COCO-SP	GCN-tuned	macro F1	0.1338±0.0007	# 13
Node Classification	COCO-SP	GPS-tuned	macro F1	0.3884±0.0055	# 1
Node Classification	COCO-SP	GatedGCN-tuned	macro F1	0.2922±0.0018	# 4
Node Classification	COCO-SP	GINE-tuned	macro F1	0.2125±0.0009	# 10
Node Classification	PascalVOC-SP	GPS-tuned	macro F1	0.4440±0.0065	# 1
Node Classification	PascalVOC-SP	GatedGCN-tuned	macro F1	0.3880±0.0040	# 3
Node Classification	PascalVOC-SP	GINE-tuned	macro F1	0.2718±0.0054	# 11
Node Classification	PascalVOC-SP	GCN-tuned	macro F1	0.2078±0.0031	# 13
Link Prediction	PCQM-Contact	GPS-tuned	MRR	0.3498±0.0005	# 4
Link Prediction	PCQM-Contact	GPS-tuned	MRR-ext-filtered	0.4703±0.0014	# 1
Link Prediction	PCQM-Contact	GatedGCN-tuned	MRR	0.3495±0.0010	# 5
Link Prediction	PCQM-Contact	GatedGCN-tuned	MRR-ext-filtered	0.4670±0.0004	# 2
Link Prediction	PCQM-Contact	GINE-tuned	MRR	0.3509±0.0006	# 3
Link Prediction	PCQM-Contact	GINE-tuned	MRR-ext-filtered	0.4617±0.0005	# 3
Link Prediction	PCQM-Contact	GCN-tuned	MRR	0.3424±0.0007	# 7
Link Prediction	PCQM-Contact	GCN-tuned	MRR-ext-filtered	0.4526±0.0006	# 4
Graph Classification	Peptides-func	GCN-tuned	AP	0.6860±0.0050	# 5
Graph Classification	Peptides-func	GPS-tuned	AP	0.6534±0.0091	# 16
Graph Classification	Peptides-func	GatedGCN-tuned	AP	0.6765±0.0047	# 8
Graph Classification	Peptides-func	GINE-tuned	AP	0.6621±0.0067	# 12
Graph Regression	Peptides-struct	GPS-tuned	MAE	0.2509±0.0014	# 13
Graph Regression	Peptides-struct	GatedGCN-tuned	MAE	0.2477±0.0009	# 8
Graph Regression	Peptides-struct	GINE-tuned	MAE	0.2473±0.0017	# 6
Graph Regression	Peptides-struct	GCN-tuned	MAE	0.2460±0.0007	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-did-the-gap-go-reassessing-the-long/node-classification-on-coco-sp)](https://paperswithcode.com/sota/node-classification-on-coco-sp?p=where-did-the-gap-go-reassessing-the-long)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-did-the-gap-go-reassessing-the-long/node-classification-on-pascalvoc-sp-1)](https://paperswithcode.com/sota/node-classification-on-pascalvoc-sp-1?p=where-did-the-gap-go-reassessing-the-long)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-did-the-gap-go-reassessing-the-long/link-prediction-on-pcqm-contact)](https://paperswithcode.com/sota/link-prediction-on-pcqm-contact?p=where-did-the-gap-go-reassessing-the-long)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-did-the-gap-go-reassessing-the-long/graph-regression-on-peptides-struct)](https://paperswithcode.com/sota/graph-regression-on-peptides-struct?p=where-did-the-gap-go-reassessing-the-long)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-did-the-gap-go-reassessing-the-long/graph-classification-on-peptides-func)](https://paperswithcode.com/sota/graph-classification-on-peptides-func?p=where-did-the-gap-go-reassessing-the-long)`

Where Did the Gap Go? Reassessing the Long-Range Graph Benchmark

1 Sep 2023 · Jan Tönshoff, Martin Ritzert, Eran Rosenbluth, Martin Grohe ·

The recent Long-Range Graph Benchmark (LRGB, Dwivedi et al. 2022) introduced a set of graph learning tasks strongly dependent on long-range interaction between vertices. Empirical evidence suggests that on these tasks Graph Transformers significantly outperform Message Passing GNNs (MPGNNs). In this paper, we carefully reevaluate multiple MPGNN baselines as well as the Graph Transformer GPS (Ramp\'a\v{s}ek et al. 2022) on LRGB. Through a rigorous empirical analysis, we demonstrate that the reported performance gap is overestimated due to suboptimal hyperparameter choices. It is noteworthy that across multiple datasets the performance gap completely vanishes after basic hyperparameter optimization. In addition, we discuss the impact of lacking feature normalization for LRGB's vision datasets and highlight a spurious implementation of LRGB's link prediction metric. The principal aim of our paper is to establish a higher standard of empirical rigor within the graph machine learning community.

PDF Abstract

Code

Add Remove Mark official

toenshoff/lrgb official

Tasks

Add Remove

Graph Classification

Graph Learning

Graph Regression

Hyperparameter Optimization

Link Prediction

Node Classification

Datasets

PASCAL VOC

Long Range Graph Benchmark (LRGB)

Results from the Paper

Edit

Ranked #1 on Link Prediction on PCQM-Contact (MRR-ext-filtered metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Node Classification	COCO-SP	GCN-tuned	macro F1	0.1338±0.0007	# 13	Compare
Node Classification	COCO-SP	GPS-tuned	macro F1	0.3884±0.0055	# 1	Compare
Node Classification	COCO-SP	GatedGCN-tuned	macro F1	0.2922±0.0018	# 4	Compare
Node Classification	COCO-SP	GINE-tuned	macro F1	0.2125±0.0009	# 10	Compare
Node Classification	PascalVOC-SP	GPS-tuned	macro F1	0.4440±0.0065	# 1	Compare
Node Classification	PascalVOC-SP	GatedGCN-tuned	macro F1	0.3880±0.0040	# 3	Compare
Node Classification	PascalVOC-SP	GINE-tuned	macro F1	0.2718±0.0054	# 11	Compare
Node Classification	PascalVOC-SP	GCN-tuned	macro F1	0.2078±0.0031	# 13	Compare
Link Prediction	PCQM-Contact	GPS-tuned	MRR	0.3498±0.0005	# 4	Compare
Link Prediction	PCQM-Contact	GPS-tuned	MRR-ext-filtered	0.4703±0.0014	# 1	Compare
Link Prediction	PCQM-Contact	GatedGCN-tuned	MRR	0.3495±0.0010	# 5	Compare
Link Prediction	PCQM-Contact	GatedGCN-tuned	MRR-ext-filtered	0.4670±0.0004	# 2	Compare
Link Prediction	PCQM-Contact	GINE-tuned	MRR	0.3509±0.0006	# 3	Compare
Link Prediction	PCQM-Contact	GINE-tuned	MRR-ext-filtered	0.4617±0.0005	# 3	Compare
Link Prediction	PCQM-Contact	GCN-tuned	MRR	0.3424±0.0007	# 7	Compare
Link Prediction	PCQM-Contact	GCN-tuned	MRR-ext-filtered	0.4526±0.0006	# 4	Compare
Graph Classification	Peptides-func	GCN-tuned	AP	0.6860±0.0050	# 5	Compare
Graph Classification	Peptides-func	GPS-tuned	AP	0.6534±0.0091	# 16	Compare
Graph Classification	Peptides-func	GatedGCN-tuned	AP	0.6765±0.0047	# 8	Compare
Graph Classification	Peptides-func	GINE-tuned	AP	0.6621±0.0067	# 12	Compare
Graph Regression	Peptides-struct	GPS-tuned	MAE	0.2509±0.0014	# 13	Compare
Graph Regression	Peptides-struct	GatedGCN-tuned	MAE	0.2477±0.0009	# 8	Compare
Graph Regression	Peptides-struct	GINE-tuned	MAE	0.2473±0.0017	# 6	Compare
Graph Regression	Peptides-struct	GCN-tuned	MAE	0.2460±0.0007	# 3	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • GPS • Graph Transformer • Label Smoothing • LapEigen • Laplacian PE • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

Where Did the Gap Go? Reassessing the Long-Range Graph Benchmark

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove