Simplifying Subgraph Representation Learning for Scalable Link Prediction

29 Jan 2023  ·  Paul Louis, Shweta Ann Jacob, Amirali Salehi-Abari ·

Link prediction on graphs is a fundamental problem. Subgraph representation learning approaches (SGRLs), by transforming link prediction to graph classification on the subgraphs around the links, have achieved state-of-the-art performance in link prediction. However, SGRLs are computationally expensive, and not scalable to large-scale graphs due to expensive subgraph-level operations. To unlock the scalability of SGRLs, we propose a new class of SGRLs, that we call Scalable Simplified SGRL (S3GRL). Aimed at faster training and inference, S3GRL simplifies the message passing and aggregation operations in each link's subgraph. S3GRL, as a scalability framework, accommodates various subgraph sampling strategies and diffusion operators to emulate computationally-expensive SGRLs. We propose multiple instances of S3GRL and empirically study them on small to large-scale graphs. Our extensive experiments demonstrate that the proposed S3GRL models scale up SGRLs without significant performance compromise (even with considerable gains in some cases), while offering substantially lower computational footprints (e.g., multi-fold inference and training speedup).

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Link Property Prediction ogbl-citation2 S3GRL (PoS Plus) Test MRR 0.8814 ± 0.0008 # 6
Validation MRR 0.8809 ± 0.0074 # 6
Number of params 142275001 # 6
Ext. data No # 1
Link Property Prediction ogbl-collab S3GRL (PoS Plus) Test Hits@50 0.6683 ± 0.0030 # 9
Validation Hits@50 0.9861 ± 0.0006 # 6
Number of params 5913025 # 11
Ext. data No # 1
Link Property Prediction ogbl-ppa S3GRL (PoS Plus) Test Hits@100 0.4242 ± 0.0180 # 13
Validation Hits@100 0.6512 ± 0.0109 # 4
Number of params 32270001 # 6
Ext. data No # 1

Methods


No methods listed for this paper. Add relevant methods here