Large-scale graph representation learning with very deep GNNs and self-supervision


Effectively and efficiently deploying graph neural networks (GNNs) at scale remains one of the most challenging aspects of graph representation learning. Many powerful solutions have only ever been validated on comparatively small datasets, often with counter-intuitive outcomes---a barrier which has recently been broken by the Open Graph Benchmark Large-Scale Challenge (OGB-LSC). We entered the OGB-LSC with two large-scale GNNs: a deep transductive node classifier powered by bootstrapping, and a very deep (up to 50-layer) inductive graph regressor regularised by denoising objectives. Our models achieved an award-level (top-3) performance on both the MAG240M-LSC and PCQM4M-LSC benchmarks. In doing so, we demonstrate strong evidence of scalable self-supervised graph representation learning, and utility of very deep GNNs---both very important open issues. Our code is publicly available at: