Merge Cluster-GCN and GCN graph classification into GCN layer #1205
Conversation
The core layer in a full-batch method (like `GraphConvolution` inside `GCN`) natively computes information about a whole graph, but this doesn't directly work with how training happens in Keras: there's usually only a subset of labelled nodes, so training needs to compute candidate embeddings only for those nodes, to compare against the ground-truth target array. We currently handle this in full-batch methods by having each layer know whether it is the `final_layer` in the model, and do a `tf.gather` using a tensor of indices of the relevant nodes. This means that the individual full-batch layers have to:

- always accept the tensor of indices, even if they won't use it (and, potentially, even if they are being used as part of a model that doesn't ever use it)
- do the appropriate `tf.gather` invocation to select the relevant nodes

This patch adjusts this approach to make the output filtering a model-level concern: layers always compute all the information, and models that need to can do a `tf.gather` call (via the new `layer.misc.GatherIndices`) to select the relevant output elements, as sketched below. This has a few benefits:

- it seems conceptually more "correct" to me, because it's the training/use of the overall model that needs to filter out elements, _not_ the operation of an individual layer
- it noticeably reduces the amount of code required, and will reduce it further when we remove the left-over `final_layer` detection (which exists to help us migrate)
- it makes #1201 easier (in #1205): graph classification doesn't have any output indices to feed into the graph convolution layers, so this patch brings `GraphConvolution` closer to the behaviour required there
- it may also ease implementing models that incorporate information from multiple convolution layers, because it's easy to ensure all the convolution layers compute embeddings for all nodes

See: #1201
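As a rough illustration, here is a minimal sketch of the model-level gathering. The layer below only approximates what `layer.misc.GatherIndices` does; the class name, shapes, and the `Dense` stand-in for the GCN layers are assumptions for the example, not this patch's exact implementation:

```python
import tensorflow as tf
from tensorflow.keras import layers

# Illustrative sketch of a model-level gather layer: the GCN layers compute
# embeddings for *every* node, and the model selects only the labelled ones.
class GatherIndicesSketch(layers.Layer):
    def call(self, inputs):
        embeddings, indices = inputs
        # embeddings: (batch, nodes, features); indices: (batch, num_selected).
        # batch_dims=1 gathers per batch element, matching full-batch models
        # where the batch dimension is 1.
        return tf.gather(embeddings, indices, batch_dims=1)

# Usage: the convolution layers no longer need a `final_layer` flag or an
# indices input; the model applies the gather as its last step.
x_features = layers.Input(shape=(None, 32))              # all-node features
x_indices = layers.Input(shape=(None,), dtype="int32")   # labelled-node indices
all_embeddings = layers.Dense(16)(x_features)            # stand-in for GCN layers
selected = GatherIndicesSketch()([all_embeddings, x_indices])
model = tf.keras.Model([x_features, x_indices], selected)
```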
Looks good to me 👍 Just some minor comments
👍
`ClusterGraphConvolution` and `GraphConvolution` were practically identical, with just some minor differences in the details of the code, and `GraphClassificationConvolution` only differed by:

- not doing the output-node gathering, which was moved out of `GraphConvolution` in Gather output nodes in full batch models, not the layers #1204
- requiring `GraphConvolution` to handle a batch size > 1 for dense matrices, not sparse ones (yet, since it's harder: Keras's `.batch_dot` doesn't seem to support sparse matrices; see the sketch below)

That is, by building on #1204, this patch can merge all three of our implementations of the GCN layer.
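A short sketch of why batching currently only works for dense matrices. The names and shapes are illustrative assumptions; the point is that `tf.matmul` broadcasts over a leading batch dimension for dense tensors, while the sparse product has no equivalent batched form:

```python
import tensorflow as tf

batch, nodes, features, units = 4, 10, 8, 16
adj = tf.random.uniform((batch, nodes, nodes))   # dense adjacency, one per graph
h = tf.random.uniform((batch, nodes, features))  # node features, one per graph
kernel = tf.random.uniform((features, units))

# Dense case: one batched matmul computes A @ (H @ W) for every graph at
# once; the rank-2 kernel broadcasts across the batch dimension.
output = tf.matmul(adj, tf.matmul(h, kernel))    # shape (4, 10, 16)

# Sparse case: tf.sparse.sparse_dense_matmul traditionally accepts only
# rank-2 operands, so a sparse adjacency can't be batched the same way
# (consistent with `.batch_dot` not supporting sparse matrices).
```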
This deprecates `ClusterGraphConvolution`, because it has been released for a while, but removes `GraphClassificationConvolution` entirely, because it has only been released `@experimental`-ly.

See: #1201
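For illustration, one common way to implement this kind of deprecation is a thin alias that warns and then delegates. This is a hedged sketch with stand-in classes, not necessarily the mechanism this patch uses:

```python
import warnings

class GraphConvolution:
    """Stand-in for the merged GCN layer (illustrative only)."""
    def __init__(self, units):
        self.units = units

class ClusterGraphConvolution(GraphConvolution):
    """Deprecated alias: warns on construction, then behaves identically."""
    def __init__(self, *args, **kwargs):
        warnings.warn(
            "ClusterGraphConvolution is deprecated; use GraphConvolution instead",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(*args, **kwargs)
```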