Feature/node2vec for issue Word2Vec in StellarGraph #255 #536

daokunzhang · 2019-11-29T05:47:59Z

This is the pull request for adding Keras Node2Vec layer to StellarGraph library.

I mainly made the following changes:

Add the Node2Vec layer.
Add the Node2VecNodeGenerator and Node2VecLinkGenerator mapper.
Change the interface for the __init__ and run function of BiasedRandomWalk to make UnSupervisedSampler support BiasedRandomWalk
Write the Keras-node2vec notebook examples for embedding learning and node classification
Add the unit test for the added node2vec layer and mappers

…mWalk

…BiasedRandomWalk interface

… changed BiasedRandomWalk interface

…pdated BiasedRandomWalk interface

…mbedding learning with keras node2vec implementation

…xample of node classification with keras node2vec implementation

…'cites'

…ut node features

…tting the edge type to 'cites'

review-notebook-app · 2019-11-29T05:48:06Z

Check out this pull request on

You'll be able to see Jupyter notebook diff and discuss changes. Powered by ReviewNB.

codeclimate · 2019-11-29T05:48:44Z

stellargraph/data/explorer.py

+        self.edge_weight_label = edge_weight_label
+        self._check_weights(self.p, self.q, self.weighted, self.edge_weight_label)
+
+    def run(self, nodes=None, n=None, length=None, seed=None):


Refactor this function to reduce its Cognitive Complexity from 56 to the 15 allowed.

codeclimate · 2019-11-29T05:48:44Z

stellargraph/data/unsupervised_sampler.py

+                        sample_counter += 1
+
+                        # If the batch_size number of samples are accumulated, yield.
+                        if sample_counter == batch_size:


Avoid deeply nested control flow statements.

codeclimate · 2019-11-29T05:48:45Z

stellargraph/data/unsupervised_sampler.py

+
+                            yield edge_ids, edge_labels
+
+                        if self.bidirectional is False:


Avoid deeply nested control flow statements.

codeclimate · 2019-11-29T05:48:45Z

stellargraph/data/unsupervised_sampler.py

    """

-    def __init__(self, G, nodes=None, length=2, number_of_walks=1, seed=None):
+    def __init__(
+        self,


Method "init" has 9 parameters, which is greater than the 7 authorized.

codeclimate · 2019-11-29T05:48:47Z

Code Climate has analyzed commit 18274d2 and detected 2 issues on this pull request.

Here's the issue category breakdown:

Category	Count
Security	2

View more on Code Climate.

daokunzhang · 2020-04-26T01:57:26Z

Hi @kjun9 ，

Thanks for your careful review! I have made necessary changes as per your suggestions. You can start the next round review now.

Best regards!
Daokun

kjun9 · 2020-04-27T04:56:30Z

@daokunzhang

Though it is claimed that the same parameter is used to represent input and output embeddings, the implementation provided by authors directly deploys the gensim Word2Vec function, which is consistent with our gensim node2vec notebook. Here, I try to make a consistency with the gensim Word2Vec implementation, which uses different parameters to represent the so-called input and output embeddings.

Thanks for the explanation, that makes sense 👍 This looks good to me now, just one last thing to update that I can see is that in the demo notebooks, we're trying to keep the first markdown cell only contain the title (for example https://github.com/stellargraph/stellargraph/blob/develop/demos/node-classification/graphsage/graphsage-cora-node-classification-example.ipynb) since this lets us render the colab/binder buttons right underneath the title. So I'd suggest just moving the rest of the markdown in that first cell to a second markdown cell!

daokunzhang · 2020-04-28T02:54:10Z

Hi @kjun9 ,

The notebooks have been formatted as you requested.

Thanks and best regards!
Daokun

daokunzhang · 2020-05-07T02:59:10Z

@kjun9 ,

The code has been updated to use node ilocs in sampled node and link generaters #1267! Could you please help me review it again and merge it ?

Thanks,
Daokun

kjun9 · 2020-05-07T03:32:01Z

@daokunzhang Hi daokun, sorry for the delay I'll take a look now!

kjun9

Thanks @daokunzhang looks great 👍

Before merging, could you revert the mode change on scripts/notebook_text_checker.py using chmod 664 like you did for the previous ones? I'm guessing you might have something in your setup that keeps automatically changing this 😅

Also, there's been a number of updates to how we document each notebook (they're all now included in our API docs https://stellargraph.readthedocs.io/en/latest/demos/index.html ) which means there's a couple of additional steps to add these new notebooks to the API documentation, but I think we can do this separately after landing this first - could you file an issue with a title like "Add keras node2vec demos to API documentation"? And feel free to paste some of the details below in the issue:

Create new files that allow us to include these notebooks in the API docs:

docs/demos/embeddings/keras-node2vec-embeddings.nblink
docs/demos/node-classification/keras-node2vec-node-classification.nblink

Both of these should contain something like (you can look at another one as reference):

{
  "path": "../../../demos/embeddings/keras-node2vec-embeddings.ipynb"
}

and there should be symlinks for the images being used by the demo notebooks, similar to https://github.com/stellargraph/stellargraph/blob/develop/docs/demos/node-classification/Cora-features.png

Update the tables in demo README and index.rst files to include an entry for each of these notebooks - I think this is done automatically (or semi-automatically?) using scripts/demo_indexing.py, I can double check exactly what should be done here.

daokunzhang · 2020-05-07T07:04:38Z

Thanks @kjun9 ,

I have revised the scripts/notebook_text_checker.py file mode change problem. I have created the nblink files for the keras-node2vec-node-classification and keras-node2vec-embeddings demo files in the docs folder and added the word2vec_illustration image file to the docs folder. After that I added the component about keras node2vec to the scripts/demo_indexing.py file and rerun it to overwrite the ReadME files. You can check it.

For my part, I cannot merge this pull request. Maybe it is caused by that Yuriy ever requested changes but he haven't approved yet. Could you please help me merge it on your part if you are convenient?

Best regards!
Daokun

kjun9 · 2020-05-07T07:16:09Z

Thanks @daokunzhang I can see the notebook in the API documentation 👍 I'll merge this now.

I think the demo-indexing could still be tweaked a bit, so I might still file an issue about that - I think this is the first time where we have two versions of an algorithm presented as demos (gensim vs stellargraph components) so I think it'd be nice for the table to communicate this more clearly, rather than showing them as two separate algorithms.

daokunzhang · 2020-05-07T07:18:14Z

Thanks @kjun9 !

kjun9 · 2020-05-07T07:23:03Z

#1534

daokunzhang added 23 commits November 29, 2019 16:04

update upsupervised_sampler.py to make it compatible with BiasedRando…

4dd9a65

…mWalk

Add the unit test for Node2VecNodeGenerator

1e2cd91

add the unit test for node2vec layer

8f9aef1

add the unit test for Node2VecLinkGenerator

36106c9

change the unit test for BiasedRandomWalker

f490205

update the the stellargraph-node2vec.ipynb notebook with the updated …

8e982e0

…BiasedRandomWalk interface

update the stellargraph-node2vec-weighted-random-walks.ipynb with the…

79187cf

… changed BiasedRandomWalk interface

update the stellargraph-node2vec-node-classification.ipynb with the u…

da134ec

…pdated BiasedRandomWalk interface

add the stellargraph-keras-node2vec.ipynb notebook as an example of e…

5f107d6

…mbedding learning with keras node2vec implementation

add the stellargraph-keras-node2vec-node-classification.ipynb as an e…

9235243

…xample of node classification with keras node2vec implementation

rerun the stellargraph-attri2vec-DBLP.ipynb with expected results

3505ad3

update stellargraph-attri2vec-citeseer.ipynb by setting edge type to …

d5458de

…'cites'

reformat unsupervised_sampler.py with black

69b7360

reformat sequences.py with black

81740b3

reformat sampled_node_generators.py with black

e15f523

add Node2VecLinkGenerator

56a20f0

update the demos/node-classification/node2vec/README.md file

2e43bbd

update demos/embeddings/README.md file

4189874

add node2vec layer

a452b58

make StellarGraph object support node index access for networks witho…

446f64c

…ut node features

update the interface of BiasedRandomWalk

a3c1ddc

update the attri2vec-citeseer-node-classification-example.ipynb by se…

fa805d0

…tting the edge type to 'cites'

update the layer/__init__.py file by importing node2vec layer

44b54f3

daokunzhang requested review from adocherty and youph November 29, 2019 05:47

codeclimate bot reviewed Nov 29, 2019

View reviewed changes

daokunzhang added 2 commits November 29, 2019 18:22

update unsupervised_sampler

73c510d

rerun stellargraph-node2vec notebook

cc6e284

daokunzhang added 2 commits April 26, 2020 08:02

Merge branch 'develop' into feature/node2vec

4633053

format notebooks

5c2d9c3

daokunzhang added 3 commits April 28, 2020 08:57

Merge branch 'develop' into feature/node2vec

20b9571

reformat keras-node2vec notebooks

cf51029

reformat keras-node2vec notebooks

01f85eb

huonw mentioned this pull request May 1, 2020

Use node ilocs in sampled node and link generators #1267

Merged

daokunzhang added 6 commits May 6, 2020 09:55

use node ilocs for node2vec link and node generator

d55346e

merge to develop branch

d6ef1d4

use node ilocs in sampled node2vec node and link generators

aaedf5b

merge to the develop branch

20f817a

rename keras node2vec embedding demo

ce7da23

reformat keras node2vec embedding demo

9608e9f

merge to develop branch

ebc02d3

kjun9 approved these changes May 7, 2020

View reviewed changes

daokunzhang added 4 commits May 7, 2020 13:44

Merge branch 'develop' into feature/node2vec

ed1c19f

change mode back

3f8570b

add keras node2vec demos to API documentation

f174ead

add word2vec illustration image to keras node2vec API documentation

18274d2

kjun9 mentioned this pull request May 7, 2020

Improve demo indexing for keras Node2Vec notebooks #1534

Closed

kjun9 merged commit 318cd9c into develop May 7, 2020

kjun9 deleted the feature/node2vec branch May 7, 2020 07:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/node2vec for issue Word2Vec in StellarGraph #255 #536

Feature/node2vec for issue Word2Vec in StellarGraph #255 #536

daokunzhang commented Nov 29, 2019

review-notebook-app bot commented Nov 29, 2019

codeclimate bot Nov 29, 2019

codeclimate bot Nov 29, 2019

codeclimate bot Nov 29, 2019

codeclimate bot Nov 29, 2019

codeclimate bot commented Nov 29, 2019 •

edited

Loading

daokunzhang commented Apr 26, 2020

kjun9 commented Apr 27, 2020

daokunzhang commented Apr 28, 2020

daokunzhang commented May 7, 2020

kjun9 commented May 7, 2020

kjun9 left a comment

daokunzhang commented May 7, 2020 •

edited

Loading

kjun9 commented May 7, 2020

daokunzhang commented May 7, 2020

kjun9 commented May 7, 2020

Feature/node2vec for issue Word2Vec in StellarGraph #255 #536

Feature/node2vec for issue Word2Vec in StellarGraph #255 #536

Conversation

daokunzhang commented Nov 29, 2019

review-notebook-app bot commented Nov 29, 2019

codeclimate bot Nov 29, 2019

Choose a reason for hiding this comment

codeclimate bot Nov 29, 2019

Choose a reason for hiding this comment

codeclimate bot Nov 29, 2019

Choose a reason for hiding this comment

codeclimate bot Nov 29, 2019

Choose a reason for hiding this comment

codeclimate bot commented Nov 29, 2019 • edited Loading

daokunzhang commented Apr 26, 2020

kjun9 commented Apr 27, 2020

daokunzhang commented Apr 28, 2020

daokunzhang commented May 7, 2020

kjun9 commented May 7, 2020

kjun9 left a comment

Choose a reason for hiding this comment

daokunzhang commented May 7, 2020 • edited Loading

kjun9 commented May 7, 2020

daokunzhang commented May 7, 2020

kjun9 commented May 7, 2020

codeclimate bot commented Nov 29, 2019 •

edited

Loading

daokunzhang commented May 7, 2020 •

edited

Loading