Add PageRank #788

IvanIsCoding · 2023-01-23T02:23:46Z

Related to #315

Adds an implementation of the PageRank algorithm using sparse matrices. It uses the sprs crate combined with ndarray to implement a power-method approach to computing PageRank.

Also, we test this implementation against NetworkX's implementation of the PageRank. We accept all the arguments that NetworkX accepts: tolerance, max_iter, personalization, dangling, etc.
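
For readers following along, here is a minimal pure-Python sketch of the power-method iteration described above. It is not the code in this PR (which is written in Rust on top of sprs and ndarray). The function name `pagerank_power`, the edge-list input format, and the local `FailedToConverge` class are stand-ins invented for illustration. The defaults (alpha of 0.85, `dangling` falling back to `personalization`, and an L1 error compared against `n * tol`) are assumptions chosen to follow NetworkX's conventions.

```python
from typing import Dict, Hashable, Iterable, List, Optional, Tuple

Edge = Tuple[Hashable, Hashable, float]  # (source, target, positive weight)


class FailedToConverge(Exception):
    """Local stand-in for a convergence error; not the rustworkx class."""


def pagerank_power(
    nodes: Iterable[Hashable],
    edges: Iterable[Edge],
    alpha: float = 0.85,
    personalization: Optional[Dict[Hashable, float]] = None,
    dangling: Optional[Dict[Hashable, float]] = None,
    tol: float = 1e-6,
    max_iter: int = 100,
) -> Dict[Hashable, float]:
    node_list: List[Hashable] = list(nodes)
    n = len(node_list)
    if n == 0:
        return {}
    # Zero-weight edges carry no rank, so drop them up front.
    edge_list = [(u, v, w) for u, v, w in edges if w > 0]

    def normalized(weights: Optional[Dict[Hashable, float]]) -> Dict[Hashable, float]:
        # None means uniform; missing keys count as zero.
        # Assumes the supplied weights have a positive total.
        if weights is None:
            return {v: 1.0 / n for v in node_list}
        total = sum(weights.get(v, 0.0) for v in node_list)
        return {v: weights.get(v, 0.0) / total for v in node_list}

    p = normalized(personalization)
    # Assumption: dangling nodes redistribute according to the
    # personalization vector unless a separate distribution is given.
    d = p if dangling is None else normalized(dangling)

    out_weight = {v: 0.0 for v in node_list}
    for u, _, w in edge_list:
        out_weight[u] += w

    rank = {v: 1.0 / n for v in node_list}
    for _ in range(max_iter):
        # Rank held by nodes with no outgoing edges.
        dangling_mass = sum(rank[v] for v in node_list if out_weight[v] == 0.0)
        new = {
            v: alpha * dangling_mass * d[v] + (1.0 - alpha) * p[v]
            for v in node_list
        }
        # Each node passes alpha * rank along its edges, split by weight.
        for u, v, w in edge_list:
            new[v] += alpha * rank[u] * w / out_weight[u]
        err = sum(abs(new[v] - rank[v]) for v in node_list)
        rank = new
        if err < n * tol:
            return rank
    raise FailedToConverge(f"no convergence within {max_iter} iterations")


if __name__ == "__main__":
    path_edges = [("a", "b", 1.0), ("b", "c", 1.0)]
    print(pagerank_power(["a", "b", "c"], path_edges))
    try:
        pagerank_power(["a", "b", "c"], path_edges, max_iter=1)
    except FailedToConverge as exc:
        print("did not converge:", exc)
```

The sketch uses plain dictionaries rather than sparse matrices to stay dependency-free; the arithmetic per iteration is the same sparse matrix-vector product, just spelled out edge by edge.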

n.b: it's ready for review

coveralls · 2023-01-23T05:54:43Z

Pull Request Test Coverage Report for Build 5086986067

129 of 129 (100.0%) changed or added relevant lines in 2 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.06%) to 96.603%

Totals
Change from base Build 5063931930:	0.06%
Covered Lines:	14704
Relevant Lines:	15221

💛 - Coveralls

IvanIsCoding · 2023-01-23T20:28:13Z

Everything should be working now

mtreinish

This is great, thanks for writing this! I really like using sprs for this; it also avoids us having to figure out how to link against and leverage BLAS/LAPACK to compute the eigenvectors of the Google matrix from an ndarray. I need to refresh my memory on the algorithm before I do a detailed review of the algorithm code. I just have some quick high-level comments from a quick scan of the code.

IvanIsCoding · 2023-02-01T16:23:10Z

> This is great, thanks for writing this! I really like using sprs for this; it also avoids us having to figure out how to link against and leverage BLAS/LAPACK to compute the eigenvectors of the Google matrix from an ndarray. I need to refresh my memory on the algorithm before I do a detailed review of the algorithm code. I just have some quick high-level comments from a quick scan of the code.

The algorithm approximates the eigenvector of the transition matrix. You might want to check NetworkX's Python code directly, because they have some quirks in how they handle dangling nodes, etc.

mtreinish

Sorry for the delay in review; this looks excellent to me. The code LGTM and nothing really stands out to me as being incorrect. Just a couple of small inline suggestions and questions, but other than that I think this is ready to merge.

Cargo.lock

mtreinish · 2023-05-25T19:22:38Z

src/link_analysis.rs

+///
+/// :returns: a read-only dict-like object whose keys are the node indices and values are the
+///      PageRank score for that node.
+/// :rtype: CentralityMapping


Do you want to document that it will raise FailedToConverge if max_iter is reached? It's something people might want to catch.

IvanIsCoding · 2023-05-26

I think I will add the notes for this, HITS, eigenvector centrality, and all the other centralities that can fail to converge in a separate PR.

releasenotes/notes/add-pagerank-bef0de7d46026071.yaml

src/link_analysis.rs

mtreinish

LGTM, thanks for the quick update

bionicles · 2024-01-18T00:10:12Z

gonna put a pin in it and double check everything and maybe make a new issue about it, but somehow i'm getting totally uniform results from this where networkx is (a lot) slower but produces some results that make more sense

IvanIsCoding · 2024-01-18T01:22:02Z

> gonna put a pin in it and double check everything and maybe make a new issue about it, but somehow i'm getting totally uniform results from this where networkx is (a lot) slower but produces some results that make more sense

Please open a new issue. If you could give the graph that triggers the difference that would be great as well.

We use the power method and NetworkX uses SVD to approximate the eigenvector, so there might be some discrepancies.

IvanIsCoding added 7 commits January 21, 2023 23:36

Add the sketch of PageRank

d32d265

More progress towards pagerank

a4de4e7

Use CentralityMapping and FailedToConverge

e099998

Finalize PageRank

92bb4f6

First test does not run

2041cfc

Remove unwanted triplet

fa35e90

Fix clippy warning

1ce674a

IvanIsCoding added 3 commits January 23, 2023 10:28

Handle personalization correctly

8f569f5

Add more tests

03df46f

Cargo fmt

eb0dc24

IvanIsCoding added 6 commits January 23, 2023 13:50

Add scipy to test requirements

428715f

Skip SciPy tests in case architecture does not have it

bdf45af

Ignore flake8 errors that do not help

b270e9a

Flake8

9cf13b5

Handle dangling weights

d025677

Add more tests

47df879

IvanIsCoding marked this pull request as ready for review January 24, 2023 02:59

IvanIsCoding added 3 commits January 23, 2023 19:28

Add nstart argument

5f98164

Cargo Clippy

36bda8d

Documentation

4e139f8

IvanIsCoding requested a review from mtreinish January 24, 2023 06:00

Fix typo in URL

3e54cb5

IvanIsCoding mentioned this pull request Jan 25, 2023

Add HITS algorithm #790

Merged

IvanIsCoding added 2 commits January 27, 2023 15:23

Merge branch 'main' into pagerank

0480034

Merge branch 'main' into pagerank

45a63a1

mtreinish reviewed Feb 1, 2023

releasenotes/notes/add-pagerank-bef0de7d46026071.yaml (outdated)

src/link_analysis.rs

.github/workflows/wheels.yml

src/link_analysis.rs (outdated)

Update releasenotes/notes/add-pagerank-bef0de7d46026071.yaml

7f953d8

Co-authored-by: Matthew Treinish <mtreinish@kortar.org>

IvanIsCoding added 3 commits February 1, 2023 11:25

Merge remote-tracking branch 'upstream/main' into pagerank

641c8e5

Tweak pyfunction signature

a125b86

Add scipy to aarch64 test requirements

d4a25ca

mtreinish self-assigned this Feb 4, 2023

IvanIsCoding and others added 3 commits March 20, 2023 20:27

Merge remote-tracking branch 'origin/main' into pagerank

3b6c6e7

Merge branch 'main' into pagerank

bc9a720

Merge branch 'main' into pagerank

15323c0

mtreinish added this to the 0.13.0 milestone May 10, 2023

Merge branch 'main' into pagerank

f7e2f41

mtreinish reviewed May 25, 2023

IvanIsCoding added 3 commits May 25, 2023 23:11

Merge remote-tracking branch 'upstream/main' into pagerank

56017e8

Address comments from code review

e0369e0

Clippy is always right

efd954a

mtreinish approved these changes May 26, 2023

mtreinish merged commit afc3627 into Qiskit:main May 26, 2023

IvanIsCoding deleted the pagerank branch May 28, 2023 22:46

IvanIsCoding mentioned this pull request Feb 28, 2024

Include most library functionality in rustworkx-core #1121

Open
