Enable edge based temporal sampling in torch_geometric.distributed
#8428
This PR enables edge-based temporal distributed training for both node and link sampling.
A note on how the edge temporal data is defined:
In distributed training, each partition needs its own separate vector storing the time information of the edges it contains. (I mention this to point out that it works differently than node-based temporal sampling, where a single vector can be shared across all partitions because we operate on global node IDs.)
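A minimal sketch of that re-indexing step (the function and variable names here are illustrative, not the actual implementation): a global per-edge time vector is split into local vectors, one per partition, following each partition's local edge ordering.

```python
# Sketch (assumed names): build a local edge-time vector per partition.
# Node-based temporal sampling can share one global node_time vector,
# but edge times must be stored locally, because each partition's
# edge_index is renumbered before CSR/CSC conversion.

def local_edge_times(global_edge_time, partition_edge_ids):
    # partition_edge_ids[p] lists the *global* edge IDs owned by
    # partition p; the local time vector follows that local ordering.
    return [[global_edge_time[e] for e in edge_ids]
            for edge_ids in partition_edge_ids]

global_edge_time = [5, 1, 9, 3, 7, 2]        # one timestamp per global edge
partition_edge_ids = [[0, 2, 5], [1, 3, 4]]  # edges assigned to 2 partitions

print(local_edge_times(global_edge_time, partition_edge_ids))
# -> [[5, 9, 2], [1, 3, 7]]
```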
Why:
Each partition has its own unique edge_index in COO format, which is later converted to a CSR/CSC matrix in the neighbor sampler. During sampling we therefore have no information about global edge IDs and could not look up the correct time for a specific edge, so this information must be kept local.
Changes made:
- `seed_time` needs to be specified (a requirement for edge-level temporal sampling)
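The role of `seed_time` can be sketched as follows (assumed names and a deliberately simplified filter, not the actual sampler code): edge-level temporal sampling compares each candidate edge's local timestamp against the seed's `seed_time`, so without a `seed_time` there is no reference point to filter against.

```python
# Sketch (assumed names): edge-level temporal neighbor sampling keeps
# only in-edges whose local timestamp does not exceed the seed's
# seed_time; this is why seed_time is required.

def sample_neighbors(colptr, rows, edge_time, seed, seed_time):
    # In-edges of `seed` live in the CSC slice [colptr[seed], colptr[seed+1]).
    lo, hi = colptr[seed], colptr[seed + 1]
    return [rows[e] for e in range(lo, hi) if edge_time[e] <= seed_time]

colptr = [0, 1, 2, 4]
rows = [2, 0, 1, 0]
edge_time = [20, 10, 30, 40]   # local (per-partition) edge times
print(sample_neighbors(colptr, rows, edge_time, seed=2, seed_time=35))
# -> [1]   (the edge with time 40 is filtered out)
```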