Graph (multi-dimention edge, node) Classification Task on HIV dataset

Usage

# 
python train.py --gnn gcn 
(python train.py --dataset $DATASET --gnn $GNN_TYPE --filename $FILENAME)

python evaluate.py --filename ./models/gcn_2.pt
(python evaluate.py --dataset ogbg-molhiv --filename ./models/gcn_2.pt)

`$DATASET`

$DATASET specified the name of the molecule dataset. It should be one of the followings:

ogbg-molhiv

`$GNN_TYPE`

$GNN_TYPE specified the GNN architecture. It should be one of the followings:

gin: GIN [2]
~~gin-virtual: GIN over graphs augmented with virtual nodes* [4] (doesn't work)~~
gcn: GCN [3]
~~gcn-virtual: GCN over graphs augmented with virtual nodes* [4] (doesn't work)~~
gcn-pyg
gat[5]
gatv2 [6]
transformerconv [7]
tag-pyg [8]

* Additional nodes that are connected to all the nodes in the original graphs.

`$FILENAME`: Specifying output file.

$FILENAME specifies the filename to save the result. The result is a dictionary containing (1) best training performance ('BestTrain'), (2) best validation performance ('Val'), (3) test performance at the best validation epoch ('Test'), and (4) training performance at the best validation epoch ('Train').

Dataset

ogbg-molhiv

(in moduleNet[1])

The HIV dataset was introduced by the Drug
Therapeutics Program (DTP) AIDS Antiviral Screen, which
tested the ability to inhibit HIV replication for over 40 000
compounds.47 Screening results were evaluated and placed into
three categories: conrmed inactive (CI), conrmed active (CA)
and conrmed moderately active (CM). We further combine the
latter two labels, making it a classication task between inactive
(CI) and active (CA and CM). As we are more interested in
discover new categories of HIV inhibitors, scaffold splitting
(introduce

model types

'-pyg' It handles one-dimension edge features. e.g., GCNConv(in torch geometric) handles only one dimension edge (as a edge weights). To solve the multi-dimension problem, we have to use multi-parallel-model,

w/o 'pyg' It handles multi-dimension edge features.

Experiment results

model	rocauc
gat	75.87
gatv2	74.49
gcn	74.09
gcn-pyg	74.45
gin	71.08
TAG-pyg	72.71
transformerconv	72.42

References

[1] Wu, Z., Ramsundar, B., Feinberg, E. N., Gomes, J., Geniesse, C., Pappu, A. S., ... & Pande, V. (2018). MoleculeNet: a benchmark for molecular machine learning. Chemical science, 9(2), 513-530.

[2] Xu, K., Hu, W., Leskovec, J., & Jegelka, S. (2019). How powerful are graph neural networks?. ICLR 2019

[3] Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. ICLR 2017

[4] Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O., & Dahl, G. E. Neural message passing for quantum chemistry. ICML 2017.

[5] Veličković, Petar, et al. "Graph Attention Networks." International Conference on Learning Representations. 2018.

[6] Brody, Shaked, Uri Alon, and Eran Yahav. "How attentive are graph attention networks?." arXiv preprint arXiv:2105.14491 (2021).

[7] Shi, Yunsheng, et al. "Masked label prediction: Unified message passing model for semi-supervised classification." arXiv preprint arXiv:2009.03509 (2020).

[8] Du, Jian, et al. "Topology adaptive graph convolutional networks." arXiv preprint arXiv:1710.10370 (2017).

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
assets		assets
models		models
result		result
.gitignore		.gitignore
README.md		README.md
README.pdf		README.pdf
conv.py		conv.py
evaluate.py		evaluate.py
gnn.py		gnn.py
model.py		model.py
predict.py		predict.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Graph (multi-dimention edge, node) Classification Task on HIV dataset

Table of Contents

Usage

`$DATASET`

`$GNN_TYPE`

`$FILENAME`: Specifying output file.

Dataset

model types

Experiment results

References

About

Releases

Packages

Languages

epsilon-deltta/graph-prediction

Folders and files

Latest commit

History

Repository files navigation

Graph (multi-dimention edge, node) Classification Task on HIV dataset

Table of Contents

Usage

$DATASET

$GNN_TYPE

$FILENAME: Specifying output file.

Dataset

model types

Experiment results

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`$DATASET`

`$GNN_TYPE`

`$FILENAME`: Specifying output file.

Packages