This is the official codebase for the paper Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones.
We present ConE (Cone Embedding for knowledge graphs), the first knowledge graph (KG) embedding method that can capture the transitive closure properties of heterogeneous hierarchical relations as well as non-hierarchical properties. The figure below shows an illustration of our model.
This is a PyTorch implementation of our proposed ConE model, built on the knowledge graph embedding framework provided by the RotatE repository.
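To give some intuition for the containment relation that hyperbolic cones encode, here is a rough, illustrative sketch of an angle-based cone membership test on the Poincaré disk, following the entailment-cone formulation of Ganea et al. (2018). The constant `K`, the toy points, and all function names below are assumptions made for illustration; ConE additionally applies relation-specific transformations and restricts cones to relation-specific subspaces, and the actual scoring functions are the ones implemented in this repository's code.

```python
# Illustrative sketch only (not this repo's implementation): an angle-based
# cone membership test on the Poincare disk, in the style of hyperbolic
# entailment cones (Ganea et al., 2018).
import numpy as np

K = 0.1  # assumed aperture constant; cones are defined for points away from the origin


def half_aperture(x, eps=1e-6):
    """Half-aperture of the cone rooted at x; shrinks as x moves toward the boundary."""
    nx = np.linalg.norm(x)
    return np.arcsin(np.clip(K * (1 - nx ** 2) / max(nx, eps), -1 + eps, 1 - eps))


def angle_at_apex(x, y, eps=1e-6):
    """Angle at x between the cone axis (pointing away from the origin) and the geodesic to y."""
    nx, ny = np.linalg.norm(x), np.linalg.norm(y)
    dot = float(np.dot(x, y))
    num = dot * (1 + nx ** 2) - nx ** 2 * (1 + ny ** 2)
    den = nx * np.linalg.norm(x - y) * np.sqrt(1 + nx ** 2 * ny ** 2 - 2 * dot) + eps
    return np.arccos(np.clip(num / den, -1 + eps, 1 - eps))


def cone_energy(x, y):
    """Zero when y lies inside the cone rooted at x; positive otherwise."""
    return max(0.0, angle_at_apex(x, y) - half_aperture(x))


# Toy check: a point further from the origin along a similar direction falls
# inside the parent's cone, but not the other way around.
parent, child = np.array([0.3, 0.0]), np.array([0.6, 0.05])
print(cone_energy(parent, child))  # ~0.0 -> child lies inside parent's cone
print(cone_energy(child, parent))  # > 0  -> parent does not lie inside child's cone
```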
Datasets:
- WN18RR
- DDB14
- GO21
- FB15k-237
Note: we propose GO21, a hierarchical biological knowledge graph containing genes, proteins, drugs, and diseases as entities. Please cite our paper if you use this dataset.
Models:
- RotatE
- pRotatE
- TransE
- ComplEx
- DistMult
- RotC (ours)
- ConE (ours)
Note: RotC is the degenerate version of ConE with an empty relation-specific subspace. RotC is used to initialize the embeddings for ConE, which stabilizes the training procedure.
Evaluation Metrics:
- Knowledge graph completion:
  - MRR, MR, HITS@1, HITS@3, HITS@10 (filtered; see the sketch after this list)
- Ancestor-descendant prediction:
  - mAP, AUROC
- LCA prediction:
  - HITS@1, HITS@3, HITS@10
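To pin down the KG completion metrics, here is a minimal sketch of filtered ranking evaluation. It assumes `scores` holds one model score per candidate tail entity and `known_tails` contains every tail known to be correct for the query (across train/valid/test), which is what the "filtered" setting masks out; the repository's own evaluation loop may organize this differently.

```python
# Minimal sketch of filtered ranking metrics (MR, MRR, HITS@k); illustrative only.
import numpy as np


def filtered_rank(scores, true_tail, known_tails):
    """Rank of the true tail after masking every other known-correct tail."""
    scores = scores.copy()
    scores[list(known_tails - {true_tail})] = -np.inf  # the "filtered" setting
    return int((scores > scores[true_tail]).sum()) + 1


def ranking_metrics(ranks):
    ranks = np.asarray(ranks, dtype=float)
    metrics = {"MR": ranks.mean(), "MRR": (1.0 / ranks).mean()}
    metrics.update({f"HITS@{k}": (ranks <= k).mean() for k in (1, 3, 10)})
    return metrics


# Toy example: scores over 5 candidate tails for a single test triple.
scores = np.array([0.1, 0.9, 0.4, 0.8, 0.3])
print(filtered_rank(scores, true_tail=3, known_tails={1, 3}))  # entity 1 is filtered -> rank 1
print(ranking_metrics([1, 2, 5]))
```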
Loss Functions:
- Uniform Negative Sampling
- Self-Adversarial Negative Sampling (see the sketch after this list)
- Cone angle loss (ours)
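Self-adversarial negative sampling follows RotatE (Sun et al., 2019): negative triples are reweighted by their current scores so that harder negatives contribute more. Below is a minimal PyTorch sketch of that loss, assuming `pos_score` and `neg_score` are model scores where higher means more plausible; the exact weighting used here, and the cone angle loss specific to ConE, are implemented in this repo's training code.

```python
# Minimal sketch of the self-adversarial negative sampling loss (RotatE-style).
import torch
import torch.nn.functional as F


def self_adversarial_loss(pos_score, neg_score, adv_temperature=1.0):
    """pos_score: (batch,); neg_score: (batch, num_negatives); higher = more plausible."""
    # Harder (higher-scoring) negatives receive larger weights; no gradient flows
    # through the weights themselves.
    neg_weights = F.softmax(neg_score * adv_temperature, dim=1).detach()
    pos_loss = -F.logsigmoid(pos_score).mean()
    neg_loss = -(neg_weights * F.logsigmoid(-neg_score)).sum(dim=1).mean()
    return (pos_loss + neg_loss) / 2


# Example with random scores for a batch of 8 triples and 64 negatives each.
loss = self_adversarial_loss(torch.randn(8), torch.randn(8, 64))
```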
Knowledge Graph Data (see the loading sketch after this list):
- `entities.dict`: a dictionary mapping entities to unique ids
- `relations.dict`: a dictionary mapping relations to unique ids
- `train.txt`: the KGE model is trained to fit this data set
- `valid.txt`: create a blank file if no validation data is available
- `test.txt`: the KGE model is evaluated on this data set
- `relation_category.txt`: a dictionary mapping relations to their type (1-1 indicates non-hierarchical, 1-M indicates hyponym, M-1 indicates hypernym); required for the ConE model
- `class_test_X.txt`: test data for the ancestor-descendant prediction task (X=easy: 0% inferred descendant pairs, X=medium: 50% inferred descendant pairs, X=hard: 100% inferred descendant pairs)
- `lca_test_X.txt`: LCA prediction under X hops is evaluated on this data set
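Below is a minimal sketch of loading these files, assuming the tab-separated `id<TAB>name` layout that the RotatE codebase uses for the `*.dict` files and `head<TAB>relation<TAB>tail` for the triple files; the directory name is only an example, and `codes/run.py` is the place to check for the parsing actually used here.

```python
# Minimal sketch of reading the data files; paths and layout are assumptions.
import os


def read_dict(path):
    """Each line: '<id>\t<name>' -> returns {name: id}."""
    with open(path) as f:
        return {name: int(idx) for idx, name in (line.strip().split("\t") for line in f)}


def read_triples(path, ent2id, rel2id):
    """Each line: '<head>\t<relation>\t<tail>' -> returns a list of id triples."""
    with open(path) as f:
        return [
            (ent2id[h], rel2id[r], ent2id[t])
            for h, r, t in (line.strip().split("\t") for line in f)
        ]


data_dir = "data/wn18rr"  # example dataset directory
ent2id = read_dict(os.path.join(data_dir, "entities.dict"))
rel2id = read_dict(os.path.join(data_dir, "relations.dict"))
train_triples = read_triples(os.path.join(data_dir, "train.txt"), ent2id, rel2id)
```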
The `run.sh` script provides an easy way to specify hyper-parameters (such as batch size, learning rate, etc.) during training. For example, the following command trains a ConE model on the WN18RR dataset with GPU 0.
```bash
bash run.sh train ConE wn18rr 0 1 1024 50 500 10 0.5 0.001 40000 4 -de \
--tail_batch_only --do_valid --valid_steps 20000 --save_checkpoint 40000 \
--train_with_relation_category --uni_weight --lr_decay_epoch "30000" \
--do_test_relation_category --cone_penalty --fix_att 100 \
--w 0.5 --pretrained "./models/RotC_wn18rr_1/checkpoint/ckpt_39999"
```
Check the argparse configuration in `codes/run.py` for more arguments and details.
Moreover, we provide example training scripts for ConE in the `examples/` folder to reproduce our results.
Trained models are automatically saved in the `models/` folder; evaluation can be conducted with the following command.
```bash
bash run.sh category ConE wn18rr 0 1 1024 50 500 0.1 0.5 0.001 20000 4 -de \
-init models/ConE_wn18rr_1 -ckpt "ckpt_39999" --fix_att 100 \
--do_test_relation_category --do_classification --do_lca 1 \
--pretrained "./models/RotC_wn18rr_1/checkpoint/ckpt_39999"
```
The model is evaluated on three tasks: KG completion, ancestor-descendant prediction, and LCA prediction.
For hierarchical KG datasets, training ConE requires a pretrained RotC model to initialize the embeddings; see `examples/WN18RR.sh` for an example.
For non-hierarchical KG datasets, the RotC model is comparable to many strong baseline models; see `examples/FB15k237.sh` for an example.
Please cite our paper if you use our method or dataset in your work (BibTeX below).
```bibtex
@inproceedings{bai2021cone,
  title={Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones},
  author={Bai, Yushi and Ying, Rex and Ren, Hongyu and Leskovec, Jure},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}
```