Connectome-Implementation-Team/author_name_disambiguation

Author Name Disambiguation with Document Representation Learning and Knowledge Graph Embeddings

Prerequisites:

Hardware

Tested on the following configuration:

  • Ubuntu 18.04 and CUDA 10.1
  • 64 GB RAM
  • 500 GB SSD
  • 4x GeForce GTX TITAN X

How to run

  • Place RESCS files in /input
# set environment in /disambiguation
cd $project_path/disambiguation
export PYTHONPATH="$project_path/disambiguation:$PYTHONPATH"

# convert files from /input to required format
python rescs_to_disambig.py

# extract entities from texts in BLINK, then store their embeddings using
python integrate_entities.py

# preprocess data: required for training and applying the model
python disambiguation/scripts/preprocessing.py

# train global model: save and reuse
python disambiguation/global_/gen_train_data.py
python disambiguation/global_/global_model.py

# local model: document network as input to GAE representation learning
python disambiguation/global_/prepare_local_data.py
python disambiguation/local/gae/train.py

# estimate cluster size
python disambiguation/cluster_size/count.py
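
The steps above can also be driven from a single script. The following is a minimal sketch, not part of the repository: `STEPS`, `build_commands`, and `run_pipeline` are hypothetical names, and it assumes every script path resolves relative to `$project_path/disambiguation`, as the `cd` command suggests. It defaults to a dry run that only prints the commands.

```python
import os
import subprocess
import sys
from pathlib import Path

# Script order copied from the steps above. That these paths resolve from
# <project_path>/disambiguation is an assumption of this sketch.
STEPS = [
    "rescs_to_disambig.py",
    "integrate_entities.py",
    "disambiguation/scripts/preprocessing.py",
    "disambiguation/global_/gen_train_data.py",
    "disambiguation/global_/global_model.py",
    "disambiguation/global_/prepare_local_data.py",
    "disambiguation/local/gae/train.py",
    "disambiguation/cluster_size/count.py",
]


def build_commands(python=sys.executable):
    """Return one argv list per pipeline step, in the order listed above."""
    return [[python, step] for step in STEPS]


def run_pipeline(project_path, dry_run=True):
    """Print each step and, unless dry_run is set, run it; stop on the first failure."""
    workdir = Path(project_path) / "disambiguation"
    env = dict(os.environ)
    # Mirror: export PYTHONPATH="$project_path/disambiguation:$PYTHONPATH"
    env["PYTHONPATH"] = os.pathsep.join(
        p for p in (str(workdir), env.get("PYTHONPATH", "")) if p
    )
    executed = []
    for cmd in build_commands():
        print("$", " ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a step exits non-zero.
            subprocess.run(cmd, cwd=workdir, env=env, check=True)
        executed.append(cmd)
    return executed
```

Calling `run_pipeline(project_path, dry_run=False)` with your own checkout location executes the steps for real. Stopping on the first failure is a deliberate choice here, since each later step reads the output of the earlier ones.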
