Skip to content

Repository for SEPAL: Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning

License

Notifications You must be signed in to change notification settings

soda-inria/sepal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License Hugging Face arXiv

SEPAL: Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning

SEPAL pipeline

This repository contains the implementation of the paper:

Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning
Félix Lefebvre and Gaël Varoquaux
NeurIPS 2025
PDF: https://arxiv.org/pdf/2507.00965v2

✨ Highlights

  • Scales to knowledge graphs with millions of entities
  • Robust to highly skewed degree distributions
  • Produces embeddings for downstream regression and classification tasks

Method details and ablations are in the paper.

🧪 Example

  • Mini YAGO3 tutorial: examples/mini_yago3_embeddings.ipynb

📊 Datasets

📣 Citation

If you use SEPAL, please cite:

@inproceedings{lefebvre2025scalable,
  title={Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning},
  author={Lefebvre, Félix and Varoquaux, Gaël},
  booktitle={Advances in Neural Information Processing Systems},
  volume={38},
  year={2025}
}

About

Repository for SEPAL: Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages