Embedding Learning for Heterogeneous Cloud Service Graphs

This project explores and implements embedding learning techniques for heterogeneous cloud service graphs using advanced Graph Neural Networks (GNNs). The aim is to map complex, cloud-based network graphs into low-dimensional, machine-readable representations for classification and clustering tasks. The project also employs data augmentation techniques using graphons to overcome data scarcity and enhance model robustness.

Project Overview

Author: Eya Jlassi
Supervisors: Dr. Hayet Brabra (Telecom SudParis), Dr. Zacharie Ales (ENSTA Paris)
Academic Year: 2024/2025
Institution: Télécom SudParis

Project Goals

Graph Embedding: Map heterogeneous graphs to low-dimensional vectors that capture structural information and semantics using:
- Relational Graph Convolutional Networks (R-GCN)
- Heterogeneous Graph Transformers (HGT)
Data Augmentation: Generate synthetic data through Graphon-based methods to enhance the dataset's volume and diversity, making the model more robust.

Features

Embedding Techniques: Implements R-GCN and HGT for transforming complex service graphs.
Data Augmentation: Uses graphons for creating synthetic graphs to improve model generalization.
Classification and Clustering: Evaluates embedding quality using classifiers and clustering algorithms.

Project Structure

src/: Source code implementing graph embedding, data augmentation, and model evaluation.
data/: Dataset of cloud service graphs with node and edge files in CSV format.
models/: Pre-trained models and training scripts.
docs/: Project documentation and detailed report.

Installation

Clone the Repository:

git clone https://github.com/EyaJlassi695/Autonomous-Car-Pathfinding.git
cd Autonomous-Car-Pathfinding

Install Dependencies:
- Python (>= 3.8)
- PyTorch, PyTorch Geometric
- Additional packages in requirements.txt:
```
pip install -r requirements.txt
```

Usage

Prepare the Dataset: Place node and edge files in the data/ directory, structured by service categories.
Run Embedding Models:
- Train the R-GCN or HGT model on the dataset:
```
python src/train_model.py --model rgcn
```
Evaluate and Visualize:
- Evaluate model performance with classifiers and clustering algorithms.
- Visualize embeddings for training and test sets.

Results

R-GCN: Achieved high classification and clustering accuracy.
HGT: Showed excellent performance in capturing heterogeneous relations.

Project Documentation

Refer to the docs/ folder for detailed information on:

Graph embedding techniques
Data augmentation with graphons
Model architecture and evaluation metrics

Acknowledgments

Special thanks to Dr. Hayet Brabra and Dr. Zacharie Ales for their invaluable support and guidance throughout this project.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Data_augmentation_GNN.ipynb		Data_augmentation_GNN.ipynb
Data_augmentation_HGT_modified.ipynb		Data_augmentation_HGT_modified.ipynb
GNN.ipynb		GNN.ipynb
HGT_modified.ipynb		HGT_modified.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Embedding Learning for Heterogeneous Cloud Service Graphs

Project Overview

Project Goals

Features

Project Structure

Installation

Usage

Results

Project Documentation

Acknowledgments

About

Releases

Packages

Languages

EyaJlassi695/Embedding-Learning-for-Heterogeneous-Cloud-Service-Graphs

Folders and files

Latest commit

History

Repository files navigation

Embedding Learning for Heterogeneous Cloud Service Graphs

Project Overview

Project Goals

Features

Project Structure

Installation

Usage

Results

Project Documentation

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages