Skip to content

Latest commit

 

History

History
26 lines (19 loc) · 1.03 KB

README.md

File metadata and controls

26 lines (19 loc) · 1.03 KB

🛍️ Segmentation of e-commerce customers

OPENCLASSROOMS - Data Scientist - Project 5

This repository contains notebooks for a machine learning project that segments customers of the Olist marketplace based on their orders history.

📊 Data

The dataset used for this project is the Brazilian E-Commerce Public Dataset by Olist, which includes information of 100k orders from 2016 to 2018 in Brazil.

📁 Files

  • barbier_victor_1_notebook_exploration_102022.ipynb : Exploratory data analysis of the customers informations
  • barbier_victor_2_notebook_essais_102022.ipynb : Machine learning models for the clustering of the customers
  • barbier_victor_3_notebook_simulation_102022.ipynb : Analysis of the best model performance through time
  • barbier_victor_4_presentation_102022.pdf: Final presentation of the project

🛠️ Tools

  • Python 3.x
  • Jupyter Notebook
  • NumPy
  • Pandas
  • Matplotlib / Seaborn
  • Scikit-learn : PCA, KMeans, t-SNE
  • HDBSCAN
  • YellowBrick