This project (among others) has been submitted for my Jedha Data Fullstack program certification
Checkout the 10min project presentation video (in French) here: https://share.vidyard.com/watch/1nfKZvDSiWkrQgkVrrgE5s?
In order to advise Uber drivers on where to be at which time, the goal of this project is to identify "hot-zones" from Uber pick-up requests data, and display theses zones on a map.
We'll use 2 different clustering algorithms for this task (KMeans and DBSCAN) and compare their results.
The data provided for this project gathers Uber pick-uprequests in NYC from april to september 2014 and from january to june 2015, available here. This analysis focuses on September 2014 data.
-
Clone this repository to create your project folder :
git clone https://github.com/Acsts/Clustering--Uber_pick-ups.git
-
Installations :
All the source code is written in Python3. The dependencies you need to run the code are listed in the 'requirements.txt' file, you can install them with pip by running
pip install -r requirements.txt
If you want to access the results of the analysis without running any line of code, all interactive visualizations are accessible in the 'figures' folder, in the form of .html files you can download and open with your web browser.
You can also visualize them directly without downloading by copying their github link in https://htmlpreview.github.io/, developped by niu-tech.
These files corresponds to the outputs obtained from running the code in the notebook Uber pickups clustering.ipynb
.