Skip to content

Latest commit

 

History

History

Course 10 - IBM Capstone Project

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

🚀 Applied Data Science Capstone

This Capstone Project is the final and 10th module of this course in IBM Data Science Professional Certificate specialization, and it actually summarizes all materials that have been learnt during this specialization.

📄 Project Background

SpaceX is the most successful commercial company of space age, making space travel affordable. The company advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars. At the heart of SpaceX’s spate of successes is the Falcon 9, which has brought down the cost of reaching space and become a springboard. While other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore, if we can determine if the first stage will land, we can determine the cost of a launch. Based on public information and machine learning models, we are going to predict if SpaceX will reuse the first stage.

📄 Questions to be answered

  • How do variables such as payload mass, launch site, number of flights, and orbits affect the success of the first stage landing?
  • Does the rate of successful landings increase over the years?
  • What is the best algorithm that can be used for binary classification in this case?

📄 Methodology

1. Data collection methodology

  • Using SpaceX Rest API
  • Using Web Scrapping from Wikipedia

2. Performed data wrangling

  • Filtering the data
  • Dealing with missing values
  • Tranforming the data and adding new columns
  • Using One Hot Encoding to prepare the data to a binary classification

3. Performed exploratory data analysis (EDA) using visualization and SQL

4. Performed interactive visual analytics using Folium and Plotly Dash

5. Performed predictive analysis using classification models

  • Selected required features, split the data, build models, tuned and performed evaluation to ensure an optimized model

6. Constructed a brief presentation report