Skip to content

angheljf/machine-learning-challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Machine Learning Homework - Exoplanet Exploration

exoplanets.jpg

Background

Over a period of nine years in deep space, the NASA Kepler space telescope has been out on a planet-hunting mission to discover hidden planets outside of our solar system.

To help process this data, you will create machine learning models capable of classifying candidate exoplanets from the raw dataset.

In this homework assignment, you will need to:

  1. Preprocess the raw data
  2. Tune the models
  3. Compare two or more models

Instructions

Preprocess the Data

  • Preprocess the dataset prior to fitting the model.
  • Perform feature selection and remove unnecessary features.
  • Use MinMaxScaler to scale the numerical data.
  • Separate the data into training and testing data.

Tune Model Parameters

  • Use GridSearch to tune model parameters.
  • Tune and compare at least two different classifiers.

Reporting

  • Create a README that reports a comparison of each model's performance as well as a summary about your findings and any assumptions you can make based on your model (is your model good enough to predict new exoplanets? Why or why not? What would make your model be better at predicting new exoplanets?).

For this exercise I created two models (Logistic Regression and Random Forest Classifier) in order to see which would perform better at finding new exoplantes based on the data. The Random Forest Classifier had a better score but I feel that is it overfitted to this particular dataset. I believe both models are good enough to predict new explanets, but the Logistic Regression one is more flexible and would work with other datasets.


© 2021 Trilogy Education Services, LLC, a 2U, Inc. brand. Confidential and Proprietary. All Rights Reserved.

About

Solution for Machine Learning HW

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published