Skip to content

sandipanrakshit34/Hunting-for-Exoplanets-with-Machine-Learning-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Hunting-for-Exoplanets-with-Machine-Learning

Project Overview

The "Hunt for Exoplanets using Machine Learning" project aims to identify potential exoplanets from a large dataset of stellar observations. Using machine learning techniques, this project analyzes light curves—graphs of light intensity over time—to detect the subtle dimming that indicates an exoplanet passing in front of a star. This project employs a Random Forest classifier to distinguish between exoplanet candidates and non-exoplanets (false positives), achieving high accuracy in the predictions.

Technologies Used

Dataset

The dataset used in this project is sourced from Kaggle and contains time series data from the Kepler Space Telescope. The dataset includes labeled instances indicating the presence of exoplanets. You can download the dataset from here.

Project Structure

  • data/: Directory containing the dataset files.
  • notebooks/: Jupyter notebooks used for data exploration, preprocessing, and model development.
  • src/: Source code for the project, including the implementation of the Random Forest classifier.
  • models/: Saved models for future use and analysis.
  • README.md: Project documentation.

Author

How It Works

  1. Data Preprocessing: The raw time series data is cleaned and preprocessed using Pandas and Numpy. Features such as mean, variance, and skewness of the light curves are extracted to serve as inputs to the model.
  2. Model Training: A Random Forest classifier is trained on the preprocessed dataset to classify instances as either exoplanets or non-exoplanets.
  3. Model Evaluation: The trained model is evaluated using metrics like accuracy, precision, recall, and F1 score to ensure its effectiveness in identifying exoplanets.

Results

The Random Forest classifier achieved an F1 score of 0.89, indicating a strong ability to balance precision and recall in identifying exoplanet candidates from the dataset.

Conclusion

The project demonstrates the effectiveness of machine learning techniques, specifically the Random Forest classifier, in identifying exoplanets from stellar observation data with high accuracy.

Installation and Usage

  1. Clone the repository:
    git clone https://github.com/sandipanrakshit34/hunt-for-exoplanets.git

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published