The "Hunt for Exoplanets using Machine Learning" project aims to identify potential exoplanets from a large dataset of stellar observations. Using machine learning techniques, this project analyzes light curves—graphs of light intensity over time—to detect the subtle dimming that indicates an exoplanet passing in front of a star. This project employs a Random Forest classifier to distinguish between exoplanet candidates and non-exoplanets (false positives), achieving high accuracy in the predictions.
- Programming Language: Python
- Libraries/Frameworks:
- TensorFlow
- Scikit-learn
- Pandas
- Numpy
- Matplotlib
- Dataset: Kepler Labeled Time Series Data
The dataset used in this project is sourced from Kaggle and contains time series data from the Kepler Space Telescope. The dataset includes labeled instances indicating the presence of exoplanets. You can download the dataset from here.
- data/: Directory containing the dataset files.
- notebooks/: Jupyter notebooks used for data exploration, preprocessing, and model development.
- src/: Source code for the project, including the implementation of the Random Forest classifier.
- models/: Saved models for future use and analysis.
- README.md: Project documentation.
- Data Preprocessing: The raw time series data is cleaned and preprocessed using Pandas and Numpy. Features such as mean, variance, and skewness of the light curves are extracted to serve as inputs to the model.
- Model Training: A Random Forest classifier is trained on the preprocessed dataset to classify instances as either exoplanets or non-exoplanets.
- Model Evaluation: The trained model is evaluated using metrics like accuracy, precision, recall, and F1 score to ensure its effectiveness in identifying exoplanets.
The Random Forest classifier achieved an F1 score of 0.89, indicating a strong ability to balance precision and recall in identifying exoplanet candidates from the dataset.
The project demonstrates the effectiveness of machine learning techniques, specifically the Random Forest classifier, in identifying exoplanets from stellar observation data with high accuracy.
- Clone the repository:
git clone https://github.com/sandipanrakshit34/hunt-for-exoplanets.git