👋 Welcome! This short interactive tutorial will show you how to use the scikit-learn Python package to perform basic machine learning analysis. It will also cover how to visualize your results with the matplotlib and seaborn Python packages.
This tutorial was developed to be presented at the 2021 edition of the OHBM BrainHack 🧠
- The content of this tutorial is divided into the following notebooks:
Notebook | Content |
---|---|
01. Preliminaries | Introduction to the tutorial and dataset |
02. Core concepts | Estimators, regression, classification and clustering in scikit-learn |
03. Pipelines | Transformers, preprocessing, feature selection, feature engineering, dimensionality reduction and pipelines in scikit-learn |
04. Overfitting | The problem of overfitting, cross-validation, regularization and hyper-parameter tuning in scikit-learn |
05. Visualization | Basic components of a matplotlib plot and basic plots with seaborn |
- Each notebook provides small comprehension exercises, and a final integration exercise.
-
To run these notebooks in Binder head here:
-
To run these notebooks locally, follow these steps (you need to have installed git and python in your computer):
$ git clone https://github.com/martinagvilas/ML-Vis-BrainHack-2021.git
$ cd {path_to_cloned_folder}
$ pip install -r requirements.txt
- Learn how to load and prepare your data for machine learning analysis with scikit-learn.
- Learn how to perform regression, classification and clustering analysis with scikit-learn.
- Learn the concepts of regularization, cross-validation, hyper-parameter tuning, and how to implement them using scikit-learn.
- Learn how to inspect and evaluate a machine learning model.
- Learn how to plot your results with matplotlib and seaborn.
- For this tutorial, we assume you have:
- Basic knowledge of machine learning concepts. For example, you know the difference between supervised/unsupervised learning, or the difference between classification, regression and clustering models.
- Basic experience with Python.
- We also assume that you have seen the video "A tutorial on machine learning" by Laura Suarez.
This event follows the OHBM BrainHack code of conduct. Please make sure to read it before participating in our session.
If you run into any problems while executing this tutorial, please open an issue describing the problem.
We also welcome any enhancements, ideas or discussions on the structure or contents covered in the notebooks.