Skip to content

Analyze & clean a football matches database, and train 3 ML models to predict match outcomes.

Notifications You must be signed in to change notification settings

amrawadk/football-data-analysis

Repository files navigation

Football data analysis

This project aims to clean up football data inside a kaggle database and then train 3 machine learning models to predict match results.

The data clean up was done with Pandas and contained some complex manipulation due to the original dataformat vs. the desired output.

The trained models are:

  1. ADA Boost
  2. Random Forest
  3. Tensorflow NN

Note: The ML part only shows the code & data flow, no model optimization was done.

Setup

  1. Inside the project directory, Create a virtual environment.
C:\Python36\python.exe -m venv venv
  1. activate the virtual environment and install the required packages.
venv\Scripts\activate
# the prompt should change and have (venv) before the path
python -m pip install -r requirements.txt
  1. Run Jupyter notebook
jupyter notebook

About

Analyze & clean a football matches database, and train 3 ML models to predict match outcomes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published