Skip to content

A simple example project using Kedro 0.16.x to run Scikit-learn tasks based on Kaggle's Titanic data.

Notifications You must be signed in to change notification settings

Minyus/kedro_sklearn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kedro Scikit-learn

This project works with Kedro 0.16.x only.

Please use kedro-starters-sklearn which works with Kedro 0.17.x.

A simple example project using Kedro and Scikit-learn

Pipeline visualized by Kedro-viz

How to run:

1. Install dependencies

$ pip install kedro scikit-learn pandas kedro-viz 

Note: kedro-viz is optional.

2. Clone this repository and cd into it

$ git clone https://github.com/Minyus/kedro_sklearn.git
$ cd kedro_sklearn

3. Go to next step to use the Iris datasets included in this project.

Optionally, you may dowonload another dataset such as Kaggle Titanic, replace train.csv and test.csv in data/input directory, and modify base/parameters.yml to set parameters appropriate for the dataset.

4. Run main.py

$ python main.py

Alternatively, Kedro CLI can be used to run.

$ kedro run

Kedro catalog configuration

In this example project, the Kedro catalog is configured in Python code at src/catalogs/catalog.py using a hook at src/hooks/add_catalog_dict.py so users can get benefit from linting/auto-completion by the Python IDE. It is also possible to configure in catalog.yml as well.

Simplified Kedro project template

This project was created from the GitHub template repository at https://github.com/Minyus/kedro_template

To use for a new project, fork the template repository and hit Use this template button next to Clone or download.

About

A simple example project using Kedro 0.16.x to run Scikit-learn tasks based on Kaggle's Titanic data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages