GitHub - kedro-org/kedro at 938948ba8372bd6af98dfbadcf2864f581c9b1ae

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 2,039 Commits
.circleci		.circleci
.github		.github
dependency		dependency
docs		docs
features		features
kedro		kedro
static		static
tests		tests
tools		tools
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
CITATION.cff		CITATION.cff
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
RELEASE.md		RELEASE.md
behave.ini		behave.ini
kedro_technical_charter.pdf		kedro_technical_charter.pdf
pyproject.toml		pyproject.toml
setup.py		setup.py
test_requirements.txt		test_requirements.txt
trufflehog-ignore.txt		trufflehog-ignore.txt

Repository files navigation

What is Kedro?

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning. Kedro is hosted by the LF AI & Data Foundation.

How do I install Kedro?

To install Kedro from the Python Package Index (PyPI) simply run:

pip install kedro

It is also possible to install Kedro using conda:

conda install -c conda-forge kedro

Our Get Started guide contains full installation instructions, and includes how to set up Python virtual environments.

What are the main features of Kedro?

A pipeline visualisation generated using Kedro-Viz

Feature	What is this?
Project Template	A standard, modifiable and easy-to-use project template based on Cookiecutter Data Science.
Data Catalog	A series of lightweight data connectors used to save and load data across many different file formats and file systems, including local and network file systems, cloud object stores, and HDFS. The Data Catalog also includes data and model versioning for file-based systems.
Pipeline Abstraction	Automatic resolution of dependencies between pure Python functions and data pipeline visualisation using Kedro-Viz.
Coding Standards	Test-driven development using `pytest`, produce well-documented code using Sphinx, create linted code with support for `flake8`, `isort` and `black` and make use of the standard Python logging library.
Flexible Deployment	Deployment strategies that include single or distributed-machine deployment as well as additional support for deploying on Argo, Prefect, Kubeflow, AWS Batch and Databricks.

How do I use Kedro?

The Kedro documentation includes three examples to help get you started:

A typical "Hello World" example, for an entry-level description of the main Kedro concepts
An introduction to the project template using the Iris dataset
A more detailed spaceflights tutorial to give you hands-on experience

Why does Kedro exist?

Kedro is built upon our collective best-practice (and mistakes) trying to deliver real-world ML applications that have vast amounts of raw unvetted data. We developed Kedro to achieve the following:

To address the main shortcomings of Jupyter notebooks, one-off scripts, and glue-code because there is a focus on creating maintainable data science code
To enhance team collaboration when different team members have varied exposure to software engineering concepts
To increase efficiency, because applied concepts like modularity and separation of concerns inspire the creation of reusable analytics code

The humans behind Kedro

Kedro is maintained by a product team and a number of contributors from across the world.

Can I contribute?

Yes! Want to help build Kedro? Check out our guide to contributing to Kedro.

Where can I learn more?

There is a growing community around Kedro. Have a look at the Kedro FAQs to find projects using Kedro and links to articles, podcasts and talks.

Who likes Kedro?

There are Kedro users across the world, who work at start-ups, major enterprises and academic institutions like Absa, Acensi, Advanced Programming Solutions SL, AI Singapore, Augment Partners, AXA UK, Belfius, Beamery, Caterpillar, CRIM, Dendra Systems, Element AI, GetInData, GMO, Indicium, Imperial College London, ING, Jungle Scout, Helvetas, Leapfrog, McKinsey & Company, Mercado Libre Argentina, Modec, Mosaic Data Science, NaranjaX, NASA, Open Data Science LatAm, Prediqt, QuantumBlack, Retrieva, Roche, Sber, Société Générale, Telkomsel, Universidad Rey Juan Carlos, UrbanLogiq, Wildlife Studios, WovenLight and XP.

Kedro has also won Best Technical Tool or Framework for AI in the 2019 Awards AI competition and a merit award for the 2020 UK Technical Communication Awards. It is listed on the 2020 ThoughtWorks Technology Radar and the 2020 Data & AI Landscape.

How can I cite Kedro?

If you're an academic, Kedro can also help you, for example, as a tool to solve the problem of reproducible research. Use the "Cite this repository" button on our repository to generate a citation from the CITATION.cff file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is Kedro?

How do I install Kedro?

What are the main features of Kedro?

How do I use Kedro?

Why does Kedro exist?

The humans behind Kedro

Can I contribute?

Where can I learn more?

Who likes Kedro?

How can I cite Kedro?

About

Releases 55

Used by 2.7k

Contributors 228

Languages

License

kedro-org/kedro

Folders and files

Latest commit

History

Repository files navigation

What is Kedro?

How do I install Kedro?

What are the main features of Kedro?

How do I use Kedro?

Why does Kedro exist?

The humans behind Kedro

Can I contribute?

Where can I learn more?

Who likes Kedro?

How can I cite Kedro?

About

Topics

Resources

License

Code of conduct

Security policy

Citation

Stars

Watchers

Forks

Releases 55

Used by 2.7k

Contributors 228

Languages