This repo will be used as a base python template for the Data Engineering course.
Date Created: 2023-08-31
contains the information about the repository and instructions for using it
contains the list of packages and libraries which are required for running the project.
Note limited items have been added as this is an example repository, the list will be updated as per the requirements over time
github actions is used to automate the following 4 actions whenever a change is made to the files in the repository:
install
: installs the packages and libraries mentioned in the requirements.txtformat
: uses black to format the python fileslint
: uses pylint to lint the python filestest
: uses pytest to test the python codes using the test_* files to test the main files
Note -if all the processes run successfully the following output will be visible in github actions:
contains the instructions for the processes used in github actions
contains the dockerfile
and devcontainer.json
files which are used to build and define the setting of the virtual environment (codespaces - python) for running the code.
This repository is used only as a template, however the following 2 files are present which will be used for testing and as an example:
example.py
: this file contains the actual code for the project. For this example, we use a simple function which returns the sum of 2 numberstest_example.py
: this file is used for testing and it contains the test conditions and the expected output from the example.py code for the given cases. Sample execution of test file: