Skip to content

revanth7667/Duke_IDS_706-DE

Repository files navigation

Duke IDS-706 Data Engineering Python Template CI

This repo will be used as a base python template for the Data Engineering course.

Date Created: 2023-08-31

1. README.md

contains the information about the repository and instructions for using it

2. requirements.txt

contains the list of packages and libraries which are required for running the project.

Note limited items have been added as this is an example repository, the list will be updated as per the requirements over time

3. github_actions.yml

github actions is used to automate the following 4 actions whenever a change is made to the files in the repository:

  • install : installs the packages and libraries mentioned in the requirements.txt
  • format : uses black to format the python files
  • lint : uses pylint to lint the python files
  • test : uses pytest to test the python codes using the test_* files to test the main files

Note -if all the processes run successfully the following output will be visible in github actions: Success Build

4. Makefile

contains the instructions for the processes used in github actions

5. .devcontainer

contains the dockerfile and devcontainer.json files which are used to build and define the setting of the virtual environment (codespaces - python) for running the code.

6. Python files

This repository is used only as a template, however the following 2 files are present which will be used for testing and as an example:

  • example.py : this file contains the actual code for the project. For this example, we use a simple function which returns the sum of 2 numbers
  • test_example.py : this file is used for testing and it contains the test conditions and the expected output from the example.py code for the given cases. Sample execution of test file: test_output

About

Repository for MIDS Data Engineering

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published