This is a project centered towards applying machine learning and time series algorithms in the air-quality sensor data. A device known as Purple Air used to measure the Air quality especially particulate matter in the atmosphere was used for data collection since September 2019 to January 2020. The data was collected but there was none to make the sense out of the data and thus gave me an idea to delve deep into it and find out the insights.
Step | Description | Tags |
---|---|---|
Problem Framing | A text file outlining the statement of the the problem. | |
Data Sourcing | Data was collected by Purple Air device from some of the mining sites in Kenya. About 94 csv files were used for the data storage. | |
Data Cleaning | Data From purple air is a bit messy. Needs some Massaging. | |
EDA | Feature exploration with much of visualizations to enhance data understanding. |