Data Engineering 1 Project - Group 20
The goal of this project was to design and implement a scalable data processing solution for the Million Song Dataset. The repository consists of classes to convert the files of the Million Song Dataset to a more suitable CSV file format with the help of a getter class provided by the Million Song Dataset. Additionally several analysis have been made as well as a scalability study.