Skip to content

Hakimovich99/distributed-data-management-phase-1

Repository files navigation

Welcome to the group 5 repository

The group is composed of:

  • Rania Baguia 000459242
  • Hakim Amri 000459153
  • Julian Cailliau 000459856
  • Mehdi Jdaoudi 000457507

Important note

The data is not being pushed on the git, as github does not tolerate more the 25MB per file (the full data is arround 88MB). As such, we just pushed the local files per sensor. To get the merged one, please run the preprocessing notebook.

The notebook is able to download the data and structured the project. Please also use the docker image. As we are parallelising the query process, there is a known bug on ARM macs which doesn't allow spark nodes to make external requests.

In case your having troubles getting the files, please do not hesitate to reach out to us as we have put a lot of effort in this work.

Thank you for your comprehension,

Group 5

Video link

Please click on the following link to see the video.

Review Assignment Due Date

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published