
Don't forget to hit the ⭐ if you like this repo.

About Us

The information in this GitHub repository is part of the materials for the subject High Performance Data Processing (SECP3133). This folder contains general big data information as well as big data case studies using Malaysian datasets. This case study was created by a student of the Bachelor of Computer Science (Data Engineering) programme at Universiti Teknologi Malaysia.

Contents:

- Notes
  - Big Data: Pandas
  - Big Data: Alternatives to Pandas for Processing Large Datasets
    - Modin
    - Dask
    - Datatable
  - 🎖️ Comparison between libraries
  - Big Data: Case study
- Lab
  - Pandas
  - Modin
  - Dask
  - Comparison between libraries

Contribution 🛠️

Please create an Issue for any improvements, suggestions or errors in the content.

You can also contact me via LinkedIn with any other queries or feedback.