Don't forget to hit the ⭐ if you like this repo.
The information in this GitHub repository is part of the materials for the subject High Performance Data Processing (SECP3133). This folder contains general big data information as well as big data case studies using Malaysian datasets. This case study was created by a Bachelor of Computer Science (Data Engineering) student at Universiti Teknologi Malaysia.
- Python for beginners
- Web scraping and Python web framework
- Exploratory data analysis
- Big data processing
- Case Study
📚 5 amazing GitHub repos for data science! Learn skills or discover useful resources with these repositories.
1️⃣ ML for Beginners by Microsoft: Learn machine learning with Microsoft’s hands-on curriculum.
2️⃣ Deep Learning Drizzle: Find top universities’ publicly available deep learning classes.
3️⃣ Data Science Interviews: Prepare for your upcoming interview with this repository of questions.
4️⃣ Awesome Machine Learning: Discover machine learning tools and resources for beginners and advanced practitioners alike.
- Awesome Public Datasets
- Portal Data Terbuka Malaysia
- Department of Statistics Malaysia
- data.world
- Dataportal.asia
- knoema
- The World Bank
- Dataset Search - Google
- UCI Machine Learning Repository
- Kaggle datasets
- Awesome-public-datasets
- Datahub.io
- Earthdata
- CERN Open Data Portal
Please create an Issue for any improvements, suggestions or errors in the content.
You can also contact me on LinkedIn with any other queries or feedback.