This repository stores the Data Science projects that I have followed and worked on over a period of time. You can find the code for each project in the repository. The projects are listed below. When I started this work I was a beginner in the field, so there may be many issues that I have not noticed. You are welcome to point them out, and if you have any problems with my work, please contact me by e-mail: deepesch@gmail.com
-
Demos of the most common techniques used in industry, from clustering to streaming.
-
Wikipedia Data Engineering project on the Databricks cloud-based Spark platform. Wikimapper helps active contributors find the most-searched Wikipedia articles that do not yet exist.
-
Visualization libraries in Python:
There are many different Python plotting packages. Among the available libraries, some may suit your needs better than others. Some are interfaces to existing plotting libraries, while others are new, Python-centered implementations.
-
Word2Vec - Kaggle Competition:
Machine Learning Competition (Movie Ratings).
-
Naive Bayes Implementation: Natural Language Processing Project
-
Support Vector Machine Implementation: Machine Learning Project