This project was created for the Machine Learning exam of the Master's Degree programme at the University of Turin.
The project is divided into four sections:
- Decision Tree Models
- Distance Based Models
- Linear Models
- Probabilistic Models
The Decision Tree Models section contains the notebook iris_classification.ipynb.
The purpose of this notebook is to manipulate the scikit-learn Iris dataset by applying transformations to its data and training different Decision Tree classifiers.
We then analyze the performance of the classifiers according to different metrics, including the accuracy score and F1 score, and by plotting ROC curves.
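A minimal sketch of this workflow (not the notebook itself; the split ratio and tree depth here are illustrative choices):

```python
# Train a decision tree on the scikit-learn Iris dataset and report
# accuracy and macro-averaged F1 on a held-out test split.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, f1_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y)

clf = DecisionTreeClassifier(max_depth=3, random_state=42)  # depth is illustrative
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

print("accuracy:", accuracy_score(y_test, y_pred))
print("macro F1:", f1_score(y_test, y_pred, average="macro"))
```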
The Distance Based Models section contains two Python notebooks.
In iris_classification_knn.ipynb
we compare the prediction results obtained by decision trees and k-nearest neighbors on the Iris dataset. We use different weighting schemes (uniform or distance) to test the prediction accuracy of k-NN, and we determine the best k for the dataset split at hand.
Finally, we test and tune the gamma hyperparameter of a Radial Basis Function (RBF) kernel used in k-NN to weight data points.
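A sketch of this comparison, assuming the standard scikit-learn Iris split (the k range and gamma value are illustrative, not the notebook's tuned values):

```python
# Compare k-NN weighting schemes on Iris and plug in an RBF kernel
# as a custom weight function over neighbor distances.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# Pick the best k via cross-validation for each weighting scheme.
for weights in ("uniform", "distance"):
    scores = {k: cross_val_score(
        KNeighborsClassifier(n_neighbors=k, weights=weights),
        X_train, y_train, cv=5).mean() for k in range(1, 16)}
    best_k = max(scores, key=scores.get)
    print(weights, "best k:", best_k, "cv accuracy:", round(scores[best_k], 3))

# RBF kernel as weight: closer neighbors count exponentially more.
def rbf_weights(distances, gamma=0.5):  # gamma is the tunable hyperparameter
    return np.exp(-gamma * distances ** 2)

knn_rbf = KNeighborsClassifier(n_neighbors=10, weights=rbf_weights)
knn_rbf.fit(X_train, y_train)
print("RBF-weighted k-NN test accuracy:", knn_rbf.score(X_test, y_test))
```

`KNeighborsClassifier` accepts any callable as `weights`, which is what makes the RBF-kernel weighting a drop-in replacement for `"uniform"` or `"distance"`.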
In clustering.ipynb
we apply and compare two clustering algorithms: K-means and DBSCAN.
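A sketch of such a comparison on the Iris features (the DBSCAN `eps` below is an illustrative choice, not the notebook's tuned value); the adjusted Rand index measures agreement with the true labels:

```python
# Run K-means and DBSCAN on Iris and compare the resulting groupings
# against the true species labels with the adjusted Rand index (ARI).
from sklearn.datasets import load_iris
from sklearn.cluster import KMeans, DBSCAN
from sklearn.metrics import adjusted_rand_score

X, y = load_iris(return_X_y=True)

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
dbscan = DBSCAN(eps=0.6, min_samples=5).fit(X)  # eps chosen for illustration

print("K-means ARI:", adjusted_rand_score(y, kmeans.labels_))
print("DBSCAN  ARI:", adjusted_rand_score(y, dbscan.labels_))
```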
In the Linear Models section we apply support vector machines (SVMs) to an artificial dataset built by my professor.
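The professor's dataset is not distributed with this description, so as a stand-in this sketch fits SVMs with linear and RBF kernels to a synthetic non-linearly-separable dataset from scikit-learn:

```python
# Fit SVMs with two kernels to a synthetic two-moons dataset
# (a stand-in for the course's artificial dataset).
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

results = {}
for kernel in ("linear", "rbf"):
    svm = SVC(kernel=kernel, C=1.0).fit(X_train, y_train)
    results[kernel] = svm.score(X_test, y_test)
    print(kernel, "test accuracy:", results[kernel])
```

On this kind of data the RBF kernel can bend the decision boundary around the interleaving moons, which a linear kernel cannot.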
In the Probabilistic Models section we compare two probabilistic models for categorical data:
- multivariate Bernoulli
- multinomial
We adopt a dataset of Twitter messages labelled with emotions (Joy vs. Sadness).
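The labelled Twitter dataset is not included here, so this sketch contrasts the two models on a tiny made-up corpus: the Bernoulli model sees binary presence/absence of words, while the multinomial model sees word counts.

```python
# Naive Bayes with two event models on a toy "tweet" corpus.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import BernoulliNB, MultinomialNB

tweets = ["so happy today", "what a joyful day", "feeling sad and lonely",
          "sad news today", "happy happy joy", "lonely sad evening"]
labels = ["joy", "joy", "sadness", "sadness", "joy", "sadness"]

vec = CountVectorizer()
counts = vec.fit_transform(tweets)        # word counts -> multinomial
binary = (counts > 0).astype(int)         # word presence -> Bernoulli

bern = BernoulliNB().fit(binary, labels)
multi = MultinomialNB().fit(counts, labels)

new = vec.transform(["such a sad day"])
print("Bernoulli:  ", bern.predict((new > 0).astype(int))[0])
print("Multinomial:", multi.predict(new)[0])
```

The key difference: the Bernoulli model also penalizes a class for the words that are *absent* from a message, while the multinomial model only scores the words that occur.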
- Clone the repository
- Create a virtual environment and activate it
- Install the required libraries with `pip install -r requirements.txt`
- Open and launch any notebook
Libraries used: