-
Introduction
This is the final project, named "Telco Customer Churn Prediction", for the course "CS 513 - Knowledge Discovery and Data Mining" at Stevens Institute of Technology. The project was conducted in collaboration among three Computer Science students, namely Shiva Rama Krishna Mandadapu, Hantao Gao, and Thanapoom Phatthanaphan.
The objective of the project is to provide a comprehensive overview of Exploratory Data Analysis (EDA) principles, specifically for analyzing all relevant customer data of a telecommunications company. This process aims to gain a clear understanding of the dataset. Additionally, the project will involve the implementation of nine different machine learning models to predict whether a customer is likely to churn (leave) a telecommunications company's service or not. Furthermore, we will evaluate the performance of each model by presenting the classification report and confusion matrix to determine the optimal one.
-
Problem Statement
Develop a classification model to predict whether a customer will churn (leave) a telecommunications company’s service or not
-
Dataset Description
The dataset contains information on telecom customers such as their demographics, usage patterns, and account information. The target variable is whether the customer churned (1) or not (0). The dataset has 7,043 observations and 20 input features.
- Customers who left within the last month – the column is called Churn
- Services that each customer has signed up for – phone, multiple lines, internet, online security, online backup, device protection, tech support, and streaming TV and movies
- Customer account information – how long they’ve been a customer, contract, payment method, paperless billing, monthly charges, and total charges
- Demographic info about customers – gender, age range, and if they have partners and dependents
Source of Dataset: https://www.kaggle.com/datasets/blastchar/telco-customer-churn
-
Implementation Strategy and Algorithms used
- Decision Trees
- Naive Bayes Classifier
- AdaBoost Classifier
- Multilayer Perceptron
- Bagging along with Random Forest
- K-nearest neighbor with Grid Search CV
- Logistic Regression with Grid Search CV
- Random Forest with Randomized Search CV
- Support Vector Machine with Grid Search CV
-
Notifications
You must be signed in to change notification settings - Fork 0
Developed a churn prediction classification model using various techniques including: EDA, Decision trees, Naive Bayes, AdaBoost, MLP, Bagging, RF, KNN, logistic regression, SVM, Hyperparameter tuning using Grid Search CV and Randomized Search CV.
ohmthanap/Telecom-Customer-Churn-Predictions
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Developed a churn prediction classification model using various techniques including: EDA, Decision trees, Naive Bayes, AdaBoost, MLP, Bagging, RF, KNN, logistic regression, SVM, Hyperparameter tuning using Grid Search CV and Randomized Search CV.
Topics
python
random-forest
scikit-learn
exploratory-data-analysis
jupyter-notebook
seaborn
naive-bayes-classifier
logistic-regression
support-vector-machine
adaboost
decision-tree-classifier
churn-prediction
k-nearest-neighbours
bagging
multilayer-perceptron
numpy-library
pandas-library
gridsearchcv
matplotlib-pyplot
randomizedsearchcv
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published