Bank Term Deposit Prediction

This project predicts whether a bank client will subscribe to a term deposit, using several machine learning algorithms.

Prerequisites

The following R libraries are required:

class
e1071
caret
rpart.plot
ggplot2
ranger
dplyr
corrplot
pROC
reshape2
shiny
xgboost
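
A quick way to see which of these libraries are already available is sketched below. It only reports what is missing; the `install.packages()` call is left commented out, and the variable names `required` and `missing_pkgs` are illustrative rather than taken from the project script.

```r
# Packages listed in the Prerequisites section.
required <- c("class", "e1071", "caret", "rpart.plot", "ggplot2", "ranger",
              "dplyr", "corrplot", "pROC", "reshape2", "shiny", "xgboost")

# requireNamespace() returns FALSE (rather than raising an error)
# when a package is not installed.
installed <- vapply(required, requireNamespace, logical(1), quietly = TRUE)
missing_pkgs <- required[!installed]

if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
  # install.packages(missing_pkgs)  # uncomment to install from CRAN
} else {
  message("All required packages are installed.")
}
```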

Dataset Description

File: bank-full.csv
Description: Main dataset used for training and evaluation.
File: DatasetTable.xlsx
Description: Provides a detailed description of the attributes present in the main dataset.

Data Pre-processing

Check for missing values in the dataset.
Identify and handle duplicated rows.
Convert the target variable 'y' to a binary format (0 for "no" and 1 for "yes").
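
The three steps above can be sketched in base R as follows. The data frame here is a small hypothetical stand-in for bank-full.csv, so the column values are made up purely for illustration; the real script may handle missing values and duplicates differently.

```r
# Toy stand-in for bank-full.csv (hypothetical values).
bank <- data.frame(
  age     = c(30, 45, 45, 52, NA),
  balance = c(1200, -50, -50, 300, 800),
  job     = c("admin.", "technician", "technician", "retired", "services"),
  y       = c("no", "yes", "yes", "no", "no"),
  stringsAsFactors = FALSE
)

# 1. Count missing values per column.
na_counts <- colSums(is.na(bank))
print(na_counts)

# 2. Count duplicated rows, then keep only the first occurrence of each.
n_dupes <- sum(duplicated(bank))
bank <- bank[!duplicated(bank), ]

# 3. Recode the target: "yes" becomes 1, "no" becomes 0.
bank$y <- ifelse(bank$y == "yes", 1L, 0L)
print(bank)
```

Removing duplicates before the train/test split avoids the same row appearing on both sides of the split.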

Exploratory Data Analysis (EDA)

Plot histograms for numeric attributes.
Visualize the distribution of the target variable 'y'.
Generate a correlation matrix for numeric variables.
Display bar plots for categorical variables.
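
A minimal sketch of these EDA steps using base R graphics is shown below. It runs on simulated data with hypothetical columns (`age`, `balance`, `duration`, `marital`), not on the real dataset, and the project itself may use ggplot2 and corrplot for the same plots.

```r
set.seed(1)

# Simulated stand-in data; column names and distributions are assumptions.
bank <- data.frame(
  age      = round(rnorm(200, mean = 40, sd = 10)),
  balance  = rnorm(200, mean = 1000, sd = 500),
  duration = rexp(200, rate = 1 / 250),
  marital  = sample(c("married", "single", "divorced"), 200, replace = TRUE),
  y        = rbinom(200, size = 1, prob = 0.12)
)

# Numeric predictors (the binary target is excluded).
numeric_cols <- setdiff(names(bank)[sapply(bank, is.numeric)], "y")

# Histograms for numeric attributes.
for (col in numeric_cols) {
  hist(bank[[col]], main = paste("Histogram of", col), xlab = col)
}

# Distribution of the target variable.
y_counts <- table(bank$y)
barplot(y_counts, main = "Distribution of y", xlab = "y", ylab = "Count")

# Correlation matrix for numeric variables.
cor_mat <- cor(bank[, numeric_cols])
print(round(cor_mat, 2))
# If corrplot is installed, corrplot::corrplot(cor_mat) draws this matrix.

# Bar plot for a categorical variable.
barplot(table(bank$marital), main = "Marital status", ylab = "Count")
```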

Model Building and Evaluation

The following models are trained and evaluated:

Logistic Regression

A generalized linear model (GLM) with a binomial family.
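
As an illustration, a binomial GLM can be fitted with `glm()` from base R. The sketch below uses simulated data, a 70/30 split, and a 0.5 probability cutoff; all three are assumptions made for the example and not necessarily what the project script does.

```r
set.seed(42)
n <- 500

# Simulated predictors and target (hypothetical data-generating rule:
# longer call duration raises the odds of y = 1).
sim <- data.frame(
  duration = rexp(n, rate = 1 / 250),
  balance  = rnorm(n, mean = 1000, sd = 500)
)
sim$y <- rbinom(n, size = 1, prob = plogis(-3 + 0.008 * sim$duration))

# 70/30 train/test split.
train_idx <- sample(n, size = 0.7 * n)
train <- sim[train_idx, ]
test  <- sim[-train_idx, ]

# Logistic regression: GLM with a binomial family (logit link by default).
fit_glm <- glm(y ~ duration + balance, data = train, family = binomial)
print(summary(fit_glm)$coefficients)

# Predicted probabilities, then class labels at a 0.5 cutoff.
prob <- predict(fit_glm, newdata = test, type = "response")
pred <- ifelse(prob > 0.5, 1, 0)
cat("Test accuracy:", mean(pred == test$y), "\n")
```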

Decision Tree

A recursive partitioning method using the rpart library.
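
A classification tree with `rpart()` might look like the sketch below. The data is simulated and the default tree settings are used, since the tuning applied in the project is not stated here.

```r
library(rpart)

set.seed(42)
n <- 500

# Simulated data; the target is a factor so rpart builds a classification tree.
sim <- data.frame(
  duration = rexp(n, rate = 1 / 250),
  balance  = rnorm(n, mean = 1000, sd = 500)
)
p_yes <- plogis(-3 + 0.008 * sim$duration)
sim$y <- factor(ifelse(rbinom(n, 1, p_yes) == 1, "yes", "no"),
                levels = c("no", "yes"))

train_idx <- sample(n, size = 0.7 * n)
train <- sim[train_idx, ]
test  <- sim[-train_idx, ]

# Recursive partitioning; method = "class" requests a classification tree.
fit_tree <- rpart(y ~ duration + balance, data = train, method = "class")

# Predicted class labels on the held-out rows.
pred <- predict(fit_tree, newdata = test, type = "class")
print(table(predicted = pred, actual = test$y))

# If rpart.plot is installed, rpart.plot::rpart.plot(fit_tree) draws the tree.
```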

Random Forest

An ensemble learning method that constructs a multitude of decision trees at training time.
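
The sketch below fits a random forest with the ranger package, which appears in the prerequisites; whether the project uses ranger directly or through caret is an assumption. The data is simulated, and `num.trees = 200` is an arbitrary choice for the example.

```r
library(ranger)

set.seed(42)
n <- 500

# Simulated data with a factor target (hypothetical columns).
sim <- data.frame(
  duration = rexp(n, rate = 1 / 250),
  balance  = rnorm(n, mean = 1000, sd = 500),
  age      = round(rnorm(n, mean = 40, sd = 10))
)
p_yes <- plogis(-3 + 0.008 * sim$duration)
sim$y <- factor(ifelse(rbinom(n, 1, p_yes) == 1, "yes", "no"),
                levels = c("no", "yes"))

train_idx <- sample(n, size = 0.7 * n)
train <- sim[train_idx, ]
test  <- sim[-train_idx, ]

# Each tree is grown on a bootstrap sample of the training rows;
# importance = "impurity" stores Gini-based variable importance.
fit_rf <- ranger(y ~ ., data = train, num.trees = 200,
                 importance = "impurity", seed = 42)

# Note that ranger's predict method takes `data`, not `newdata`.
pred <- predict(fit_rf, data = test)$predictions
print(table(predicted = pred, actual = test$y))

# Named numeric vector, one importance score per predictor.
print(sort(fit_rf$variable.importance, decreasing = TRUE))
```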

The results of the models are then compared in terms of accuracy, sensitivity, specificity, and balanced accuracy.
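
For reference, the four metrics can be computed from a confusion matrix as in the sketch below, treating 1 ("yes") as the positive class. The helper `classification_metrics()` and the example vectors are hypothetical; `caret::confusionMatrix()` reports the same quantities.

```r
# Compute accuracy, sensitivity, specificity, and balanced accuracy
# for binary labels coded 0/1, with 1 as the positive class.
classification_metrics <- function(actual, predicted) {
  tp <- sum(actual == 1 & predicted == 1)  # true positives
  tn <- sum(actual == 0 & predicted == 0)  # true negatives
  fp <- sum(actual == 0 & predicted == 1)  # false positives
  fn <- sum(actual == 1 & predicted == 0)  # false negatives

  sensitivity <- tp / (tp + fn)            # share of actual 1s found
  specificity <- tn / (tn + fp)            # share of actual 0s found

  c(accuracy          = (tp + tn) / length(actual),
    sensitivity       = sensitivity,
    specificity       = specificity,
    balanced_accuracy = (sensitivity + specificity) / 2)
}

# Made-up labels: 4 positives and 6 negatives.
actual    <- c(1, 1, 1, 1, 0, 0, 0, 0, 0, 0)
predicted <- c(1, 1, 1, 0, 0, 0, 0, 0, 1, 1)

m <- classification_metrics(actual, predicted)
print(round(m, 4))
```

Balanced accuracy is the mean of sensitivity and specificity, which makes it more informative than plain accuracy when one class is much rarer than the other.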

Visualization

Feature importance from the Random Forest model.
Comparison of model performance metrics using bar plots.
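
A grouped bar plot comparing the models can be sketched with ggplot2 and reshape2 as below. The numbers in `metrics` are placeholders chosen only to make the plot render; they are NOT results from this project and should be replaced with the values computed during evaluation.

```r
library(ggplot2)
library(reshape2)

# PLACEHOLDER values for illustration only, not actual model results.
metrics <- data.frame(
  model             = c("Logistic Regression", "Decision Tree", "Random Forest"),
  accuracy          = c(0.5, 0.5, 0.5),
  sensitivity       = c(0.4, 0.5, 0.6),
  specificity       = c(0.6, 0.5, 0.4),
  balanced_accuracy = c(0.5, 0.5, 0.5)
)

# Reshape from wide (one column per metric) to long (one row per model/metric).
long <- melt(metrics, id.vars = "model",
             variable.name = "metric", value.name = "value")

# One group of bars per metric, one bar per model.
p <- ggplot(long, aes(x = metric, y = value, fill = model)) +
  geom_col(position = "dodge") +
  labs(title = "Model performance comparison",
       x = "Metric", y = "Value", fill = "Model")
print(p)
```

The same long-format pattern works for feature importance: a data frame with one row per feature and one importance column can be passed to `geom_col()` directly.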

Acknowledgement

The Portuguese bank dataset used for this project came from the following paper:

Moro, S., Laureano, R. and Cortez, P. (2011). Using Data Mining for Bank Direct Marketing: An Application of the CRISP-DM Methodology.
