Missing data are a common problem in many fields of human endeavor ranging from social sciences to economics and from political research to entertainment industry. The objective of this thesis is to study and apply missing data algortihms for treatment of missing data which encountered on bank marketing data sets. Six missing data imputation algortihms are analyzed which are mean, hot-deck, k-nearest neighbor, regression, expectation maximization and multiple imputation method. By imputation of missing data, the accuracy of clustering and association with regard to aspect of the data can be increased. This thesis also explores the advantages and disadvantages of each used imputation method. Similarities and differences between each imputation method is discussed, as well.
-
Notifications
You must be signed in to change notification settings - Fork 1
kaplansinan/Imputation-of-Missing-Data
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published