Notebook might not open because of heavy visuals, you can see it on kaggle also https://www.kaggle.com/shweta2407/oversampling-vs-undersampling-techniques
An unbalanced data or skewed data is the dataset that has its most of the data falling in one class and rest in others.
To classify this type of data, we need to first balance the data.
Apply different resampling techniques to balance the data : there are 2 kinds of resampling techniques - OVERSAMPLING & UNDERSAMPLING techniques.
SMOTE - Synthetic Minority Oversampling Technique
NearMiss Version 1, 2, 3
Tomek Links
Condensed Nearest Neighbor
Edited Nearest Neighbor
One Sided Selection (Tomek Links and the Condensed Nearest Neighbor (CNN)
Neighborhood Cleaning Rule (Condensed Nearest Neighbor & Edited Nearest Neighbors )