Description
Describe the bug
Hi.The following is the sampling method of BRF in the paper Using Random Forest to Learn Imbalanced Data:
For each iteration in random forest, draw a bootstrap sample from the minority class.
Randomly draw the same number of cases, with replacement, from the majority class.
My interpretation is that the minority samples in each sub-training set are selected by bootstrap and each sub-training set is balanced.Then these sub-training sets are given to traditional random forest's trees.
But in the code of imblearn\ensemble\_forest.py\_local_parallel_build_trees
,I found that all minority samples in the training set are used in each sub-training set,the minority samples in the sub-training set of each tree are the same.
Metadata
Metadata
Assignees
Labels
No labels