Skip to content

[BUG]The sampling method of the BRFClassifier is different from the paper #838

@Chengwen-98

Description

@Chengwen-98

Describe the bug

Hi.The following is the sampling method of BRF in the paper Using Random Forest to Learn Imbalanced Data

For each iteration in random forest, draw a bootstrap sample from the minority class. 
Randomly draw the same number of cases, with replacement, from the majority class.

My interpretation is that the minority samples in each sub-training set are selected by bootstrap and each sub-training set is balanced.Then these sub-training sets are given to traditional random forest's trees.
But in the code of imblearn\ensemble\_forest.py\_local_parallel_build_trees ,I found that all minority samples in the training set are used in each sub-training set,the minority samples in the sub-training set of each tree are the same.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions