bankruptcy; imbalance; sampling; machine learning
CC BY-NC-SA 4.0
Bankruptcy prediction is a widely researched topic, yet few studies address the class imbalance problem. This paper proposes a new technique that applies a bagging undersampling procedure to balance the data and compares it with random undersampling and five oversampling techniques. Each technique is evaluated by the balanced accuracy, sensitivity, and specificity of a random forest trained on the resampled data. The results show that models trained after oversampling are prone to overfitting, whereas the model trained after applying the proposed method achieves the highest balanced accuracy without overfitting.
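The abstract does not spell out the resampling procedure, so the following is only a minimal sketch of one common form of bagging undersampling, not the paper's exact method. It assumes binary labels with 1 for the minority (bankrupt) class and 0 for the majority class, and that each bag keeps every minority row plus an equally sized random draw from the majority rows. The function names `bagging_undersample` and `evaluate` are hypothetical, and the random forest itself is omitted; `evaluate` only shows how the three reported metrics relate.

```python
import random


def bagging_undersample(X, y, n_bags=5, seed=0):
    """Build n_bags balanced subsets of (X, y).

    Assumption: label 1 is the minority class and label 0 the majority.
    Each bag holds all minority rows plus a same-sized random sample
    (without replacement) of majority rows.
    """
    rng = random.Random(seed)
    minority = [i for i, label in enumerate(y) if label == 1]
    majority = [i for i, label in enumerate(y) if label == 0]
    if not minority or len(minority) > len(majority):
        raise ValueError("expected a non-empty minority class labelled 1")

    bags = []
    for _ in range(n_bags):
        idx = minority + rng.sample(majority, len(minority))
        rng.shuffle(idx)  # avoid ordering the bag by class
        bags.append(([X[i] for i in idx], [y[i] for i in idx]))
    return bags


def evaluate(y_true, y_pred):
    """Return sensitivity, specificity, and balanced accuracy."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must appear in y_true")

    sensitivity = tp / (tp + fn)  # share of bankrupt firms detected
    specificity = tn / (tn + fp)  # share of healthy firms detected
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "balanced_accuracy": (sensitivity + specificity) / 2,
    }
```

In such a setup, one model would typically be trained per bag and the predictions combined, for example by majority vote. Balanced accuracy is the mean of sensitivity and specificity, so a model that labels every firm as non-bankrupt scores only 0.5 no matter how skewed the data are.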