Effective Bankruptcy Prediction Models for North American Companies
bankruptcy; prediction models; North American companies; undersampling; oversampling; balanced accuracy
Encyclopedia of Data Science and Machine Learning
Bankruptcy prediction is a widely researched topic. However, few studies focus on dealing with the imbalance problem. This article proposes a new technique that applies a bagging undersampling procedure to balance the data and compares it to random undersampling and five oversampling techniques. The performance of the algorithm is evaluated by a random forest's balanced accuracy, sensitivity, and specificity. The results show that models trained after applying the oversampling techniques are prone to overfitting, and the model trained after applying the proposed method had the highest balanced accuracy without overfitting.