Financial data classification plays an important role in investment and banking industry with the purpose to control default risk, improve cash and select the best customers. Ensemble learning and classification systems are becoming gradually more applied to classify financial data where outputs from different classification systems are combined. The objective of this research is to assess the relative performance of existing state-of-the-art ensemble learning and classification systems with applications to corporate bankruptcy prediction and credit scoring. The considered ensemble systems include AdaBoost, LogitBoost, RUSBoost, subspace, and bagging ensemble system. The experimental results from three datasets: one is composed of quantitative attributes, one encompasses qualitative data, and another one combines both quantitative and qualitative attributes. By using ten-fold cross-validation method, the experimental results show that AdaBoost is effective in terms of low classification error, limited complexity, and short time processing of the data. In addition, the experimental results show that ensemble classification systems outperform existing models that were recently validated on the same databases. Therefore, ensemble classification system can be employed to increase the reliability and consistency of financial data classification task.

Performance assessment of ensemble learning systems in financial data classification

Bekiros S.;
2020-01-01

Abstract

Financial data classification plays an important role in investment and banking industry with the purpose to control default risk, improve cash and select the best customers. Ensemble learning and classification systems are becoming gradually more applied to classify financial data where outputs from different classification systems are combined. The objective of this research is to assess the relative performance of existing state-of-the-art ensemble learning and classification systems with applications to corporate bankruptcy prediction and credit scoring. The considered ensemble systems include AdaBoost, LogitBoost, RUSBoost, subspace, and bagging ensemble system. The experimental results from three datasets: one is composed of quantitative attributes, one encompasses qualitative data, and another one combines both quantitative and qualitative attributes. By using ten-fold cross-validation method, the experimental results show that AdaBoost is effective in terms of low classification error, limited complexity, and short time processing of the data. In addition, the experimental results show that ensemble classification systems outperform existing models that were recently validated on the same databases. Therefore, ensemble classification system can be employed to increase the reliability and consistency of financial data classification task.
2020
27
1
3
9
bankruptcy prediction; credit scoring; ensemble classifiers; ensemble learning; financial data classification
Lahmiri S.; Bekiros S.; Giakoumelou A.; Bezzina F.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1924273
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 30
  • ???jsp.display-item.citation.isi??? 23
social impact