Machine Learning or Econometrics for Credit Scoring: Let's Get the Best of Both Worlds * - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Machine Learning or Econometrics for Credit Scoring: Let's Get the Best of Both Worlds *

Elena Dumitrescu
  • Fonction : Auteur
  • PersonId : 866455
Sullivan Hué
  • Fonction : Auteur
Christophe Hurlin

Résumé

Decision trees and related ensemble methods like random forest are state-of-the-art tools in the field of machine learning for credit scoring. Although they are shown to outperform logistic regression, they lack interpretability and this drastically reduces their use in the credit risk management industry, where decision-makers and regulators need transparent score functions. This paper proposes to get the best of both worlds, introducing a new, simple and interpretable credit scoring method which uses information from decision trees to improve the performance of logistic regression. Formally, rules extracted from various short-depth decision trees built with couples of predictive variables are used as predictors in a penalized or regularized logistic regression. By modeling such univariate and bivariate threshold effects, we achieve significant improvement in model performance for the logistic regression while preserving its simple interpretation. Applications using simulated and four real credit defaults datasets show that our new method outperforms traditional logistic regressions. Moreover, it compares competitively to random forest, while providing an interpretable scoring function. JEL Classification: G10 C25, C53
Fichier principal
Vignette du fichier
Dumitrescu, Hue, Hurlin and Tokpavi (2020).pdf (450.96 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02507499 , version 1 (13-03-2020)
hal-02507499 , version 2 (02-11-2020)
hal-02507499 , version 3 (15-01-2021)

Identifiants

  • HAL Id : hal-02507499 , version 1

Citer

Elena Dumitrescu, Sullivan Hué, Christophe Hurlin, Sessi Tokpavi. Machine Learning or Econometrics for Credit Scoring: Let's Get the Best of Both Worlds *. 2020. ⟨hal-02507499v1⟩
591 Consultations
6649 Téléchargements

Partager

Gmail Facebook X LinkedIn More