Parametric and XGBoost Hurdle Model for estimating accident frequency - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2023

Parametric and XGBoost Hurdle Model for estimating accident frequency

Résumé

The Poisson model is a commonly used method for modeling count data with exogenous variables, but it can be limiting when dealing with data that has a high proportion of zeros. To address this issue, we propose the use of the Hurdle model as an alternative approach. We discuss the properties of the Hurdle model and demonstrate how it can be implemented using both parametric and nonparametric estimates, including the XGBoost method. To evaluate the effectiveness of our proposed XGBoost Hurdle model, we apply it to a car insurance dataset from a French insurance company. This dataset includes a significant number of drivers with zero accidents per year. Our results show that the XGBoost Hurdle model outperforms several other models when applied to this data type.
Fichier principal
Vignette du fichier
article_hurdle (1).pdf (431.99 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03739838 , version 1 (28-07-2022)
hal-03739838 , version 2 (13-01-2023)

Identifiants

  • HAL Id : hal-03739838 , version 2

Citer

Dafnis Krasniqi, Jean-Marc Bardet, Joseph Rynkiewicz. Parametric and XGBoost Hurdle Model for estimating accident frequency. 2023. ⟨hal-03739838v2⟩
297 Consultations
793 Téléchargements

Partager

Gmail Facebook X LinkedIn More