Dataset shift quantification for credit card fraud detection

Yvan Lucas; Pierre-Edouard Portier; Léa Laporte; Sylvie Calabretto; Liyun He-Guelton; Frédéric Oble; Michael Granitzer

doi:10.1109/AIKE.2019.00024

Conference paper, Year: 2019


Yvan Lucas

Role: Author

Pierre-Edouard Portier

Role: Author
PersonId : 2642
IdHAL : pierre-edouard-portier
IdRef : 156423898

Distribution, Recherche d'Information et Mobilité

Léa Laporte

Role: Author
PersonId : 3200
IdHAL : lea-laporte
ORCID : 0000-0001-5227-2735
IdRef : 180044990

Distribution, Recherche d'Information et Mobilité

Sylvie Calabretto

Role: Author
PersonId : 7155
IdHAL : sylvie-calabretto
ORCID : 0000-0002-4597-4680
IdRef : 061333654

Distribution, Recherche d'Information et Mobilité

Liyun He-Guelton

Role: Author

Atos Worldline

Frédéric Oble

Role: Author

Atos Worldline

Michael Granitzer

Role: Author
PersonId : 1026059

University of Passau

Abstract

Machine learning and data mining techniques have been used extensively to detect credit card fraud. However, purchase behaviour and fraudster strategies may change over time. This phenomenon is called dataset shift, or concept drift, in the domain of fraud detection. In this paper, we present a method to quantify, day by day, the dataset shift in our face-to-face credit card transaction dataset (the card holder is located in the shop). In practice, we classify the days against each other and measure how well the classification performs. The better the classification, the more the buying behaviour differs between the two days, and vice versa. We thereby obtain a distance matrix characterizing the dataset shift. After agglomerative clustering of the distance matrix, we observe that the dataset shift pattern matches the calendar events of this time period (holidays, weekends, etc.). We then incorporate this dataset shift knowledge into the credit card fraud detection task as a new feature, which leads to a small improvement in detection.
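The pipeline described in the abstract (classify each pair of days, turn classification quality into a distance, then cluster the distance matrix) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the record: the synthetic two-feature "days", the nearest-centroid classifier, held-out accuracy as the quality measure, the rescaling `2 * accuracy - 1`, and average linkage. The function names (`make_day`, `day_vs_day_score`, `distance_matrix`, `agglomerative`) are hypothetical.

```python
import random
from statistics import fmean

def make_day(n, mean, rng):
    """Hypothetical synthetic day: n transactions with two numeric features."""
    return [(rng.gauss(mean, 1.0), rng.gauss(mean, 1.0)) for _ in range(n)]

def centroid(rows):
    """Feature-wise mean of a list of transactions."""
    return tuple(fmean(col) for col in zip(*rows))

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def day_vs_day_score(day_a, day_b):
    """Fit a nearest-centroid classifier (a stand-in, assumed here) on the
    first half of each day and return its accuracy on the second halves."""
    half_a, half_b = len(day_a) // 2, len(day_b) // 2
    c_a, c_b = centroid(day_a[:half_a]), centroid(day_b[:half_b])
    test = [(x, 0) for x in day_a[half_a:]] + [(x, 1) for x in day_b[half_b:]]
    correct = sum(
        (0 if sq_dist(x, c_a) <= sq_dist(x, c_b) else 1) == label
        for x, label in test
    )
    return correct / len(test)

def distance_matrix(days):
    """Pairwise day distances: the easier two days are to tell apart, the larger."""
    n = len(days)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            acc = day_vs_day_score(days[i], days[j])
            # Accuracy near 0.5 means indistinguishable days; rescale to [0, 1].
            dist[i][j] = dist[j][i] = max(0.0, 2 * acc - 1)
    return dist

def agglomerative(dist, n_clusters):
    """Average-linkage agglomerative clustering on a precomputed distance matrix."""
    clusters = [[i] for i in range(len(dist))]

    def linkage(a, b):
        return fmean(dist[i][j] for i in a for j in b)

    while len(clusters) > n_clusters:
        pairs = ((p, q) for p in range(len(clusters))
                 for q in range(p + 1, len(clusters)))
        p, q = min(pairs, key=lambda pq: linkage(clusters[pq[0]], clusters[pq[1]]))
        clusters[p] += clusters.pop(q)  # merge the two closest clusters
    return [sorted(c) for c in clusters]

rng = random.Random(0)
# Hypothetical week: five "working days" and two "weekend days" with shifted features.
days = [make_day(200, 0.0, rng) for _ in range(5)]
days += [make_day(200, 3.0, rng) for _ in range(2)]
dist = distance_matrix(days)
clusters = agglomerative(dist, n_clusters=2)
print(clusters)
```

On this synthetic data the shifted days are easy to separate, so the two groups should end up in different clusters. The resulting cluster label of each day is one plausible way to expose the shift to a downstream fraud classifier as an extra feature, which is the use the abstract mentions.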

Domains

Artificial intelligence [cs.AI]


https://hal.science/hal-02178042

Submitted on: Tuesday, July 9, 2019, 15:06:15

Last modified on: Wednesday, July 5, 2023, 15:28:04

Dates and versions

hal-02178042 , version 1 (09-07-2019)

Identifiers

HAL Id : hal-02178042 , version 1
ARXIV : 1906.06977
DOI : 10.1109/AIKE.2019.00024

Cite

Yvan Lucas, Pierre-Edouard Portier, Léa Laporte, Sylvie Calabretto, Liyun He-Guelton, et al. Dataset shift quantification for credit card fraud detection. AIKE IEEE International Conference on Artificial Intelligence and Knowledge Engineering, Jun 2019, Cagliari, Italy. ⟨10.1109/AIKE.2019.00024⟩. ⟨hal-02178042⟩


Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INSA-GROUPE UDL

