A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2021

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

Eustache Diemert
Artem Betlei
  • Fonction : Auteur
Christophe Renaudin
  • Fonction : Auteur
Massih-Reza Amini
Théophane Gregoir
  • Fonction : Auteur
Thibaud Rahier
  • Fonction : Auteur

Résumé

Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collection of 13.9 million samples collected from several randomized control trials, scaling up previously available datasets by a healthy 210x factor. We provide details on the data collection and perform sanity checks to validate the use of this data for causal inference tasks. First, we formalize the task of uplift modeling (UM) that can be performed with this data, along with the relevant evaluation metrics. Then, we propose synthetic response surfaces and heterogeneous treatment assignment providing a general set-up for ITE prediction. Finally, we report experiments to validate key characteristics of the dataset leveraging its size to evaluate and compare - with high statistical significance - a selection of baseline UM and ITE prediction methods.

Dates et versions

hal-03439178 , version 1 (22-11-2021)

Identifiants

Citer

Eustache Diemert, Artem Betlei, Christophe Renaudin, Massih-Reza Amini, Théophane Gregoir, et al.. A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling. 2021. ⟨hal-03439178⟩
25 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More