A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

Eustache Diemert; Artem Betlei; Christophe Renaudin; Massih-Reza Amini; Théophane Gregoir; Thibaud Rahier

Pré-Publication, Document De Travail Année : 2021

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

(1) , , , , ,

Eustache Diemert

Fonction : Auteur
PersonId : 183385
IdHAL : eustache-diemert
ORCID : 0000-0003-2240-501X
IdRef : 253119650

Criteo AI Lab

Artem Betlei

Fonction : Auteur

Christophe Renaudin

Fonction : Auteur

Massih-Reza Amini

Fonction : Auteur
PersonId : 747054
IdHAL : massih-reza-amini
ORCID : 0000-0001-9032-4233
IdRef : 132277042

Théophane Gregoir

Fonction : Auteur

Thibaud Rahier

Fonction : Auteur

Résumé

Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collection of 13.9 million samples collected from several randomized control trials, scaling up previously available datasets by a healthy 210x factor. We provide details on the data collection and perform sanity checks to validate the use of this data for causal inference tasks. First, we formalize the task of uplift modeling (UM) that can be performed with this data, along with the relevant evaluation metrics. Then, we propose synthetic response surfaces and heterogeneous treatment assignment providing a general set-up for ITE prediction. Finally, we report experiments to validate key characteristics of the dataset leveraging its size to evaluate and compare - with high statistical significance - a selection of baseline UM and ITE prediction methods.

Domaines

Intelligence artificielle [cs.AI]

Eustache Diemert : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03439178

Soumis le : lundi 22 novembre 2021-10:31:57

Dernière modification le : jeudi 14 mars 2024-14:42:50

Dates et versions

hal-03439178 , version 1 (22-11-2021)

Identifiants

HAL Id : hal-03439178 , version 1
ARXIV : 2111.10106

Citer

Eustache Diemert, Artem Betlei, Christophe Renaudin, Massih-Reza Amini, Théophane Gregoir, et al.. A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling. 2021. ⟨hal-03439178⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

25 Consultations

0 Téléchargements

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Altmetric

Partager