Skip to Main content Skip to Navigation
Conference papers

A Large Scale Benchmark for Uplift Modeling

Abstract : Uplift modeling is an important yet novel area of research in machine learning which aims to explain and to estimate the causal impact of a treatment at the individual level. In the digital advertising industry, the treatment is exposure to different ads and uplift modeling is used to direct marketing efforts towards users for whom it is the most efficient [1]. To foster research in this topic we release a publicly available collection of 25 million samples from a randomized control trial, scaling up previously available datasets by a healthy 590x factor. We provide details on the data collection and sanity checks performed that allow the use of this data for counter-factual prediction. We formalize the task of uplift prediction that could be performed with this data, along with the relevant evaluation metrics. Finally we show that the dataset size makes it now possible to reach statistical significance when evaluating baseline methods on the most challenging target.
Document type :
Conference papers
Complete list of metadata

Cited literature [22 references]  Display  Hide  Download
Contributor : Eustache Diemert <>
Submitted on : Monday, March 23, 2020 - 4:18:05 PM
Last modification on : Tuesday, May 11, 2021 - 11:36:38 AM


Files produced by the author(s)





Eustache Diemert, Artem Betlei, Christophe Renaudin, Massih-Reza Amini. A Large Scale Benchmark for Uplift Modeling. KDD, 2018, London, United Kingdom. ⟨10.1145/nnnnnnn.nnnnnnn⟩. ⟨hal-02515860⟩



Record views


Files downloads