General risk measures for robust machine learning

Emilie Chouzenoux; Henri Gérard; Jean-Christophe Pesquet

doi:10.3934/fods.2019011

Journal article, Foundations of Data Science, Year : 2019

Emilie Chouzenoux

Function : Author
PersonId : 10209
IdHAL : emilie-chouzenoux
ORCID : 0000-0003-3631-6093
IdRef : 192528572

OPtimisation Imagerie et Santé

Henri Gérard

Function : Author
PersonId : 16181
IdHAL : henri-gerard
ORCID : 0000-0001-6274-8290

Centre d'Enseignement et de Recherche en Mathématiques et Calcul Scientifique

Jean-Christophe Pesquet

Function : Author
PersonId : 8124
IdHAL : jean-christophe-pesquet
IdRef : 122058577

OPtimisation Imagerie et Santé

Abstract

A wide array of machine learning problems are formulated as the minimization of the expectation of a convex loss function over some parameter space. Since the probability distribution of the data of interest is usually unknown, it is often estimated from training sets, which may lead to poor out-of-sample performance. In this work, we bring new insights into this problem by using the framework developed in quantitative finance for risk measures. We show that the original min-max problem can be recast as a convex minimization problem under suitable assumptions. We discuss several important examples of robust formulations, in particular by defining ambiguity sets based on $\varphi$-divergences and the Wasserstein metric. We also propose an efficient algorithm for solving the corresponding convex optimization problems involving complex convex constraints. Through simulation examples, we demonstrate that this algorithm scales well on real data sets.
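As a rough illustration of how a min-max robust problem can turn into a convex minimization, the sketch below treats one special case only: an ambiguity set defined by the Kullback-Leibler divergence (one particular $\varphi$-divergence) around the empirical distribution, with the losses evaluated at a fixed parameter. It relies on the classical dual formula $\sup_{Q:\,\mathrm{KL}(Q\|P)\le\varepsilon} \mathbb{E}_Q[\ell] = \inf_{\lambda>0}\, \lambda\varepsilon + \lambda \log \mathbb{E}_P[e^{\ell/\lambda}]$. This is not the algorithm proposed in the article; the function names (`kl_robust_risk`, `worst_case_weights`), the search bounds, and the toy losses are all assumptions made for the example.

```python
import math

def log_mean_exp(values):
    """Numerically stable log of the average of exp(v) over the list."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))

def kl_dual_objective(losses, eps, lam):
    """Dual objective lam*eps + lam*log(mean(exp(loss/lam))), for lam > 0."""
    return lam * eps + lam * log_mean_exp([l / lam for l in losses])

def kl_robust_risk(losses, eps, lam_min=1e-6, lam_max=1e6, iters=200):
    """Approximate the worst-case expected loss over the KL ball of radius eps
    around the empirical distribution of `losses`.

    The dual objective is convex in lam, hence unimodal in log(lam), so a
    golden-section search on log(lam) is enough. The bounds lam_min and
    lam_max are arbitrary choices made for this sketch.
    """
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log(lam_min), math.log(lam_max)

    def f(u):
        return kl_dual_objective(losses, eps, math.exp(u))

    c, d = b - ratio * (b - a), a + ratio * (b - a)
    fc, fd = f(c), f(d)
    for _ in range(iters):
        if fc < fd:
            # Minimizer lies in [a, d]: shrink from the right.
            b, d, fd = d, c, fc
            c = b - ratio * (b - a)
            fc = f(c)
        else:
            # Minimizer lies in [c, b]: shrink from the left.
            a, c, fc = c, d, fd
            d = a + ratio * (b - a)
            fd = f(d)
    lam = math.exp(0.5 * (a + b))
    return kl_dual_objective(losses, eps, lam), lam

def worst_case_weights(losses, lam):
    """Exponentially tilted weights q_i proportional to exp(loss_i / lam)."""
    m = max(l / lam for l in losses)
    w = [math.exp(l / lam - m) for l in losses]
    total = sum(w)
    return [x / total for x in w]

if __name__ == "__main__":
    toy_losses = [1.0, 2.0, 3.0, 4.0]  # hypothetical per-sample losses
    for radius in (0.0, 0.1, 0.5):
        risk, lam = kl_robust_risk(toy_losses, radius)
        print(f"eps={radius}: robust risk {risk:.4f} at lam {lam:.4g}")
```

In this sketch a radius of zero gives back the empirical mean of the losses, while a large enough radius approaches the largest observed loss. In a full learning problem the losses would depend on the model parameters, and the minimization would run jointly over those parameters and `lam`.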

Keywords

robust statistics; risk measures; machine learning; convex optimization; divergences; Wasserstein distance

Domains

Optimization and Control [math.OC]; Machine Learning [stat.ML]

Main file

article.pdf (940.07 KB)

Origin : Files produced by the author(s)

https://hal.science/hal-02109418

Submitted on : Thursday, April 25, 2019, 2:27:32 PM

Last modification on : Thursday, May 16, 2024, 3:00:04 PM

Dates and versions

hal-02109418 , version 1 (25-04-2019)

Identifiers

HAL Id : hal-02109418 , version 1
ARXIV : 1904.11707
DOI : 10.3934/fods.2019011

Cite

Emilie Chouzenoux, Henri Gérard, Jean-Christophe Pesquet. General risk measures for robust machine learning. Foundations of Data Science, 2019, 1 (3), pp.249-269. ⟨10.3934/fods.2019011⟩. ⟨hal-02109418⟩

Collections

ENPC INRIA CERMICS PARISTECH CVN CENTRALESUPELEC INRIA2 TDS-MACS UNIV-PARIS-SACLAY GS-ENGINEERING GS-COMPUTER-SCIENCE
