Online optimization and regret guarantees for non-additive long-term constraints

Rodolphe Jenatton; Jim Huang; Dominik Csiba; Cedric Archambeau

Preprint, Working Paper. Year: 2016

Rodolphe Jenatton

Role: Author

Amazon, Berlin

Jim Huang

Role: Author

Amazon, Seattle

Dominik Csiba

Role: Author
PersonId: 983177

School of Mathematics - University of Edinburgh

Cedric Archambeau

Role: Author

Amazon, Berlin

Abstract

We consider online optimization in the 1-lookahead setting, where the objective does not decompose additively over the rounds of the online game. The resulting formulation enables us to deal with non-stationary and/or long-term constraints, which arise, for example, in online display advertising problems. We propose an online primal-dual algorithm for which we obtain dynamic cumulative regret guarantees. These guarantees depend on the convexity and the smoothness of the non-additive penalty, as well as on terms capturing the smoothness with which the residuals of the non-stationary and long-term constraints vary over the rounds. We conduct experiments on synthetic data to illustrate the benefits of the non-additive penalty, and we show vanishing regret on live traffic data collected by a display advertising platform in production.

Keywords

online learning; dynamic regret; convex optimization

Domains

Machine Learning [stat.ML]; Optimization and Control [math.OC]; Statistics [math.ST]

Main file

nonAdditiveLongTermConstraints.pdf (648.88 KB)

Origin: Files produced by the author(s)

Contributor: Rodolphe Jenatton

https://hal.science/hal-01273728

Submitted on: Tuesday, June 7, 2016, 21:26:50

Last modified on: Thursday, March 14, 2024, 03:11:26

Dates and versions

hal-01273728, version 1 (February 12, 2016)

hal-01273728, version 2 (June 7, 2016)

Identifiers

HAL Id: hal-01273728, version 2
arXiv: 1602.05394

Cite

Rodolphe Jenatton, Jim Huang, Dominik Csiba, Cedric Archambeau. Online optimization and regret guarantees for non-additive long-term constraints. 2016. ⟨hal-01273728v2⟩

Collections

TDS-MACS

336 views

275 downloads
