Online optimization and regret guarantees for non-additive long-term constraints - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2016

Online optimization and regret guarantees for non-additive long-term constraints

Rodolphe Jenatton
  • Fonction : Auteur
Jim Huang
  • Fonction : Auteur
Cedric Archambeau
  • Fonction : Auteur

Résumé

We consider online optimization in the 1-lookahead setting, where the objective does not decompose additively over the rounds of the online game. The resulting formulation enables us to deal with non-stationary and/or long-term constraints , which arise, for example, in online display advertising problems. We propose an on-line primal-dual algorithm for which we obtain dynamic cumulative regret guarantees. They depend on the convexity and the smoothness of the non-additive penalty, as well as terms capturing the smoothness with which the residuals of the non-stationary and long-term constraints vary over the rounds. We conduct experiments on synthetic data to illustrate the benefits of the non-additive penalty and show vanishing regret convergence on live traffic data collected by a display advertising platform in production.
Fichier principal
Vignette du fichier
nonAdditiveLongTermConstraints.pdf (648.88 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01273728 , version 1 (12-02-2016)
hal-01273728 , version 2 (07-06-2016)

Identifiants

Citer

Rodolphe Jenatton, Jim Huang, Dominik Csiba, Cedric Archambeau. Online optimization and regret guarantees for non-additive long-term constraints. 2016. ⟨hal-01273728v2⟩

Collections

TDS-MACS
336 Consultations
275 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More