Exploiting easy data in online optimization

Amir Sani; Gergely Neu; Alessandro Lazaric

Communication Dans Un Congrès Année : 2014

Exploiting easy data in online optimization

(1) , (1) , (1)

Amir Sani

Fonction : Auteur
PersonId : 8209
IdHAL : amirsani
IdRef : 188701648

Sequential Learning

Gergely Neu

Fonction : Auteur
PersonId : 961171

Sequential Learning

Alessandro Lazaric

Fonction : Auteur
PersonId : 851
IdHAL : alessandro-lazaric
ORCID : 0000-0002-8970-413X
IdRef : 188701486

Sequential Learning

Résumé

We consider the problem of online optimization, where a learner chooses a decision from a given decision set and suffers some loss associated with the decision and the state of the environment. The learner's objective is to minimize its cumulative regret against the best fixed decision in hindsight. Over the past few decades numerous variants have been considered, with many algorithms designed to achieve sub-linear regret in the worst case. However, this level of robustness comes at a cost. Proposed algorithms are often over-conservative, failing to adapt to the actual complexity of the loss sequence which is often far from the worst case. In this paper we introduce a general algorithm that, provided with a "safe" learning algorithm and an opportunistic "benchmark", can effectively combine good worst-case guarantees with much improved performance on "easy" data. We derive general theoretical bounds on the regret of the proposed algorithm and discuss its implementation in a wide range of applications, notably in the problem of learning with shifting experts (a recent COLT open problem). Finally, we provide numerical simulations in the setting of prediction with expert advice with comparisons to the state of the art.

Domaines

Apprentissage [cs.LG]

Fichier principal

SNL14.pdf (426.85 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Gergely Neu : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01079428

Soumis le : samedi 1 novembre 2014-22:59:51

Dernière modification le : vendredi 24 mars 2023-14:52:59

Archivage à long terme le : lundi 2 février 2015-17:01:06

Dates et versions

hal-01079428 , version 1 (01-11-2014)

Identifiants

HAL Id : hal-01079428 , version 1

Citer

Amir Sani, Gergely Neu, Alessandro Lazaric. Exploiting easy data in online optimization. Advances in Neural Information Processing 27, Dec 2014, Montreal, Canada. ⟨hal-01079428⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS CRISTAL INRIA2 CRISTAL-SEQUEL

199 Consultations

121 Téléchargements

Exploiting easy data in online optimization

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager