Mirror Descent Meets Fixed Share (and feels no regret)

Abstract : Mirror descent with an entropic regularizer is known to achieve shifting regret bounds that are logarithmic in the dimension. This is done using either a carefully designed projection or by a weight sharing technique. Via a novel unified analysis, we show that these two approaches deliver essentially equivalent bounds on a notion of regret generalizing shifting, adaptive, discounted, and other related regrets. Our analysis also captures and extends the generalized weight sharing technique of Bousquet and Warmuth, and can be refined in several ways, including improvements for small losses and adaptive tuning of parameters.
Type de document :
Communication dans un congrès
NIPS 2012, Dec 2012, Lake Tahoe, United States. 25, Paper 471, 2012
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00670514
Contributeur : Gilles Stoltz <>
Soumis le : jeudi 27 septembre 2012 - 14:51:59
Dernière modification le : vendredi 25 mai 2018 - 12:02:06
Document(s) archivé(s) le : vendredi 28 décembre 2012 - 06:20:08

Fichiers

CBGaLuSt-FShare--NIPS.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00670514, version 2
  • ARXIV : 1202.3323

Collections

Citation

Nicolò Cesa-Bianchi, Pierre Gaillard, Gabor Lugosi, Gilles Stoltz. Mirror Descent Meets Fixed Share (and feels no regret). NIPS 2012, Dec 2012, Lake Tahoe, United States. 25, Paper 471, 2012. 〈hal-00670514v2〉

Partager

Métriques

Consultations de la notice

1031

Téléchargements de fichiers

313