Improved second-order bounds for prediction with expert advice

Nicolo Cesa-Bianchi; Yishay Mansour; Gilles Stoltz

Conference paper, Year: 2005

Nicolo Cesa-Bianchi

Role: Author

Dipartimento di Scienze dell'Informazione [Milano]

Yishay Mansour

Role: Author

School of Computer Science

Gilles Stoltz

Role: Author
PersonId : 738739
IdHAL : gilles-stoltz
ORCID : 0000-0003-1240-1007
IdRef : 091575419

Département de Mathématiques et Applications - ENS Paris

Abstract

This work studies external regret in sequential prediction games with arbitrary payoffs (non-negative or non-positive). External regret measures the difference between the payoff obtained by the forecasting strategy and the payoff of the best action. We focus on two important parameters: $M$, the largest absolute value of any payoff, and $Q^*$, the sum of squared payoffs of the best action. Given these parameters, we first derive a simple and new forecasting strategy with regret at most of order $\sqrt{Q^* \ln N} + M \ln N$, where $N$ is the number of actions. We extend the results to the case where the parameters are unknown and derive similar bounds. We then devise a refined analysis of the weighted majority forecaster, which yields bounds of the same flavour. The proof techniques we develop are finally applied to the adversarial multi-armed bandit setting, and we prove bounds on the performance of an online algorithm in the case where there is no lower bound on the probability of each action.
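To make the quantities named in the abstract concrete, here is a minimal Python sketch of an exponentially weighted (weighted-majority style) forecaster run on a payoff matrix. It reports the external regret together with $M$, $Q^*$ and the order term $\sqrt{Q^* \ln N} + M \ln N$. The function names, the learning rate `eta`, and the tie-breaking rule for the best action are assumptions made for illustration; this is not the paper's new strategy nor its tuning, and the order term omits any constants.

```python
import math


def exp_weights_regret(payoffs, eta):
    """Run an exponentially weighted forecaster (illustrative sketch).

    payoffs[t][i] is the payoff of action i at round t; it may be negative.
    eta is a hypothetical, user-chosen learning rate.
    Returns a dict with the forecaster's expected payoff, the best action's
    payoff, the external regret, Q_star and M.
    """
    n_actions = len(payoffs[0])
    cumulative = [0.0] * n_actions  # cumulative payoff of each action
    forecaster_payoff = 0.0

    for round_payoffs in payoffs:
        # Weights proportional to exp(eta * cumulative payoff); the maximum
        # is subtracted before exponentiating for numerical stability.
        top = max(cumulative)
        weights = [math.exp(eta * (c - top)) for c in cumulative]
        total = sum(weights)
        probs = [w / total for w in weights]

        # Expected payoff of the randomized forecaster in this round.
        forecaster_payoff += sum(p * x for p, x in zip(probs, round_payoffs))

        for i, x in enumerate(round_payoffs):
            cumulative[i] += x

    # Best action in hindsight (ties broken by lowest index: an assumption).
    best = max(range(n_actions), key=lambda i: cumulative[i])
    return {
        "forecaster_payoff": forecaster_payoff,
        "best_payoff": cumulative[best],
        "regret": cumulative[best] - forecaster_payoff,
        "q_star": sum(row[best] ** 2 for row in payoffs),
        "m": max(abs(x) for row in payoffs for x in row),
    }


def second_order_term(q_star, m, n_actions):
    """Order term sqrt(Q* ln N) + M ln N from the abstract, without constants."""
    log_n = math.log(n_actions)
    return math.sqrt(q_star * log_n) + m * log_n


if __name__ == "__main__":
    # Two actions over ten rounds: action 0 always pays +1, action 1 pays -1.
    demo = [[1.0, -1.0] for _ in range(10)]
    stats = exp_weights_regret(demo, eta=0.5)
    print(stats["regret"], second_order_term(stats["q_star"], stats["m"], 2))
```

Comparing the printed regret with the order term on payoff sequences of your own gives a rough feel for how a bound in terms of $Q^*$ and $M$ behaves, as opposed to one that depends only on the number of rounds.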

Subject areas

Statistics [math.ST]; Machine Learning [cs.LG]

https://hal.science/hal-00007539

Submitted on: Friday, 15 July 2005, 17:09:16

Last modified on: Friday, 19 April 2024, 16:18:54

Dates and versions

hal-00007539 , version 1 (15-07-2005)

Identifiers

HAL Id : hal-00007539 , version 1

Cite

Nicolo Cesa-Bianchi, Yishay Mansour, Gilles Stoltz. Improved second-order bounds for prediction with expert advice. 2005, pp.217-232. ⟨hal-00007539⟩

Collections

ENS-PARIS CNRS PSL MATH_ENS_PARIS

51 views

0 downloads
