Strategies for prediction under imperfect monitoring

Gabor Lugosi; Shie Mannor; Gilles Stoltz

Article Dans Une Revue Mathematics of Operations Research Année : 2008

Strategies for prediction under imperfect monitoring

(1) , (2) , (3, 4)

1
2
3
4

Gabor Lugosi

Fonction : Auteur
PersonId : 832564

Institució Catalana de Recerca i Estudis Avançats = Catalan Institution for Research and Advanced Studies

Shie Mannor

Fonction : Auteur
PersonId : 837619

McGill University = Université McGill [Montréal, Canada]

Gilles Stoltz

Fonction : Auteur correspondant
PersonId : 738739
IdHAL : gilles-stoltz
ORCID : 0000-0003-1240-1007
IdRef : 091575419

Connectez-vous pour contacter l'auteur

Département de Mathématiques et Applications - ENS Paris

Groupement de Recherche et d'Etudes en Gestion à HEC

Résumé

We propose simple randomized strategies for sequential prediction under imperfect monitoring, that is, when the forecaster does not have access to the past outcomes but rather to a feedback signal. The proposed strategies are consistent in the sense that they achieve, asymptotically, the best possible average reward. It was Rustichini (1999) who first proved the existence of such consistent predictors. The forecasters presented here offer the first constructive proof of consistency. Moreover, the proposed algorithms are computationally efficient. We also establish upper bounds for the rates of convergence. In the case of deterministic feedback, these rates are optimal up to logarithmic terms.

Mots clés

individual sequences repeated games with partial monitoring approachability

Domaines

Statistiques [math.ST] Théorie [stat.TH] Apprentissage [cs.LG]

Fichier principal

LugosiMannorStoltz-Final.pdf (312.47 Ko)

INFORMS.ps (77.26 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Format : Autre

Gilles Stoltz : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00124679

Soumis le : lundi 7 janvier 2008-15:03:46

Dernière modification le : vendredi 19 avril 2024-16:18:54

Archivage à long terme le : vendredi 25 novembre 2016-18:32:14

Dates et versions

hal-00124679 , version 1 (15-01-2007)

hal-00124679 , version 2 (21-04-2007)

hal-00124679 , version 3 (12-07-2007)

hal-00124679 , version 4 (07-01-2008)

Identifiants

HAL Id : hal-00124679 , version 4
ARXIV : math/0701419

Citer

Gabor Lugosi, Shie Mannor, Gilles Stoltz. Strategies for prediction under imperfect monitoring. Mathematics of Operations Research, 2008, à paraître. ⟨hal-00124679v4⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS HEC CNRS PSL MATH_ENS_PARIS

640 Consultations

253 Téléchargements

Strategies for prediction under imperfect monitoring

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager