Hedging under uncertainty: regret minimization meets exponentially fast convergence

Johanne Cohen; Amélie Héliou; Panayotis Mertikopoulos

doi:10.1007/978-3-319-66700-3_20

Conference paper. Year: 2017

Johanne Cohen

Role: Author
PersonId : 7467
IdHAL : johanne-cohen
ORCID : 0000-0002-9548-5260
IdRef : 145022269

Laboratoire de Recherche en Informatique

Amélie Héliou

Role: Author
PersonId : 14758
IdHAL : amelie-heliou

Algorithms and Models for Integrative BIOlogy

Panayotis Mertikopoulos

Role: Author
PersonId : 1933
IdHAL : mertikop
ORCID : 0000-0003-2026-9616
IdRef : 253119758

Performance analysis and optimization of LARge Infrastructures and Systems

Abstract

This paper examines the problem of multi-agent learning in N-person non-cooperative games. For concreteness, we focus on the so-called “hedge” variant of the exponential weights (EW) algorithm, one of the most widely studied algorithmic schemes for regret minimization in online learning. In this multi-agent context, we show that a) dominated strategies become extinct (a.s.); and b) in generic games, pure Nash equilibria are attracting with high probability, even in the presence of uncertainty and noise of arbitrarily high variance. Moreover, if the algorithm’s step-size does not decay too fast, we show that these properties occur at a quasi-exponential rate – that is, much faster than the algorithm’s O(1/\sqrt{T}) worst-case regret guarantee would suggest.
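To make the setting concrete, here is a minimal sketch of two players each running a hedge (exponential weights) update in a Prisoner's Dilemma. It is an illustration under stated assumptions, not the paper's exact algorithm: the function name `run_hedge`, the payoff matrix, the Gaussian noise model, the feedback model (each player sees a noisy payoff for every one of its actions) and the step-size schedule `1 / t**beta` are all choices made for this example.

```python
import numpy as np


def softmax(scores):
    """Map cumulative scores to a mixed strategy (exponential weights)."""
    shifted = scores - scores.max()  # shift for numerical stability
    weights = np.exp(shifted)
    return weights / weights.sum()


def run_hedge(A, B, n_rounds=2000, noise_std=1.0, beta=0.5, seed=0):
    """Two-player hedge with noisy payoff observations (illustrative sketch).

    A[i, j] is player 1's payoff and B[i, j] is player 2's payoff when
    player 1 plays i and player 2 plays j.
    """
    rng = np.random.default_rng(seed)
    y1 = np.zeros(A.shape[0])  # cumulative scores of player 1
    y2 = np.zeros(A.shape[1])  # cumulative scores of player 2
    for t in range(1, n_rounds + 1):
        x1, x2 = softmax(y1), softmax(y2)
        a1 = rng.choice(len(x1), p=x1)  # sampled action of player 1
        a2 = rng.choice(len(x2), p=x2)  # sampled action of player 2
        # Assumed feedback: payoff of every own action against the
        # opponent's realized action, corrupted by zero-mean noise.
        v1 = A[:, a2] + noise_std * rng.standard_normal(len(x1))
        v2 = B[a1, :] + noise_std * rng.standard_normal(len(x2))
        gamma = 1.0 / t**beta  # assumed slowly decaying step size
        y1 += gamma * v1
        y2 += gamma * v2
    return softmax(y1), softmax(y2)


# Prisoner's Dilemma: action 0 = cooperate, action 1 = defect.
# Defecting strictly dominates cooperating for both players.
A = np.array([[3.0, 0.0], [5.0, 1.0]])
x1, x2 = run_hedge(A, A.T)
print("player 1 mixed strategy:", x1)
print("player 2 mixed strategy:", x2)
```

In this toy game the dominated action (cooperate) should end up with negligible probability for both players, and play should concentrate on the strict pure Nash equilibrium (defect, defect), which is the qualitative behavior the abstract describes. A single seeded run is only an illustration and says nothing about the rates proved in the paper.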

Subject areas

Optimization and Control [math.OC]

https://hal.science/hal-01382290

Submitted on: Sunday, 16 October 2016, 15:29:10

Last modified on: Friday, 5 April 2024, 03:09:33

Dates and versions

hal-01382290 , version 1 (16-10-2016)

Identifiers

HAL Id : hal-01382290 , version 1
ARXIV : 1607.08863
DOI : 10.1007/978-3-319-66700-3_20

Cite

Johanne Cohen, Amélie Héliou, Panayotis Mertikopoulos. Hedging under uncertainty: regret minimization meets exponentially fast convergence. Symposium on Algorithmic Game Theory (SAGT) 2017, Sep 2017, L'Aquila, Italy. ⟨10.1007/978-3-319-66700-3_20⟩. ⟨hal-01382290⟩

Collections

X UGA CNRS INRIA LIX LIG X-LIX X-DEP-INFO LIG_SRCPR UMR8623 CENTRALESUPELEC LRI-GALAC INRIA2 TDS-MACS LIG-SRCPR-POLARIS UNIV-PARIS-SACLAY ANR GS-COMPUTER-SCIENCE LIG_SIDCH
