Approachability in unknown games: Online learning meets multi-objective optimization

Shie Mannor; Vianney Perchet; Gilles Stoltz

Preprint, Working Paper. Year: 2014

Shie Mannor

Role: Author
PersonId: 837619

Department of Electrical Engineering - Technion [Haïfa]

Vianney Perchet

Role: Author
PersonId: 871940

Laboratoire de Probabilités et Modèles Aléatoires

Gilles Stoltz

Role: Author
PersonId: 738739
IdHAL: gilles-stoltz
ORCID: 0000-0003-1240-1007
IdRef: 091575419

Groupement de Recherche et d'Etudes en Gestion à HEC

Abstract

In the standard setting of approachability there are two players and a target set. The players play a repeated vector-valued game in which one of them wants the average vector-valued payoff to converge to the target set, while the other tries to keep it away from that set. We revisit this classical setting and consider the case where the player has a preference relation between target sets: she wishes to approach the smallest ("best") set possible given the observed average payoffs in hindsight. Moreover, as opposed to previous works on approachability, and in the spirit of online learning, we do not assume that there is a known game structure with actions for two players. Rather, the player receives an arbitrary vector-valued reward vector at every round. We show that it is impossible, in general, to approach the best target set in hindsight. We further propose a concrete strategy that approaches a non-trivial relaxation of the best-in-hindsight target set given the actual rewards. Our approach does not require projection onto a target set and amounts to switching between scalar regret minimization algorithms that are performed in episodes.
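
To make the phrase "scalar regret minimization algorithms that are performed in episodes" more concrete, the following is a minimal illustrative sketch and not the strategy of the paper. It assumes a finite action set, a single target point instead of a target set, a fixed episode length, and exponential weights (Hedge) as the scalar regret minimizer; the names `hedge_weights`, `run_episodic`, `target`, `episode_len`, and `eta`, as well as the rule for choosing a new direction at each episode, are all hypothetical choices made for illustration.

```python
import math


def hedge_weights(cum_gains, eta):
    """Exponential-weights distribution over actions from cumulative scalar gains."""
    m = max(cum_gains)  # subtract the max for numerical stability
    w = [math.exp(eta * (g - m)) for g in cum_gains]
    total = sum(w)
    return [x / total for x in w]


def run_episodic(rewards, target, episode_len, eta=0.5):
    """Run Hedge on scalarized rewards, restarting it at every episode.

    rewards[t][a] is the reward vector of action a at round t (assumed layout).
    target is a single point standing in for a target set (a simplification).
    Returns the average expected vector payoff over all rounds.
    """
    n_actions = len(rewards[0])
    dim = len(target)
    avg = [0.0] * dim
    direction = [0.0] * dim
    cum_gains = [0.0] * n_actions
    for t, round_rewards in enumerate(rewards):
        if t % episode_len == 0:
            # New episode: point from the current average payoff toward the
            # target, normalize, and restart the scalar regret minimizer.
            diff = [c - a for c, a in zip(target, avg)]
            norm = math.sqrt(sum(d * d for d in diff))
            direction = [d / norm for d in diff] if norm > 0 else [0.0] * dim
            cum_gains = [0.0] * n_actions
        p = hedge_weights(cum_gains, eta)
        # Expected vector payoff of the mixed action p at this round.
        payoff = [
            sum(p[a] * round_rewards[a][i] for a in range(n_actions))
            for i in range(dim)
        ]
        avg = [(t * m + x) / (t + 1) for m, x in zip(avg, payoff)]
        # Scalarize each action's reward vector along the episode's direction.
        for a in range(n_actions):
            cum_gains[a] += sum(d * r for d, r in zip(direction, round_rewards[a]))
    return avg
```

Calling `run_episodic(rewards, target=[1.0, 0.0], episode_len=20)` on a reward sequence where one action always yields `[1.0, 0.0]` and the other `[0.0, 1.0]` drives the average payoff toward the first coordinate. Each restart costs a few exploratory rounds, which is why the episode length matters in a scheme of this kind.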

Keywords

Online learning; multi-objective optimization; approachability

Domains

Machine Learning [stat.ML]; Statistics [math.ST]; Theory [stat.TH]; Machine Learning [cs.LG]

Main file

MOL.pdf (361.99 KB)

Origin: Files produced by the author(s)

Contributor: Gilles Stoltz

https://hal.science/hal-00943664

Submitted on: Saturday, February 8, 2014, 10:22:44

Last modified on: Friday, March 24, 2023, 14:52:58

Long-term archiving on: Monday, May 12, 2014, 12:31:39

Dates and versions

hal-00943664, version 1 (08-02-2014)

hal-00943664, version 2 (16-06-2016)

Identifiers

HAL Id: hal-00943664, version 1
arXiv: 1402.2043

Cite

Shie Mannor, Vianney Perchet, Gilles Stoltz. Approachability in unknown games: Online learning meets multi-objective optimization. 2014. ⟨hal-00943664v1⟩

Collections

UNIV-PARIS7 UPMC

389 views

312 downloads
