Online learning and Blackwell approachability with partial monitoring: optimal convergence rates

Joon Kwon; Vianney Perchet

Communication Dans Un Congrès Année : 2017

Online learning and Blackwell approachability with partial monitoring: optimal convergence rates

(1) , (2)

1
2

Joon Kwon

Fonction : Auteur
PersonId : 181898
IdHAL : joon-kwon
ORCID : 0000-0002-3464-9081
IdRef : 197710840

Centre de mathématiques appliquées

Vianney Perchet

Fonction : Auteur

Ecole Normale Supérieure Paris-Saclay

Résumé

Blackwell approachability is an online learning setup generalizing the classical problem of regret minimization by allowing for instance multi-criteria optimization, global (online) optimization of a convex loss, or online linear optimization under some cumulative constraint. We consider partial monitoring where the decision maker does not necessarily observe the outcomes of his decision (unlike the traditional regret/bandit literature). Instead, he receives a random signal correlated to the decision–outcome pair, or only to the outcome. We construct, for the first time, approachability algorithms with convergence rate of order O(T −1/2 ) when the signal is independent of the decision and of order O(T −1/3 ) in the case of general signals. Those rates are optimal in the sense that they cannot be improved without further assumption on the structure of the objectives and/or the signals.

Domaines

Sciences du Vivant [q-bio]

Fichier principal

approachability-partial-optimal_1.pdf (223.11 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Migration ProdInra : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-02734035

Soumis le : mardi 2 juin 2020-13:21:03

Dernière modification le : vendredi 5 août 2022-14:58:08

Archivage à long terme le : mercredi 2 décembre 2020-13:22:51

Dates et versions

hal-02734035 , version 1 (02-06-2020)

Identifiants

HAL Id : hal-02734035 , version 1
PRODINRA : 481342

Citer

Joon Kwon, Vianney Perchet. Online learning and Blackwell approachability with partial monitoring: optimal convergence rates. 20. International Conference on Artificial Intelligence and Statistics (AISTATS), Apr 2017, Fort Lauderdale, United States. ⟨hal-02734035⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-CACHAN INRAE ENS-PARIS-SACLAY

14 Consultations

16 Téléchargements

Online learning and Blackwell approachability with partial monitoring: optimal convergence rates

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager