Online allocation and homogeneous partitioning for piecewise constant mean-approximation

Odalric-Ambrym Maillard; Alexandra Carpentier

Pré-Publication, Document De Travail Année : 2012

Online allocation and homogeneous partitioning for piecewise constant mean-approximation

(1) , (2)

1
2

Odalric-Ambrym Maillard

Fonction : Auteur
PersonId : 5563
IdHAL : odalric-ambrym-maillard
ORCID : 0000-0001-7935-7026
IdRef : 158055594

Montanuniversität Leoben

Alexandra Carpentier

Fonction : Auteur

Statistical Laboratory [Cambridge]

Résumé

In the setting of active learning for the multi-armed bandit, where the goal of a learner is to estimate with equal precision the mean of a finite number of arms, recent results show that it is possible to derive strategies based on finite-time confidence bounds that are competitive with the best possible strategy. We here consider an extension of this problem to the case when the arms are the cells of a finite partition P of a continuous sampling space X \subset \Real^d. Our goal is now to build a piecewise constant approximation of a noisy function (where each piece is one region of P and P is fixed beforehand) in order to maintain the local quadratic error of approximation on each cell equally low. Although this extension is not trivial, we show that a simple algorithm based on upper confidence bounds can be proved to be adaptive to the function itself in a near-optimal way, when |P| is chosen to be of minimax-optimal order on the class of \alpha-Hölder functions.

Mots clés

Multi-armed bandits Active sampling Histograms

Domaines

Statistiques [math.ST] Théorie [stat.TH] Apprentissage [cs.LG] Machine Learning [stat.ML]

Fichier principal

nips965supplementary.pdf (358.13 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Odalric-Ambrym Maillard : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00742893

Soumis le : jeudi 1 novembre 2012-07:00:35

Dernière modification le : vendredi 18 septembre 2020-18:06:02

Archivage à long terme le : samedi 17 décembre 2016-01:58:32

Dates et versions

hal-00742893 , version 1 (01-11-2012)

Identifiants

HAL Id : hal-00742893 , version 1

Citer

Odalric-Ambrym Maillard, Alexandra Carpentier. Online allocation and homogeneous partitioning for piecewise constant mean-approximation. 2012. ⟨hal-00742893⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSMI

164 Consultations

40 Téléchargements

Online allocation and homogeneous partitioning for piecewise constant mean-approximation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager