A penalized bandit algorithm

Damien Lamberton; Gilles Pagès

doi:10.1214/EJP.v13-489

Journal article, Electronic Journal of Probability, Year: 2008


Damien Lamberton

Role: Author
PersonId : 10233
IdHAL : damien-lamberton
ORCID : 0009-0003-9817-2424
IdRef : 032457308

Laboratoire d'Analyse et de Mathématiques Appliquées

Gilles Pagès

Role: Author
PersonId : 8458
IdHAL : gilles-pages
ORCID : 0000-0001-6487-3079
IdRef : 030737605

Laboratoire de Probabilités et Modèles Aléatoires

Abstract

We study a two-armed bandit algorithm with penalty. We show that the algorithm converges and establish its rate of convergence. For some choices of the parameters, we obtain a central limit theorem in which the limit distribution is characterized as the unique stationary distribution of a discontinuous Markov process.

Keywords

Two-armed bandit algorithm; stochastic approximation; learning automata; asset allocation

Subject areas

Probability [math.PR]

Main file

PenalBandit.pdf (293.93 KB)

Contributor: Gilles Pagès

https://hal.science/hal-00012187

Submitted on: Tuesday, October 18, 2005, 17:57:56

Last modified on: Thursday, March 14, 2024, 03:08:17

Long-term archiving on: Thursday, April 1, 2010, 22:47:25

Dates and versions

hal-00012187 , version 1 (18-10-2005)

Identifiers

HAL Id : hal-00012187 , version 1
ARXIV : math.PR/0510384
DOI : 10.1214/EJP.v13-489

Cite

Damien Lamberton, Gilles Pagès. A penalized bandit algorithm. Electronic Journal of Probability, 2008, 13, 341-373 ; http://dx.doi.org/10.1214/EJP.v13-489. ⟨10.1214/EJP.v13-489⟩. ⟨hal-00012187⟩


Collections

UNIV-PARIS7 UPMC PMA CNRS UNIV-MLV LAMA_UMR8050 LAMA_PS UPEC LPSM SORBONNE-UNIVERSITE SU-SCIENCES UNIV-EIFFEL

110 views

224 downloads
