A penalized bandit algorithm - Archive ouverte HAL Access content directly
Journal Articles Electronic Journal of Probability Year : 2008

A penalized bandit algorithm

Abstract

We study a two armed-bandit algorithm with penalty. We show the convergence of the algorithm and establish the rate of convergence. For some choices of the parameters, we obtain a central limit theorem in which the limit distribution is characterized as the unique stationary distribution of a discontinuous Markov process.
Fichier principal
Vignette du fichier
PenalBandit.pdf (293.93 Ko) Télécharger le fichier
Loading...

Dates and versions

hal-00012187 , version 1 (18-10-2005)

Identifiers

Cite

Damien Lamberton, Gilles Pagès. A penalized bandit algorithm. Electronic Journal of Probability, 2008, 13, 341-373 ; http://dx.doi.org/10.1214/EJP.v13-489. ⟨10.1214/EJP.v13-489⟩. ⟨hal-00012187⟩
110 View
224 Download

Altmetric

Share

Gmail Facebook X LinkedIn More