Improved Algorithms for Linear Stochastic Bandits, Advances in Neural Information Processing Systems, 2011. ,
Thompson Sampling for Contextual Bandits with Linear Payoffs, International Conference on Machine Learning (ICML), 2013. ,
Active Learning in Multi-armed Bandits, Algorithmic Learning Theory, 2008. ,
DOI : 10.1007/978-3-540-87987-9_25
The minimum description length principle in coding and modeling. Information Theory, IEEE Transactions on, vol.44, issue.6, pp.2743-2760, 1998. ,
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems, Machine Learning, pp.1-122, 2012. ,
DOI : 10.1561/2200000024
X-armed bandits, Journal of Machine Learning Research, vol.12, pp.1587-1627, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00450235
Kullback???Leibler upper confidence bounds for optimal sequential allocation, The Annals of Statistics, vol.41, issue.3, pp.1516-1541, 2013. ,
DOI : 10.1214/13-AOS1119SUPP
A minimum description length approach to hidden Markov models with Poisson and Gaussian emissions. Application to order identification, Journal of Statistical Planning and Inference, vol.139, issue.3, pp.962-977, 2009. ,
DOI : 10.1016/j.jspi.2008.06.010
Sequential Design of Experiments, The Annals of Mathematical Statistics, vol.30, issue.3, pp.755-770, 1959. ,
DOI : 10.1214/aoms/1177706205
Unimodal Bandits without Smoothness, 2014. ,
Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems, Journal of Machine Learning Research, vol.7, pp.1079-1105, 2006. ,
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence, Advances in Neural Information Processing Systems, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00747005
Consistency of the Unlimited BIC Context Tree Estimator, IEEE Transactions on Information Theory, vol.52, issue.10, pp.4630-4635, 2006. ,
DOI : 10.1109/TIT.2006.881742
Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains, SIAM Journal on Control and Optimization, vol.35, issue.3, pp.715-743, 1997. ,
DOI : 10.1137/S0363012994275440
The Minimum Description Length Principle (Adaptive Computation and Machine Learning), 2007. ,
UCB: an Optimal Exploration Algorithm for Multi-Armed Bandits, Proceedings of the 27th Conference on Learning Theory, 2014. ,
PAC subset selection in stochastic multiarmed bandits, International Conference on Machine Learning (ICML), 2012. ,
Information complexity in bandit subset selection, Proceeding of the 26th Conference On Learning Theory, 2013. ,
On the Complexity of A/B Testing, Proceedings of the 27th Conference On Learning Theory, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00990254
On the Complexity of Best Arm Identification in Multi- Armed Bandit Models, Journal of Machine Learning Research, p.2015 ,
URL : https://hal.archives-ouvertes.fr/hal-01024894
The performance of universal encoding, IEEE Transactions on Information Theory, vol.27, issue.2, pp.199-206, 1981. ,
Asymptotically efficient adaptive allocation rules, Advances in Applied Mathematics, vol.6, issue.1, pp.4-22, 1985. ,
DOI : 10.1016/0196-8858(85)90002-8
Lipschitz Bandits: Regret lower bounds and optimal algorithms, Proceedings on the 27th Conference On Learning Theory, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01092791
The Sample Complexity of Exploration in the Multi-Armed Bandit Problem, Journal of Machine Learning Research, pp.623-648, 2004. ,
From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning, Machine Learning, 2014. ,
DOI : 10.1561/2200000038
URL : https://hal.archives-ouvertes.fr/hal-00747575
Modeling by shortest data description, Automatica, vol.14, issue.5, pp.465-471, 1978. ,
DOI : 10.1016/0005-1098(78)90005-5
Gaussian Process Optimization in the Bandit Setting : No Regret and Experimental Design, Proceedings of the International Conference on Machine Learning, 2010. ,
Learning to detect an oddball target, 2015. ,
The context tree weighting method: Basic properties, IEEE Transactions on Information Theory, vol.41, pp.653-664, 1995. ,