From bandits to experts: A tale of domination and independence, Neural Information Processing Systems, 2013. ,
Online learning with feedback graphs: Beyond bandits, Conference on Learning Theory, 2015. ,
Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.2/3, pp.235-256, 2002. ,
DOI : 10.1023/A:1013689704352
Adaptive and Self-Confident On-Line Learning Algorithms, Journal of Computer and System Sciences, vol.64, issue.1, pp.48-75, 2002. ,
DOI : 10.1006/jcss.2001.1795
Minimax regret of finite partial-monitoring games in stochastic environments, Conference on Learning Theory, 2011. ,
Partial Monitoring???Classification, Regret Bounds, and Algorithms, Mathematics of Operations Research, vol.39, issue.4, pp.967-997, 2014. ,
DOI : 10.1287/moor.2014.0663
Regret Analysis of Stochastic and Nonstochastic Multiarmed Bandit Problems, Machine Learning, pp.1-122, 2012. ,
Prediction, learning, and games, 2006. ,
DOI : 10.1017/CBO9780511546921
Online learning of noisy data with kernels, Conference on Learning Theory, 2010. ,
Prediction by random-walk perturbation, Conference on Learning Theory, 2013. ,
Sequential Prediction of Unbounded Stationary Time Series, IEEE Transactions on Information Theory, vol.53, issue.5, pp.1866-1872, 2007. ,
DOI : 10.1109/TIT.2007.894660
Efficient learning by implicit excitedon in bandit problems with side observations, Neural Information Processing Systems, 2014. ,
From bandits to experts: On the value of side-observations, Neural Information Processing Systems, 2011. ,
Online learning with Gaussian payoffs and side observations, Neural Information Processing Systems, 2015. ,