P. Auer, N. Cesa-bianchi, Y. Freund, and R. Schapire, The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, vol.32, issue.1, pp.48-77, 2002.
DOI : 10.1137/S0097539701398375

P. Auer, N. Cesa-bianchi, and C. Gentile, Adaptive and Self-Confident On-Line Learning Algorithms, Journal of Computer and System Sciences, vol.64, issue.1, pp.48-75, 2002.
DOI : 10.1006/jcss.2001.1795

K. Azuma, Weighted sums of certain dependent random variables, Tohoku Mathematical Journal, vol.19, issue.3, pp.357-367, 1967.
DOI : 10.2748/tmj/1178243286

A. Baños, On Pseudo-Games, The Annals of Mathematical Statistics, vol.39, issue.6, pp.1932-1945, 1968.
DOI : 10.1214/aoms/1177698023

D. P. Bertsekas, Nonlinear Programming, Athena Scientific, 1995.

D. Blackwell, Controlled random walks, Proceedings of the International Congress of Mathematicians, pp.336-338, 1954.

N. Cesa-bianchi, Analysis of two gradient-based algorithms for on-line regression, Proceedings of the tenth annual conference on Computational learning theory , COLT '97, pp.392-411, 1999.
DOI : 10.1145/267460.267492

N. Cesa-bianchi, Y. Freund, D. Haussler, D. P. Helmbold, R. Schapire et al., How to use expert advice, Journal of the ACM, vol.44, issue.3, pp.427-485, 1997.
DOI : 10.1145/258128.258179

N. Cesa-bianchi and G. Lugosi, On Prediction of Individual Sequences, SSRN Electronic Journal, vol.27, pp.1865-1895, 1999.
DOI : 10.2139/ssrn.139692

N. Cesa-bianchi, G. Lugosi, and G. Stoltz, Regret Minimization Under Partial Monitoring, Mathematics of Operations Research, vol.31, issue.3, pp.562-580, 2006.
DOI : 10.1287/moor.1060.0206
URL : https://hal.archives-ouvertes.fr/hal-00007538

N. Cesa-bianchi, Y. Mansour, and G. Stoltz, Improved second-order bounds in prediction with expert advice, Machine Learning

X. Chen and H. White, Laws of Large Numbers for Hilbert Space-Valued Mixingales with Applications, Econometric Theory, vol.3, issue.02, pp.284-304, 1996.
DOI : 10.1016/0167-7152(93)90134-5

D. Foster and R. Vohra, Asymptotic calibration, Biometrika, vol.85, issue.2, pp.379-390, 1998.
DOI : 10.1093/biomet/85.2.379

D. A. Freedman, On Tail Probabilities for Martingales, The Annals of Probability, vol.3, issue.1, pp.100-118, 1975.
DOI : 10.1214/aop/1176996452

J. Hannan, Approximation to Bayes risk in repeated play, Contributions to the Theory of Games, pp.97-139, 1957.

S. Hart and A. Mas, A Simple Adaptive Procedure Leading to Correlated Equilibrium, Econometrica, vol.68, issue.5, pp.1127-1150, 2000.
DOI : 10.1111/1468-0262.00153

S. Hart and A. Mas, A reinforcement procedure leading to correlated equilibrium, Economic Essays: A Festschrift for Werner Hildenbrand, pp.181-200, 2002.

W. Hoeffding, Probability Inequalities for Sums of Bounded Random Variables, Journal of the American Statistical Association, vol.1, issue.301, pp.13-30, 1963.
DOI : 10.1214/aoms/1177730491

J. Kivinen and M. Warmuth, Exponentiated Gradient versus Gradient Descent for Linear Predictors, Information and Computation, vol.132, issue.1, pp.1-63, 1997.
DOI : 10.1006/inco.1996.2612

N. Littlestone and M. Warmuth, The Weighted Majority Algorithm, Information and Computation, vol.108, issue.2, pp.212-261, 1994.
DOI : 10.1006/inco.1994.1009

S. Mannor and N. Shimkin, On-Line Learning with Imperfect Monitoring, Proceedings of the 16th Annual Conference on Learning Theory, pp.552-567, 2003.
DOI : 10.1007/978-3-540-45167-9_40

N. Megiddo, On repeated games with incomplete information played by non-Bayesian players, International Journal of Game Theory, vol.2, issue.3, pp.157-167, 1980.
DOI : 10.1007/BF01781370

A. Piccolboni and C. Schindelhauer, Discrete prediction games with arbitrary feedback and loss, Proceedings of the 14th Annual Conference on Computational Learning Theory, pp.208-223, 2001.

A. Rustichini, Minimizing Regret: The General Case, Games and Economic Behavior, vol.29, issue.1-2, pp.224-243, 1999.
DOI : 10.1006/game.1998.0690

M. Lugosi and S. , Strategies for prediction under imperfect monitoring Mathematics of Operations Research xx(x)

V. Vovk, AGGREGATING STRATEGIES, Proceedings of the 3rd Annual Workshop on Computational Learning Theory, pp.372-383, 1990.
DOI : 10.1016/B978-1-55860-146-8.50032-1

V. Vovk, A Game of Prediction with Expert Advice, Journal of Computer and System Sciences, vol.56, issue.2, pp.153-173, 1998.
DOI : 10.1006/jcss.1997.1556

T. Weissman and N. Merhav, Universal prediction of individual binary sequences in the presence of noise, IEEE Transactions on Information Theory, vol.47, issue.6, pp.2151-2173, 2001.
DOI : 10.1109/18.945240

T. Weissman, N. Merhav, and A. Somekh-baruch, Twofold universal prediction schemes for achieving the finite-state predictability of a noisy individual binary sequence, IEEE Transactions on Information Theory, vol.47, issue.5, pp.1849-1866, 2001.
DOI : 10.1109/18.930923