T. Marzetta, Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas, IEEE Transactions on Wireless Communications, 2010.
DOI : 10.1109/TWC.2010.092810.091092

K. Liu and Q. Zhao, Indexability of restless bandit problems and optimality of Whittle index for dynamic multichannel access, pp.5547-5567, 2010.

W. Ouyang, S. Murugesan, A. Eryilmaz, and N. Shroff, Exploiting Channel Memory for Joint Estimation and Scheduling in Downlink Networks: A Whittle's Indexability Analysis, IEEE Transactions on Information Theory, vol.61, issue.4, pp.1702-1719, 2015.
DOI : 10.1109/TIT.2015.2399923

J. Gittins, K. Glazebrook, and R. Weber, Multi-armed Bandit Allocation Indices, 2011.
DOI : 10.1002/9780470980033

P. Whittle, Restless bandits: activity allocation in a changing world, Journal of Applied Probability, vol.25, issue.A, pp.287-298, 1988.
DOI : 10.2307/3214163

P. Jacko and S. Villar, Opportunistic schedulers for optimal scheduling of flows in wireless systems with ARQ feedback, 24th International Teletraffic Congress, 2012.

K. Liu, Q. Zhao, and B. Krishnamachari, Dynamic Multichannel Access With Imperfect Channel State Detection, IEEE Transactions on Signal Processing, vol.58, issue.5, pp.2795-2808, 2010.
DOI : 10.1109/TSP.2010.2041600
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.867

R. D. Smallwood and E. J. Sondik, The Optimal Control of Partially Observable Markov Processes over a Finite Horizon, Operations Research, vol.21, issue.5, pp.1071-1088, 1973.
DOI : 10.1287/opre.21.5.1071

M. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, 2005.
DOI : 10.1002/9780470316887

M. Larrañaga, M. Assaad, A. Destounis, and G. Paschos, Dynamic pilot allocation over Markovian fading channels: A restless bandit approach, 2016 IEEE Information Theory Workshop (ITW), 2016.
DOI : 10.1109/ITW.2016.7606842

C. H. Papadimitriou and J. N. Tsitsiklis, The Complexity of Optimal Queuing Network Control, Mathematics of Operations Research, vol.24, issue.2, pp.293-305, 1999.
DOI : 10.1287/moor.24.2.293

S. C. Albright, Structural Results for Partially Observable Markov Decision Processes, Operations Research, vol.27, issue.5, pp.1041-1053, 1979.
DOI : 10.1287/opre.27.5.1041

W. S. Lovejoy, Some Monotonicity Results for Partially Observed Markov Decision Processes, Operations Research, vol.35, issue.5, pp.736-743, 1987.
DOI : 10.1287/opre.35.5.736

D. Hodge and K. D. Glazebrook, Dynamic resource allocation in a multi-product make-to-stock production system, Queueing Systems, vol.67, issue.4, pp.333-364, 2011.
DOI : 10.1287/mnsc.41.4.690
URL : https://hal.archives-ouvertes.fr/hal-00803937

I. Verloop, Asymptotically optimal priority policies for indexable and non-indexable restless bandits, To appear in Annals of Applied Probability, 2016.
DOI : 10.1214/15-aap1137
URL : https://hal.archives-ouvertes.fr/hal-00743781

W. Ouyang, A. Eryilmaz, and N. Shroff, Asymptotically optimal downlink scheduling over Markovian fading channels, 2012 Proceedings IEEE INFOCOM, pp.1-9, 2012.
DOI : 10.1109/INFCOM.2012.6195483