E. G. Larsson, O. Edfors, F. Tufvesson, and T. L. Marzetta, Massive MIMO for next generation wireless systems, IEEE Commun. Mag, vol.52, issue.2, pp.186-195, 2014.

J. G. Andrews, S. Buzzi, W. Choi, S. Hanly, A. Lozano et al., What will 5G be?, IEEE J. Sel. Areas Commun, vol.32, issue.6, pp.1065-1082, 2014.

J. S. -b.-orange, A. G. Armada, B. Evans, A. Galis, and H. Karl, White paper for research beyond 5G, Accessed, vol.23, 2015.

E. C. Strinati, S. Barbarossa, J. L. Gonzalez-jimenez, D. Kténas, N. Cassiau et al., 6G: The next frontier, 2019.

J. Hoydis, S. Brink, and M. Debbah, Massive MIMO in the UL/DL of cellular networks: How many antennas do we need?, IEEE Trans. Wireless Commun, vol.31, issue.2, pp.160-171, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00925966

F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta et al., Scaling up MIMO: Opportunities and challenges with very large arrays, IEEE Signal Process. Mag, vol.30, pp.40-60, 2013.

R. S. Cheng and S. Verdú, Gaussian multiaccess channels with ISI: capacity region and multiuser water-filling, IEEE Trans. Inf. Theory, vol.39, issue.3, pp.773-785, 1993.

W. Yu, W. Rhee, S. Boyd, and J. M. Cioffi, Iterative water-filling for Gaussian vector multiple-access channels, IEEE Trans. Inf. Theory, vol.50, issue.1, pp.145-152, 2004.

G. Scutari, D. P. Palomar, and S. Barbarossa, Optimal linear precoding strategies for wideband non-cooperative systems based on game theorypart I: Nash equilibria, IEEE Trans. Signal Process, vol.56, issue.3, pp.1230-1249, 2008.

, Optimal linear precoding strategies for wideband non-cooperative systems based on game theory -part II: algorithms, IEEE Trans. Signal Process, vol.56, issue.3, pp.1250-1267, 2008.

E. V. Belmega, S. Lasaulce, M. Debbah, M. Jungers, and J. Dumont, Power allocation games in wireless networks of multi-antenna terminals, Telecommunication Systems, vol.47, issue.1-2, pp.109-122, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00447006

G. Scutari, D. P. Palomar, and S. Barbarossa, Asynchronous iterative waterfilling for Gaussian frequency-selective interference channels, ISIT '06: Proceedings of the 2006 International Symposium on Information Theory, vol.54, pp.2868-2878, 2006.

P. Mertikopoulos and A. L. Moustakas, Learning in an uncertain world: MIMO covariance matrix optimization with imperfect feedback, IEEE Trans. Signal Process, vol.64, issue.1, pp.5-18, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01382278

R. Liao, B. Bellalta, M. Oliver, and Z. Niu, MU-MIMO MAC protocols for wireless local area networks: A survey, IEEE Commun. Surveys Tuts, vol.18, issue.1, pp.162-183, 2016.

J. C. Spall, A one-measurement form of simultaneous perturbation stochastic approximation, Automatica, vol.33, issue.1, pp.109-112, 1997.

A. D. Flaxman, A. T. Kalai, and H. B. Mcmahan, Online convex optimization in the bandit setting: gradient descent without a gradient, SODA '05: Proceedings of the 16th annual ACM-SIAM Symposium on Discrete Algorithms, pp.385-394, 2005.

W. Li and M. Assaad, Matrix exponential learning schemes with low informational exchange, IEEE Trans. Signal Process, vol.67, issue.12, pp.3140-3153, 2019.

I. E. Telatar, Capacity of multi-antenna Gaussian channels, European Transactions on Telecommunications and Related Technologies, vol.10, issue.6, pp.585-596, 1999.

D. Monderer and L. S. Shapley, Potential games, Games and Economic Behavior, vol.14, issue.1, pp.124-143, 1996.

A. Neyman, Correlated equilibrium and potential games, International Journal of Game Theory, vol.26, issue.2, pp.223-227, 1997.

G. Scutari, D. P. Palomar, and S. Barbarossa, The MIMO iterative waterfilling algorithm, IEEE Trans. Signal Process, vol.57, issue.5, pp.1917-1935, 2009.

Z. Luo and J. Pang, Analysis of iterative waterfllining algorithms for multi-user power control in digital subscriber lines, EURASIP J. Appl. Signal Process, 2006.

P. Mertikopoulos, E. V. Belmega, A. L. Moustakas, and S. Lasaulce, Distributed learning policies for power allocation in multiple access channels, IEEE J. Sel. Areas Commun, vol.30, issue.1, pp.96-106, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00648840

P. Mertikopoulos, E. V. Belmega, and A. L. Moustakas, Matrix exponential learning: Distributed optimization in MIMO systems, ISIT '12: Proceedings of the 2012 IEEE International Symposium on Information Theory, pp.3028-3032, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00741823

Y. Nesterov, Primal-dual subgradient methods for convex problems, Mathematical Programming, vol.120, issue.1, pp.221-259, 2009.

Y. Yu, The strong convexity of von Neumann's entropy, 2013.

A. Shapiro, D. Dentcheva, and A. Ruszczy?ski, Lectures on stochastic programming : modeling and theory, ser. MPS-SIAM series on optimization, 2009.

J. Hiriart-urruty and C. , Fundamentals of Convex Analysis, ser. Grundlehren Text Editions, 2004.

R. T. Rockafellar, Convex analysis, ser, Princeton Mathematical Series, 1970.

S. M. Kakade, S. Shalev-shwartz, and A. Tewari, Regularization techniques for learning with matrices, J. Mach. Learn. Res, vol.13, pp.1865-1890, 2012.

Y. Nesterov, Introductory Lectures on Convex Optimization: A Basic Course, 2014.

M. Bravo, D. S. Leslie, and P. Mertikopoulos, Bandit learning in concave N-person games, NIPS '18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01891523

, Bandit learning in concave n-person games, 2018.

P. Hall and C. C. Heyde, Martingale limit theory and its application / P. Hall, C.C. Heyde, 1980.