A. Marcastel, E. V. Belmega, P. Mertikopoulos, and I. Fijalkow, Online power allocation for opportunistic radio access in dynamic OFDM networks, Vehicular Technology Conference (VTC-Fall), pp.1-5, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01387044

, Online interference mitigation via learning in dynamic IoT environments, Globecom Workshop, Internet of Everything, pp.1-5, 2016.

A. I. Sulyman, S. M. Oteafy, and H. S. Hassanein, Expanding the cellular-IoT umbrella: An architectural approach, IEEE Trans. Wireless Commun, vol.24, issue.3, pp.66-71, 2017.

M. Basharat, W. Ejaz, M. Naeem, A. M. Khattak, and A. Anpalagan, A survey and taxonomy on nonorthogonal multiple-access schemes for 5G networks, Transactions on Emerging Telecommunications Technologies, vol.29, issue.1, pp.1-17, 2018.

, Huawei Technologies, 5G: A technology vision. White paper, 2013.

S. Shalev-shwartz, Online learning and online convex optimization, Foundations and Trends in Machine Learning, vol.4, pp.107-194, 2011.

S. Bubeck and N. Cesa-bianchi, Regret analysis of stochastic and nonstochastic multi-armed bandit problems, Foundations and Trends in Machine Learning, vol.5, pp.1-122, 2012.

I. Caragiannis, C. Kaklamanis, P. Kanellopoulos, M. Kyropoulou, B. Lucier et al., Bounding the inefficiency of outcomes in generalized second price auctions, Journal of Economic Theory, vol.156, pp.343-388, 2015.

N. Cesa-bianchi, C. Gentile, and Y. Mansour, Regret minimization for reserve prices in second-price auctions, IEEE Trans. Inf. Theory, vol.61, issue.1, pp.549-564, 2015.

P. Viappiani and C. Boutilier, Regret-based optimal recommendation sets in conversational recommender systems, Proc. 3rd ACM conference on Recommender Systems, pp.101-108, 2009.

P. Mertikopoulos and E. V. Belmega, Transmit without regrets: Online optimization in MIMO-OFDM cognitive radio systems, IEEE J. Sel. Areas Commun, vol.32, issue.11, 1987.
URL : https://hal.archives-ouvertes.fr/hal-01073500

, Learning to be green: Robust energy efficiency maximization in dynamic MIMO-OFDM systems, IEEE J. Sel. Areas Commun, vol.34, issue.4, pp.743-757, 2016.

D. Miorandi, S. Sicari, F. D. Pellegrini, and I. Chlamtac, Internet of things: Vision, applications and research challenges, Ad Hoc Networks, vol.10, issue.7, pp.1497-1516, 2012.

C. Goursaud and J. Gorce, Dedicated networks for IoT: PHY/-MAC state of the art and challenges, EAI Endorsed Transactions on Internet of Things, vol.1, issue.1, pp.1-11, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01231221

Y. Chen, F. Han, Y. Yang, H. Ma, Y. Han et al., Time-reversal wireless paradigm for green internet of things: An overview, IEEE Internet Things J, vol.1, issue.1, pp.81-98, 2014.

W. Li, M. Assaad, and P. Duhamel, Distributed stochastic optimization in networks with low informational exchange, Communication, Control, and Computing (Allerton), 2017 55th Annual Allerton Conference on, pp.1160-1167, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01578376

D. J. Love, R. W. Heath, V. K. Lau, D. Gesbert, B. D. Rao et al., An overview of limited feedback in wireless communication systems, IEEE J. Sel. Areas Commun, vol.26, issue.8, 2008.

L. Lu, G. Y. Li, A. L. Swindlehurst, A. Ashikhmin, and R. Zhang, An overview of massive MIMO: Benefits and challenges, IEEE J. Sel. Areas Commun, vol.8, issue.5, pp.742-758, 2014.

W. Li, M. Assaad, G. Ayache, and M. Larranaga, Matrix exponential learning for resource allocation with low informational exchange, IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), pp.266-270, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01804897

W. Li and M. Assaad, Matrix exponential learning schemes with low informational exchange, 2018.

W. Yu, W. Rhee, S. Boyd, and J. M. Cioffi, Iterative water-filling for Gaussian vector multiple-access channels, IEEE Trans. Inf. Theory, vol.50, issue.1, pp.145-152, 2004.

J. Pang, G. Scutari, F. Facchinei, and C. Wang, Distributed power allocation with rate constraints in Gaussian parallel interference channels, IEEE Trans. Inf. Theory, vol.54, issue.8, pp.3471-3489, 2008.

G. Scutari, D. P. Palomar, and S. Barbarossa, The MIMO iterative waterfilling algorithm, IEEE Trans. Signal Process, vol.57, issue.5, pp.1917-1935, 2009.

H. Safdar, N. Fisal, R. Ullah, W. Maqbool, F. Asraf et al., Resource allocation for uplink M2M communication: A game theory approach, Wireless Technology and Applications (ISWTA), 2013 IEEE Symp, pp.48-52, 2013.

M. S. Ali, H. Tabassum, and E. Hossain, Dynamic user clustering and power allocation for uplink and downlink non-orthogonal multiple access (NOMA) systems, IEEE Access, vol.4, pp.6325-6343, 2016.

T. Zheng, Y. Qin, H. Zhang, and S. Kuo, Adaptive power control for mutual interference avoidance in industrial Internet-of-Things, China Communications, vol.13, pp.124-131, 2016.

G. J. Foschini and Z. Miljanic, A simple distributed autonomous power control algorithm and its convergence, IEEE Trans. Veh. Technol, vol.42, issue.4, pp.641-646, 1993.

R. Masmoudi, E. V. Belmega, I. Fijalkow, and N. Sellami, A unifying view on energy-efficiency metrics in cognitive radio channels, Signal Processing Conference (EUSIPCO), pp.171-175, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01104190

T. Holliday, N. Bambos, P. Glynn, and A. Goldsmith, Distributed power control for time varying wireless networks: Optimality and convergence, Proc. of the annual ALLERTON Conference on Communication Control and Computing, vol.41, pp.1024-1033, 2003.

R. Tajan, C. Poulliat, and I. Fijalkow, Interference management for cognitive radio systems exploiting primary IR-HARQ: A constrained Markov decision process approach, Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on, pp.1818-1822, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00781963

C. Isheden, Z. Chong, E. Jorswieck, and G. Fettweis, Framework for link-level energy efficiency optimization with informed transmitter, IEEE Trans. Wireless Commun, vol.11, issue.8, pp.2946-2957, 2012.

J. Wang, C. Jiang, Z. Han, Y. Ren, and L. Hanzo, Network association strategies for an energy harvesting aided super-WiFi network relying on measured solar activity, IEEE J. Sel. Areas Commun, vol.34, issue.12, pp.3785-3797, 2016.

T. Chen, A. Mokhtari, X. Wang, A. Ribeiro, and G. B. Giannakis, Stochastic averaging for constrained optimization with application to online resource allocation, IEEE Trans. Signal Process, vol.65, issue.12, pp.3078-3093, 2017.

P. Mertikopoulos, E. V. Belmega, R. Negrel, and L. Sanguinetti, Distributed stochastic optimization via matrix exponential learning, IEEE Trans. Signal Process, vol.65, issue.9, pp.2277-2290, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01382285

C. Jiang, H. Zhang, Y. Ren, Z. Han, K. Chen et al., Machine learning paradigms for next-generation wireless networks, IEEE Wireless Communications, vol.24, issue.2, pp.98-105, 2017.

A. Anandkumar, N. Michael, A. K. Tang, and A. Swami, Distributed algorithms for learning and cognitive medium access with logarithmic regret, IEEE J. Sel. Areas Commun, vol.29, issue.4, pp.731-745, 2011.

M. Hashemi, A. Sabharwal, C. E. Koksal, and N. B. Shroff, Efficient beam alignment in millimeter wave systems using contextual bandits, 2017.

J. Wang, C. Jiang, H. Zhang, X. Zhang, V. C. Leung et al., Learning-aided network association for hybrid indoor LiFi-WiFi systems, IEEE Trans. Veh. Technol, vol.67, issue.4, pp.3561-3574, 2018.

T. Chen, S. Barbarossa, X. Wang, G. B. Giannakis, and Z. Zhang, Learning and management for Internet-of-Things: Accounting for adaptivity and scalability, 2018.

L. Mainetti, L. Patrono, and A. Vilei, Evolution of wireless sensor networks towards the internet of things: A survey, Software, Telecommunications and Computer Networks (SoftCOM), IEEE 19th Intl. Conf. on, pp.1-6, 2011.

G. M. Lee and N. Crespi, The Internet of Things: Challenge for a new architecture from problems, IAB Interconnecting Smart Objects with the Internet Workshop, pp.1-2, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01304069

T. Alpcan, T. Ba?ar, R. Srikant, and E. Altman, CDMA uplink power control as a noncooperative game, Wireless Networks, vol.8, issue.6, pp.659-670, 2002.

E. Altman and L. Wynter, Equilibrium, games, and pricing in transportation and telecommunication networks, Networks and Spatial Economics, vol.4, issue.1, pp.7-21, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00071953

M. Chiang, P. Hande, T. Lan, and C. W. Tan, Power control in wireless cellular networks, Foundations and Trends in Networking, vol.2, issue.4, pp.381-533, 2008.

E. V. Belmega, P. Mertikopoulos, R. Negrel, and L. Sanguinetti, Online convex optimization and no-regret learning: Algorithms, guarantees and applications, submitted to IEEE Signal Processing Magazine, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01891562

M. Zinkevich, Online convex programming and generalized infinitesimal gradient ascent, Proc. of the 20th International Conference on Machine Learning (ICML-03), pp.928-936, 2003.

G. Calcev, D. Chizhik, B. Göransson, S. Howard, H. Huang et al., A wideband spatial channel model for system-wide simulations, IEEE Trans. Veh. Technol, vol.56, issue.2, pp.389-403, 2007.

D. Tse and P. Viswanath, Fundamentals of wireless communication, 2005.

L. B. Klebanov, S. T. Rachev, and F. J. Fabozzi, Robust and NonRobust models in Statistics, 2009.

A. D. Flaxman, A. T. Kalai, and H. B. Mcmahan, Online convex optimization in the bandit setting: gradient descent without a gradient, SODA'05: Proceedings of the 16th annual ACM-SIAM symposium on discrete algorithms, pp.385-394, 2005.

G. F. Pedersen, COST 231-Digital mobile radio towards future generation systems, EU, 1999.

R. T. Rockafellar, Convex analysis, 2015.

G. H. Hardy, ENSEA, CNRS), he worked on applications of online optimization to dynamic and unpredictable wireless (IoT) networks. He is currently a teaching assistant in ENSEA, France in 2014, the M.Sc. and Ph.D. degrees both from the University of Cergy-Pontoise, France in 2015 and 2019, respectively. During his Ph.D. in ETIS laboratory (ETIS, UMR 8051, 1949.

E. Veronica, Since 2018, she is the recipient of the Doctoral Supervision and Research Bonus (PEDR) by the French National Council of Universities (CNU 61), Belmega (S'08-M'10) received the M.Sc. (engineer diploma) degree from the University Politehnica of Bucharest, 2007.

, Panayotis Mertikopoulos (M' 11) received the Ptychion degree in physics (summa cum laude) from the University of Athens in 2003, his M.Sc. and M.Phil. degrees in mathematics from Brown University in 2005 and 2006 (both summa cum laude), and his Ph.D. degree from the University of Athens in 2010, 2010.

P. , Mertikopoulos was an Embeirikeion Foundation Fellow between 2003 and 2006, and received the best paper award in NetGCoop '12. He is serving on the editorial board and program committees of several journals and conferences on learning and optimization

, USA) funded by the French CNRS. She is currently a full Professor at ENSEA at the "classe exceptionnelle" level. Her research interests are in signal processing for digital communications, including iterative processing, optimization, estimation theory, signal processing for dirty-RF. She is (co-)author of over 180 publications, His main research interests lie in learning, optimization, game theory, and their applications to networks and machine learning systems. Inbar Fijalkow (M'96-SM'10) received her engineering and Ph.D. degrees from TelecomParis, vol.8051, 1990.