M. Nicosia, R. Klemann, K. Griffin, S. Taylor, B. Demuth et al., Rethinking Flat Rate Pricing for Broadband Services, 2012.

C. Cicconetti, 5G Radio Network Architecture, 2013.

E. Aryafar, A. Keshavarz-haddad, M. Wang, and M. Chiang, RAT selection games in HetNets, 2013 Proceedings IEEE INFOCOM, 2013.
DOI : 10.1109/INFCOM.2013.6566889

K. Khawam, M. Ibrahim, J. Cohen, S. Lahoud, and S. Tohme, Individual vs. Global Radio Resource Management in a Hybrid Broadband Network, 2011 IEEE International Conference on Communications (ICC), 2011.
DOI : 10.1109/icc.2011.5962580

URL : https://hal.archives-ouvertes.fr/inria-00528575

M. Ibrahim, K. Khawam, and S. Tohme, Congestion Games for Distributed Radio Access Selection in Broadband Networks, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010, 2010.
DOI : 10.1109/GLOCOM.2010.5683862

D. Niyato and E. Hossain, Dynamics of Network Selection in Heterogeneous Wireless Networks: An Evolutionary Game Approach, IEEE Transactions on Vehicular Technology, vol.58, issue.4, pp.2008-2017, 2009.
DOI : 10.1109/TVT.2008.2004588

P. Coucheney, C. Touati, and B. Gaujal, Fair and Efficient User-Network Association Algorithm for Multi-Technology Wireless Networks, IEEE INFOCOM 2009, The 28th Conference on Computer Communications, 2009.
DOI : 10.1109/INFCOM.2009.5062237

URL : https://hal.archives-ouvertes.fr/inria-00322403

O. Ercetin, Association Games in IEEE 802.11 Wireless Local Area Networks, IEEE Transactions on Wireless Communications, vol.7, issue.12, pp.5136-5143, 2008.

D. Kumar, E. Altman, and J. Kelif, User-Network Association in a WLAN-UMTS Hybrid Cell: Global & Individual Optimality, INRIA, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00088728

Q. Nguyen-vuong, N. Agoulmine, E. Cherkaoui, and L. Toni, Multicriteria Optimization of Access Selection to Improve the Quality of Experience in Heterogeneous Wireless Access Networks, IEEE Transactions on Vehicular Technology, vol.62, issue.4, pp.1785-1800, 2013.
DOI : 10.1109/TVT.2012.2234772

URL : https://hal.archives-ouvertes.fr/hal-00826231

M. E. Helou, S. Lahoud, M. Ibrahim, and K. Khawam, Satisfactionbased Radio Access Technology Selection in Heterogeneous Wireless Networks, Proc. IEEE IFIP Wireless Days Conference (WD), 2013.
URL : https://hal.archives-ouvertes.fr/hal-01018234

M. E. Helou, M. Ibrahim, S. Lahoud, and K. Khawam, Radio access selection approaches in heterogeneous wireless networks, 2013 IEEE 9th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob), 2013.
DOI : 10.1109/WiMOB.2013.6673408

URL : https://hal.archives-ouvertes.fr/hal-01018230

I. Chamodrakas and D. Martakos, A utility-based fuzzy TOPSIS method for energy efficient network selection in heterogeneous wireless networks, Applied Soft Computing, vol.12, issue.7, pp.1929-1938, 2012.
DOI : 10.1016/j.asoc.2012.04.016

F. Bari and V. C. Leung, Automated network selection in a heterogeneous wireless network environment, IEEE Network, vol.21, issue.1, pp.34-40, 2007.
DOI : 10.1109/MNET.2007.314536

E. Stevens-navarro and V. Wong, Comparison between Vertical Handoff Decision Algorithms for Heterogeneous Wireless Networks, 2006 IEEE 63rd Vehicular Technology Conference, 2006.
DOI : 10.1109/VETECS.2006.1682964

Q. Song and A. Jamalipour, Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques, IEEE Wireless Communications, vol.12, issue.3, pp.42-48, 2005.
DOI : 10.1109/MWC.2005.1452853

W. Zhang, Handover Decision Using Fuzzy MADM in Heterogeneous Networks, Proc. IEEE Wireless Communications and Networking Conference (WCNC), 2004.

O. Falowo and H. Chan, RAT selection for multiple calls in heterogeneous wireless networks using modified topsis group decision making technique, 2011 IEEE 22nd International Symposium on Personal, Indoor and Mobile Radio Communications, 2011.
DOI : 10.1109/PIMRC.2011.6139726

M. E. Helou, S. Lahoud, M. Ibrahim, and K. Khawam, A Hybrid Approach for Radio Access Technology Selection in Heterogeneous Wireless Networks, Proc. European Wireless Conference (EW), 2013.
DOI : 10.1007/s11277-015-2957-2

URL : https://hal.archives-ouvertes.fr/hal-01018232

L. Zhu, F. Yu, B. Ning, and T. Tang, Cross-Layer Handoff Design in MIMO-Enabled WLANs for Communication-Based Train Control (CBTC) Systems, IEEE Journal on Selected Areas in Communications, vol.30, issue.4, pp.719-728, 2012.
DOI : 10.1109/JSAC.2012.120506

L. Zhu, F. R. Yu, B. Ning, and T. Tang, Handoff management in communication-based train control networks using stream control transmission protocol and IEEE 802.11p WLANs, EURASIP Journal on Wireless Communications and Networking, vol.2012, issue.1, pp.211-226, 2012.
DOI : 10.1049/ip-com:19990130

X. Zhang, H. Jin, X. Ji, Y. Li, and M. Peng, A separate-SMDP approximation technique for RRM in heterogeneous wireless networks, 2012 IEEE Wireless Communications and Networking Conference (WCNC), 2012.
DOI : 10.1109/WCNC.2012.6214135

J. P. Singh, T. Alpcan, P. Agrawal, and V. Sharma, A Markov Decision Process based flow assignment framework for heterogeneous network access, Wireless Networks, vol.18, issue.6, pp.481-495, 2010.
DOI : 10.1007/s11276-008-0148-8

M. Ibrahim, K. Khawam, and S. Tohme, Network-Centric Joint Radio Resource Policy in Heterogeneous WiMAX-UMTS Networks for Streaming and Elastic traffic, 2009 IEEE Wireless Communications and Networking Conference, 2009.
DOI : 10.1109/WCNC.2009.4917831

M. Coupechoux, J. Kelif, and P. Godlewski, Network Controlled Joint Radio Resource Management for Heterogeneous Networks, VTC Spring 2008, IEEE Vehicular Technology Conference, 2008.
DOI : 10.1109/VETECS.2008.405

URL : https://hal.archives-ouvertes.fr/hal-01493339

H. Tabrizi, G. Farhadi, and J. Cioffi, Dynamic handoff decision in heterogeneous wireless systems: Q-learning approach, 2012 IEEE International Conference on Communications (ICC), 2012.
DOI : 10.1109/ICC.2012.6364194

C. Dhahri and T. Ohtsuki, Q-learning cell selection for femtocell networks: Single- and multi-user case, 2012 IEEE Global Communications Conference (GLOBECOM), 2012.
DOI : 10.1109/GLOCOM.2012.6503908

C. Watkins and P. Dayan, Technical Note, Machine Learning, pp.279-292, 1992.
DOI : 10.1007/978-1-4615-3618-5_4

M. R. Ryan, Hierarchical Reinforcement Learning: A Hybrid Approach, 2002.

Y. Ye, The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate, Mathematics of Operations Research, vol.36, issue.4, pp.593-603, 2011.
DOI : 10.1287/moor.1110.0516

. Melhem-el and . Helou, 15) received the engineering and master degrees in communications and networking from the Ecole Supérieure d'Ingénieurs de Beyrouth (ESIB), Faculty of Engineering at Saint Joseph University of Beirut, respectively and the PhD degree in communication networks from IRISA research institute, University of Rennes 1, 2014. He joined ESIB in September 2013 where he is currently an Assistant Professor (fr: Ma??treMa??tre de conférences). His research interests include wireless communications, cellular technologies, radio resource management, and optimization of heterogeneous networks, 2009.