J. Dean and S. Ghemawat, Mapreduce: simplified data processing on large clusters, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.

K. Shvachko, H. Kuang, S. Radia, and R. Chansler, The hadoop distributed file system, Proceedings of the 26th IEEE Symposium on Mass Storage Systems and Technologies, pp.1-10, 2010.

M. Carvalho, W. Cirne, F. Brasileiro, and J. Wilkes, Longterm SLOs for reclaimed cloud computing resources, Proceedings of the 5th ACM Symposium on Cloud Computing, pp.1-13, 2014.

J. Dartois, A. Knefati, J. Boukhobza, and O. Barais, Using quantile regression for reclaiming unused cloud resources while achieving sla, Proceedings of the 10th IEEE International Conference on Cloud Computing Technology and Science, pp.89-98, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01898438

P. Marshall, K. Keahey, and T. Freeman, Improving utilization of infrastructure clouds, Proceedings of the 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp.205-214, 2011.

M. Zaharia, A. Konwinski, A. D. Joseph, R. H. Katz, and I. Stoica, Improving mapreduce performance in heterogeneous environments, Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation, pp.29-42, 2008.

M. C. Calzarossa, M. L. Della-vedova, L. Massari, D. Petcu, M. I. Tabash et al., Workloads in the clouds, Principles of Performance and Reliability Modeling and Evaluation: Essays in Honor of Kishor Trivedi on his 70th Birthday, pp.525-550, 2016.

M. Amiri and L. Mohammad-khanli, Survey on prediction models of applications for resources provisioning in cloud, Journal of Network and Computer Applications, vol.82, pp.93-113, 2017.

M. Katevenis, S. Sidiropoulos, and C. Courcoubetis, Weighted round-robin cell multiplexing in a general-purpose atm switch chip, IEEE Journal on Selected Areas in Communications, vol.9, issue.8, pp.1265-1279, 1991.

M. Khan, Y. Jin, M. Li, Y. Xiang, and C. Jiang, Hadoop performance modeling for job estimation and resource provisioning, IEEE Transactions on Parallel and Distributed Systems, vol.27, pp.441-454, 2016.

E. Sammer, Hadoop Operations: A Guide for Developers and Administrators, 2012.

J. C. Anjos, I. Carrera, W. Kolberg, A. L. Tibola, L. B. Arantes et al., Mra++: Scheduling and data placement on mapreduce for heterogeneous environments, Future Generation Computer Systems, vol.42, pp.22-35, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01197424

J. Dartois, J. Boukhobza, A. Knefati, and O. Barais, Investigating machine learning algorithms for modeling ssd i/o performance for container-based virtualization, IEEE Transactions on Cloud Computing, vol.14, pp.1-14, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02013421

M. Zaharia, D. Borthakur, J. Sarma, K. Elmeleegy, S. Shenker et al., Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling, Proceedings of the 5th European Conference on Computer Systems, pp.265-278, 2010.

H. Jin, X. Yang, X. Sun, and I. Raicu, Adapt: Availabilityaware mapreduce data placement for non-dedicated distributed computing, Proceedings of the 32nd IEEE International Conference on Distributed Computing Systems, pp.516-525, 2012.

H. Lin, X. Ma, J. Archuleta, W. Feng, M. Gardner et al., Moon: Mapreduce on opportunistic environments, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, pp.95-106, 2010.

L. A. Steffenel, O. Flauzac, A. S. Charão, P. P. Barcelos, B. Stein et al., Mapreduce challenges on pervasive grids, Journal of Computer Science, vol.10, issue.11, pp.2194-2210, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01085287

N. Chohan, C. Castillo, M. Spreitzer, M. Steinder, A. Tantawi et al., See spot run: using spot instances for mapreduce workflows, Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, pp.7-14, 2010.

J. Anjos, K. Matteussi, P. Souza, C. Geyer, A. D. Veith et al., Enabling strategies for big data analytics in hybrid infrastructures, Proceedings of the 16th International Conference on High Performance Computing & Simulation, pp.869-876, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01875952

M. Merabet, S. Benslimane, M. Barhamgi, and C. Bonnet, A predictive map task scheduler for optimizing data locality in mapreduce clusters, International Journal of Grid and High Performance Computing, vol.10, issue.4, pp.1-14, 2018.

B. Tang, M. Tang, G. Fedak, and H. He, Availability/network-aware mapreduce over the internet, Information Sciences, vol.379, pp.94-111, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01426393