White paper: Cisco vni forecast and methodology, 2016. ,
Seagull: Intelligent cloud bursting for enterprise applications, USENIX ATC '12: Conference on Annual Technical Conference, pp.33-33, 2012. ,
On exploiting data locality for iterative mapreduce applications in hybrid clouds, BDCAT '16: 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, pp.118-122, 2016. ,
Enabling big data analytics in the hybrid cloud using iterative MapReduce, UCC '15: 8th IEEE/ACM International Conference on Utility and Cloud Computing, pp.290-299, 2015. ,
, Hadoop: The Definitive Guide. USA, 2010.
Evaluation of data locality strategies for hybrid cloud bursting of iterative mapreduce, CCGrid'17: 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp.181-185, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01469991
MapReduce in the clouds for science, CloudCom '10: 2on IEEE Conference on Cloud Computing Technology and Science, pp.565-572, 2010. ,
A scalable two-phase top-down specialization approach for data anonymization using MapReduce on cloud, IEEE Transactions on Parallel and Distributed Systems, vol.25, issue.2, pp.363-373, 2014. ,
Bursting the cloud data bubble: Towards transparent storage elasticity in IaaS clouds, IPDPS '14: 28th IEEE International Parallel and Distributed Processing Symposium, pp.135-144, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00947599
, Transparent Throughput Elasticity for IaaS Cloud Storage Using Guest-Side Block-Level Caching, UCC'14: 7th IEEE/ACM International Conference on Utility and Cloud Computing, 2014.
Leveraging adaptive I/O to optimize collective data shuffling patterns for big data analytics, IEEE Transactions on Parallel and Distributed Systems, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01531374
HaLoop: Efficient iterative data processing on large clusters, Proc. VLDB Endow, vol.3, issue.1-2, pp.285-296, 2010. ,
iMapReduce: A distributed computing framework for iterative computation, Journal of Grid Computing, vol.10, issue.1, pp.47-68, 2012. ,
Towards transparent throughput elasticity for IaaS cloud storage: Exploring the benefits of adaptive block-level caching, International Journal of Distributed Systems and Technologies, vol.6, issue.4, pp.21-44, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01199464
Towards optimal resource provisioning for running MapReduce programs in public clouds, CLOUD '11: IEEE International Conference on Cloud Computing, pp.155-162, 2011. ,
CRESP: Towards optimal resource provisioning for MapReduce computing in public clouds, IEEE Transactions on Parallel and Distributed Systems, vol.25, issue.6, pp.1403-1412, 2014. ,
AROMA: Automated resource allocation and configuration of MapReduce environment in the cloud, ICAC '12: 9th International Conference on Autonomic Computing, pp.63-72, 2012. ,
Starfish: A self-tuning system for big data analytics, CRID '11: 5th Biennial Conference on Innovative Data Systems Research, pp.261-272, 2011. ,
ARIA: Automatic Resource Inference and Allocation for Mapreduce Environments, ICAC '11: 8th ACM International Conference on Autonomic Computing, 2011. ,
Resource provisioning framework for MapReduce jobs with performance goals, Middleware '11: 12th ACM/IFIP/USENIX International Middleware Conference, pp.165-186, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01597764
Hadoop Performance Models, CS-2011-05, 2011. ,
Benchmarking approach for designing a mapreduce performance model, ICPE '13: 4th ACM/SPEC International Conference on Performance Engineering, pp.253-258, 2013. ,
, Performance modeling of mapreduce jobs in heterogeneous cloud environments, CLOUD '13: 6th IEEE International Conference on Cloud Computing, pp.839-846, 2013.
Improving mapreduce performance in heterogeneous environments, OSDI '08: 8th USENIX Conference on Operating Systems Design and Implementation, pp.29-42, 2008. ,
Tarazu: Optimizing mapreduce on heterogeneous clusters, ASPLOS '12: 17th International Conference on Architectural Support for Programming Languages and Operating Systems, pp.61-74, 2012. ,
Performance management of accelerated MapReduce workloads in heterogeneous clusters, 2010. ,
The Hadoop distributed file system, MSST '10: 26th IEEE Symposium on Massive Storage Systems and Technologies, 2010. ,
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing, NSDI'12: 9th USENIX Conference on Networked Systems Design and Implementation, vol.2, pp.1-2, 2012. ,
Pregel: A system for large-scale graph processing, SIGMOD'10: The 2010 ACM SIGMOD International Conference on Management of Data, pp.135-146, 2010. ,
Apache hadoop rumen, pp.13-15 ,
Sysstat utilities for the Linux OS, pp.13-15 ,
,
The stratosphere platform for big data analytics, VLDB J, vol.23, issue.6, pp.939-964, 2014. ,
An analytical approach to evaluation of ssd effects under mapreduce workloads, Journal of Semiconductor Technology and Science, vol.15, pp.511-518, 2015. ,
On the energy efficiency of mapreduce shuffling operations in data centers, ICTON'17: 19th International Conference on Transparent Optical Networks, pp.1-5, 2017. ,
Clustering methods: A history of K-Means algorithms, Selected Contributions in Data Analysis and Classification, pp.161-172, 2007. ,
Parallel K-Means clustering based on MapReduce, CloudCom '09: 1st International Conference on Cloud Computing, 2009. ,
The HiBench benchmark suite: Characterization of the MapReduce-based data analysis, ICDEW '10: 26th IEEE International Conference on Data Engineering Workshops, pp.41-51, 2010. ,
The anatomy of a large-scale hypertextual web search engine, Comput. Netw. ISDN Syst, vol.30, issue.1-7, pp.107-117, 1998. ,
CC-MR-Finding Connected Components in Huge Graphs with MapReduce, pp.458-473, 2012. ,
BigDataBench: A big data benchmark suite from internet services ,
DOI : 10.1109/hpca.2014.6835958
URL : http://arxiv.org/pdf/1401.1406.pdf