A. Cassandra,

O. Swift,

S. A. Weil, S. A. Brandt, and E. L. Miller, Ceph: A scalable, high-performance distributed file system, Proceedings of the 7th Symposium on Operating Systems Design and Implementation, OSDI '06, USENIX Association, pp.307-320, 2006.

N. Liu and J. Cope, On the role of burst buffers in leadershipclass storage systems, IEEE 28th Symposium on Mass Storage Systems and Technologies, MSST 2012, pp.1-11, 2012.

M. Romanus, R. B. Ross, and M. Parashar, Challenges and considerations for utilizing burst buffers in high-performance computing

Q. Liu, J. Logan, and Y. Tian, Hello ADIOS: the challenges and lessons of developing leadership class I/O frameworks, Concurrency and Computation: Practice and Experience, vol.26, issue.7, pp.1453-1473, 2014.

C. Bartz, K. Chasapis, M. Kuhn, P. Nerge, and T. Ludwig, A best practice analysis of HDF5 and NetCDF-4 using lustre, High Performance Computing, no. 9137 in Lecture Notes in Computer Science, pp.274-281, 2015.

M. P. Forum, Mpi: A message-passing interface standard, 1994.

M. Zaharia, R. S. Xin, and P. Wendell, Apache spark: A unified engine for big data processing, Commun. ACM, vol.59, issue.11, pp.56-65, 2016.

K. Shvachko, H. Kuang, and S. Radia, The Hadoop distributed file system, 2010 IEEE 26th symposium on Mass Storage Systems and Technologies (MSST), pp.1-10, 2010.

M. Azure,

, Amazon Web Services

. Marenostrum,

Q. Zheng and K. Ren, Deltafs: Exascale file systems scale better without dedicated servers, Proceedings of the 10th Parallel Data Storage Workshop, PDSW '15, pp.1-6, 2015.

P. Carns, J. Jenkins, and C. D. Cranor, Enabling NVM for dataintensive scientific services, 4th Workshop on Interactions of NVM/Flash with Operating Systems and Workloads (INFLOW 16), 2016.

S. A. Weil, A. W. Leung, S. A. Brandt, and C. Maltzahn, Rados: A scalable, reliable storage service for petabyte-scale storage clusters, Proceedings of the 2nd international workshop on petascale data storage: held in conjunction with Supercomputing'07, pp.35-44, 2007.

B. Nicolae, G. Antoniu, and L. Bougé, Blobseer: Nextgeneration data management for large scale infrastructures, J. Parallel Distrib. Comput, vol.71, issue.2, pp.169-184, 2011.

P. Matri, A. Costan, G. Antoniu, J. Montes, and M. S. Pérez, T´yrT´yr: Blob storage meets built-in transactions, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis, pp.573-584, 2016.

, The Lustre File System, 2017.

M. Moore, D. Bonnie, and W. Ligon, 2011 9th USENIX Conference on File and Storage Technologies (FAST), 2011.

D. Kimpe and R. Ross, Storage models: Past, present, and future, High Performance Parallel I/O, pp.335-345, 2014.

J. Cope and K. Iskra, Bridging HPC and grid file I/O with IOFSL, Applied Parallel and Scientific Computing-10th International Conference, pp.215-225, 2010.

D. Huang and J. Yin, UNIO: A unified I/O system framework for hybrid scientific workflow, Cloud Computing and Big Data-Second International Conference, pp.99-114, 2015.

M. Kuhn, A Semantics-Aware I/O Interface for High Performance Computing, Supercomputing, no. 7905 in Lecture Notes in Computer Science, pp.408-421, 2013.

S. Ghemawat, H. Gobioff, and S. Leung, The Google file system, ACM SIGOPS Operating Systems Review, vol.37, pp.29-43, 2003.

S. Mikami, K. Ohta, and O. Tatebe, Using the gfarm file system as a POSIX compatible storage platform for Hadoop MapReduce applications, Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing, pp.181-189, 2011.

, BDEC-Big Data and Extreme-Scale Computing, 2017.

Z. Zhang, K. Barbary, and F. A. Nothaft, Scientific computing meets big data technology: An astronomy use case, 2015 IEEE International Conference on Big Data, Big Data, pp.918-927, 2015.

H. A. Duran-limon and J. Flores-contreras, Efficient execution of the WRF model and other HPC applications in the cloud, Earth Science Informatics, vol.9, issue.3, pp.365-382, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01288718

A. Jaikar and S. Noh, Cloud computing: Read before use, T. LargeScale Data-and Knowledge-Centered Systems, vol.30, pp.1-22, 2016.

A. Gupta and D. Milojicic, Evaluation of HPC applications on cloud, Open Cirrus Summit (OCS), pp.22-26, 2011.

R. Ledyayev and H. Richter, High performance computing in a cloud using OpenStack, pp.108-113, 2014.

A. Pan, J. P. Walters, and V. S. Pai, Integrating high performance file systems in a cloud computing environment, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.753-759, 2012.

Y. Abe and G. Gibson, pwalrus: Towards better integration of parallel file systems into cloud storage, Cluster Computing Workshops and Posters (Cluster Workshops), 2010 IEEE International Conference on, pp.1-7, 2010.

G. C. Fox and J. Shantenu, Towards a comprehensive set of big data benchmarks, Advances in Parallel Computing, vol.26, pp.47-66, 2015.

L. Wang, J. Zhan, and C. Luo, BigDataBench: A big data benchmark suite from Internet services, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), 2014.

, mpiBLAST: Open-Source Parallel BLAST, 2017.

S. F. Altschul and W. Gish, Basic local alignment search tool, Journal of Molecular Biology, vol.215, issue.3, pp.403-410, 1990.

I. Lorkowski, J. Pätsch, A. Moll, and W. Kühn, Interannual variability of carbon fluxes in the North Sea from 1970 to 2006competing effects of abiotic and biotic drivers on the gasexchange of CO 2, Estuarine, Coastal and Shelf Science, vol.100, pp.38-57, 2012.

F. Große and N. Greenwood, Looking beyond stratification: A model-based analysis of the biological drivers of oxygen depletion in the North Sea, Biogeosciences Discussions, pp.2511-2535, 2015.

J. Stone, An efficient library for parallel ray tracing and animation, 1995.

M. Zaharia, M. Chowdhury, and T. Das, Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing, Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, USENIX Association, pp.2-2, 2012.

M. Li, J. Tan, Y. Wang, L. Zhang, and V. Salapura, Sparkbench: a comprehensive benchmarking suite for in memory data analytic platform Spark, p.53, 2015.

D. Balouek and A. Carpen-amarie, Adding virtualization capabilities to the Grid'5000 testbed, Communications in Computer and Information Science, vol.367, pp.3-20, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00946971

G. Decandia, D. Hastorun, and M. Jampani, Dynamo: Amazon's highly available key-value store, SIGOPS Oper. Syst. Rev, vol.41, issue.6, pp.205-220, 2007.

S. Ishiguro and J. Murakami, Optimizing local file accesses for fuse-based distributed storage, SC Companion: High Performance Computing, Networking Storage and Analysis, pp.760-765, 2012.

S. Oral, J. Simmons, and J. Hill, Best practices and lessons learned from deploying and operating large-scale data-centric parallel file systems, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, pp.217-228, 2014.