S. Yi, J. Heo, Y. Cho, and J. Hong, Taking Point Decision Mechanism of Page-level Incremental Checkpointing based on Cost Analysis of Process Execution Time, The KIPS Transactions:PartA, vol.13, issue.4, pp.1325-1337, 2007.
DOI : 10.3745/KIPSTA.2006.13A.4.289

D. Kondo, B. Javadi, P. Malecot, F. Cappello, and D. P. Anderson, Cost-benefit analysis of Cloud Computing versus desktop grids, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5160911

URL : https://hal.archives-ouvertes.fr/hal-00788911

A. Andrzejak, D. Kondo, and D. P. Anderson, Exploiting non-dedicated resources for cloud computing, 2010 IEEE Network Operations and Management Symposium, NOMS 2010, 2010.
DOI : 10.1109/NOMS.2010.5488488

URL : https://hal.archives-ouvertes.fr/hal-00788869

M. Palankar, A. Iamnitchi, M. Ripeanu, and S. Garfinkel, Amazon S3 for science grids, Proceedings of the 2008 international workshop on Data-aware distributed computing, DADC '08, 2008.
DOI : 10.1145/1383519.1383526

S. Garfinkel, Commodity grid computing with amazons s3 and ec2, 2007.

E. Deelman, S. Gurmeet, M. Livny, J. Good, and B. Berriman, The cost of doing science on the cloud: The Montage example, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, 2008.
DOI : 10.1109/SC.2008.5217932

. Cloudkick, Simple, powerful tools to manage and monitor cloud servers, https://www.cloudkick, 2010.

J. Dean and S. Ghemawat, MapReduce, OSDI, pp.137-150, 2004.
DOI : 10.1145/1327452.1327492

M. Litzkow, M. Livny, and M. Mutka, Condor-a hunter of idle workstations, [1988] Proceedings. The 8th International Conference on Distributed, 1988.
DOI : 10.1109/DCS.1988.12507

G. Bosilca, A. Bouteiller, F. Cappello, S. Djilali, G. Fedak et al., MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, ACM/IEEE SC 2002 Conference (SC'02), 2002.
DOI : 10.1109/SC.2002.10048

URL : https://hal.archives-ouvertes.fr/in2p3-00457138

A. Duda, The effects of checkpointing on program execution time, Information Processing Letters, vol.16, issue.5, pp.221-229, 1983.
DOI : 10.1016/0020-0190(83)90093-5

S. Fu and C. Xu, Exploring event correlation for failure prediction in coalitions of clusters, Proceedings of the 2007 ACM/IEEE conference on Supercomputing , SC '07, pp.1-12, 2007.
DOI : 10.1145/1362622.1362678

B. Javadi, D. Kondo, J. Vincent, and D. Anderson, Mining for availability models in large-scale distributed systems: A case study of seti@home, 17th IEEE/ACM International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), 2009.
URL : https://hal.archives-ouvertes.fr/inria-00375624

D. Kondo, A. Andrzejak, and D. P. Anderson, On correlated availability in Internet-distributed systems, 2008 9th IEEE/ACM International Conference on Grid Computing, 2008.
DOI : 10.1109/GRID.2008.4662809

URL : https://hal.archives-ouvertes.fr/inria-00279991

A. Andrzejak, P. Domingues, and L. M. Silva, Predicting Machine Availabilities in Desktop Pools, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006, pp.1-4, 2006.
DOI : 10.1109/NOMS.2006.1687632

J. S. Plank, K. Li, and M. A. Puening, Diskless checkpointing, IEEE Transactions on Parallel and Distributed Systems, vol.9, issue.10, pp.972-986, 1998.
DOI : 10.1109/71.730527

P. Domingues, A. Andrzejak, and L. M. Silva, Using Checkpointing to Enhance Turnaround Time on Institutional Desktop Grids, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06), 2006.
DOI : 10.1109/E-SCIENCE.2006.261157