Performance Comparison of Hierarchical Checkpoint Protocols on Grid Computing - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Performance Comparison of Hierarchical Checkpoint Protocols on Grid Computing

Résumé

Grid infrastructure is a large set of nodes geographically distributed and connected by a communication. In this context, fault tolerance is a necessity imposed by the distribution as any node can fail at any moment and the average time between failures highly decreases. To improve the robustness of supercomputing applications in the presence of failures, many techniques have been developed to provide resistance to these faults of the system. Fault tolerance is intended to allow the system to provide service as specified in spite of occurrences of faults. To meet this need, several techniques have been proposed in the literature. We will study the protocols based on rollback recovery classified into two categories: checkpoint-based rollback recovery protocols and message logging protocols. However, the performance of a protocol depends on the characteristics of the system, network and applications running. Faced with the constraints of large-scale environments, many of algorithms of the literature showed inadequate.Given an application environment and a system, it is not easy to identify the recovery protocol that is most appropriate for a cluster or hierarchical environment, like grid computing. Hence there is a need to implement these protocols in a hierarchical fashion to compare their performance in grid computing. In this paper, we propose hierarchical version of these protocols. We have implemented and compare their performance in clusters and grid computing using the Omnet++ simulator.

Dates et versions

hal-01273280 , version 1 (12-02-2016)

Identifiants

Citer

Ndeye Massata Ndiaye, Pierre Sens, Ousmane Thiare. Performance Comparison of Hierarchical Checkpoint Protocols on Grid Computing. 9th International Conference, Distributed Computing and Artificial Intelligence, Mar 2012, Salamanca, Spain. pp.339-346, ⟨10.1007/978-3-642-28765-7_40⟩. ⟨hal-01273280⟩
62 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More