CCK: An Improved Coordinated Checkpoint/Rollback Protocol for Dataflow Applications in KAAPI - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

CCK: An Improved Coordinated Checkpoint/Rollback Protocol for Dataflow Applications in KAAPI

Résumé

Fault tolerance protocols play an important role in today long runtime scientific parallel applications because the probability of failure may be important due to the number of unreliable components involved during simulation. In this paper we present our approach and preliminary results about a new checkpoint/recovery protocol based on a coordinated scheme. This protocol is highly coupled to the availability of an abstract representation of the execution.
Fichier non déposé

Dates et versions

hal-00684864 , version 1 (03-04-2012)

Identifiants

Citer

Xavier Besseron, Samir Jafar, Thierry Gautier, Jean-Louis Roch. CCK: An Improved Coordinated Checkpoint/Rollback Protocol for Dataflow Applications in KAAPI. ICTTA'06 IEEE Conference on Information and Communication Technologies: from Theory to Applications, Apr 2006, Damascus, Syria. ⟨10.1109/ICTTA.2006.1684955⟩. ⟨hal-00684864⟩
91 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More