A science-gateway workload archive application to the self-healing of workflow incidents

Rafael Ferreira da Silva 1, * Tristan Glatard 1, * Frédéric Desprez 2
* Corresponding author
1 Images et Modèles
CREATIS - Centre de Recherche en Acquisition et Traitement de l'Image pour la Santé
2 AVALON - Algorithms and Software Architectures for Distributed and HPC Platforms
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : Information about the execution of distributed workload is important for studies in computer science and engineering, but workloads acquired at the infrastructure-level reputably lack information about users and application-level middleware. Meanwhile, workloads acquired at science-gateway level contain detailed information about users, pilot jobs, task sub-steps, bag of tasks and workflow executions. In this work, we present a science-gateway archive, we illustrate its possibilities on a few case studies, and we use it for the autonomic handling of workflow incidents.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00766070
Contributor : Ccsd Sciencesconf.Org <>
Submitted on : Monday, December 17, 2012 - 3:09:13 PM
Last modification on : Wednesday, December 12, 2018 - 3:15:21 PM
Long-term archiving on : Sunday, December 18, 2016 - 3:48:54 AM

Identifiers

  • HAL Id : hal-00766070, version 1

Citation

Rafael Ferreira da Silva, Tristan Glatard, Frédéric Desprez. A science-gateway workload archive application to the self-healing of workflow incidents. journées scientifiques mésocentres et France Grilles 2012, Oct 2012, Paris, France. ⟨hal-00766070⟩

Share

Metrics

Record views

649

Files downloads

550