Optimization of jobs submission on the EGEE production grid: modeling faults using workload

Diane Lingrand 1, * Johan Montagnat 1 Janusz Martyniak 2 David Colling 2
* Corresponding author
1 Laboratoire d'Informatique, Signaux, et Systèmes de Sophia-Antipolis (I3S) / Equipe MODALIS
Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : It is commonly observed that production grids are inherently unreliable. The aim of this work is to improve grid application performances by tuning the job submission system. A stochastic model, capturing the behavior of a complex grid workload management system is proposed. To instantiate the model, detailed statistics are extracted from dense grid activity traces. The model is exploited for optimizing a simple job resubmission strategy. It provides quantitative inputs to improve job submission performance and it enables the impact of faults and outliers on grid operations to be quantified.
Liste complète des métadonnées

Cited literature [19 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00677775
Contributor : Johan Montagnat <>
Submitted on : Friday, March 9, 2012 - 4:05:58 PM
Last modification on : Monday, November 5, 2018 - 3:52:09 PM
Document(s) archivé(s) le : Wednesday, December 14, 2016 - 11:54:03 AM

File

jogc.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Diane Lingrand, Johan Montagnat, Janusz Martyniak, David Colling. Optimization of jobs submission on the EGEE production grid: modeling faults using workload. Journal of Grid Computing, Springer Verlag, 2010, 8 (2), pp.305-321. ⟨10.1007/s10723-010-9151-2⟩. ⟨hal-00677775⟩

Share

Metrics

Record views

246

Files downloads

190