Models for scheduling on large scale platforms: which policy for which application?

Pierre-Francois Dutot 1, 2 Lionel Eyraud-Dubois 1, 2 Grégory Mounié 1, 2 Denis Trystram 1, 2
2 APACHE - Parallel algorithms and load sharing
ID-IMAG - Informatique et Distribution, Inria Grenoble - Rhône-Alpes, UJF - Université Joseph Fourier - Grenoble 1
Abstract : In the recent years, there was a huge development of low cost large scale parallel systems. The design of efficient parallel algorithms has to be reconsidered by the influence of new parameters of such execution supports (namely, clusters of workstations, grid computing and global computing) which are characterized by a larger number of heterogeneous processors, often organized by hierarchical sub-systems. Alternative computational models have been designed in order to take into account new characteristics. Parallel Tasks model -- PT in short -- (i.e. tasks that require more than one processor for their execution) is a promising alternative for scheduling parallel applications, especially in the case of slow communication media. The basic idea is to consider the application at a rough level of granularity. Another way of looking at the problem (which is somehow a dual view) is the Divisible Load model (DL) where an application is considered as a collection of a large number of elementary -- sequential -- computing units that will be distributed among the available resources. As the main difficulty for scheduling in actual systems comes from handling efficiently the communications, these two new views of the problem allow us to consider them implicitly or to mask them, thus leading to more tractable problems. This paper aims first at presenting some examples of approximation algorithms for parallelizing applications for the PT model with a special emphasis on new execution supports. Then, we will show how to mix these results with the DLT model in order to integrate them into the previous model for managing the resources of an actual computational grid composed by more than 600 machines built in Grenoble (CiGri project).
Type de document :
Communication dans un congrès
18th International Parallel and Distributed Processing Symposium (IPDPS'04), 2004, Santa Fe, New Mexico, United States. IEEE, 2004, Workshop 7: Workshop on Advances in Parallel and Distributed Computational Models - APDCM'04
Liste complète des métadonnées

Littérature citée [16 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00003125
Contributeur : Grégory Mounié <>
Soumis le : vendredi 22 octobre 2004 - 10:26:22
Dernière modification le : vendredi 21 décembre 2018 - 10:46:07
Document(s) archivé(s) le : jeudi 1 avril 2010 - 15:31:43

Identifiants

  • HAL Id : hal-00003125, version 1

Collections

INRIA | IMAG | UGA

Citation

Pierre-Francois Dutot, Lionel Eyraud-Dubois, Grégory Mounié, Denis Trystram. Models for scheduling on large scale platforms: which policy for which application?. 18th International Parallel and Distributed Processing Symposium (IPDPS'04), 2004, Santa Fe, New Mexico, United States. IEEE, 2004, Workshop 7: Workshop on Advances in Parallel and Distributed Computational Models - APDCM'04. 〈hal-00003125〉

Partager

Métriques

Consultations de la notice

476

Téléchargements de fichiers

92