Heterogeneity-aware Deep Learning Workload Deployments on the Computing Continuum - Archive ouverte HAL Accéder directement au contenu
Poster De Conférence Année : 2021

Heterogeneity-aware Deep Learning Workload Deployments on the Computing Continuum

Thomas Bouvier
Alexandru Costan
Gabriel Antoniu

Résumé

The increasing need for real-time analytics motivated the emergence of new incremental methods to learn representations from continuous flows of data, especially in the context of the Internet of Things. This trend led to the evolution of centralized computing infrastructures towards interconnected processing units spanning from edge devices to cloud data centers. This new paradigm is referred to as the Computing or Edge-to-Cloud Continuum. However, the network and compute heterogeneity across and within clusters may negatively impact Deep Learning (DL) training. We introduce a roadmap for understanding the end-to-end performance of DL workloads in such heterogeneous settings. The goal is to identify key parameters leading to stragglers and devise novel intra- and inter-cluster strategies to address them. We will explore various policies aiming to improve makespan, cost and fairness objectives while ensuring system scalability.
Fichier principal
Vignette du fichier
ipdps_21_poster.pdf (445.56 Ko) Télécharger le fichier
ipdps_21_paper.pdf (166.83 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03270129 , version 1 (25-06-2021)

Identifiants

  • HAL Id : hal-03270129 , version 1

Citer

Thomas Bouvier, Alexandru Costan, Gabriel Antoniu. Heterogeneity-aware Deep Learning Workload Deployments on the Computing Continuum. IPDPS 2021 - 35th IEEE International Parallel and Distributed Processing Symposium, May 2021, Virtual / Portland, United States. ⟨hal-03270129⟩
159 Consultations
116 Téléchargements

Partager

Gmail Facebook X LinkedIn More