ReLeaSER: A Reinforcement Learning Strategy for Optimizing Utilization Of Ephemeral Cloud Resources

Mohamed Handaoui; Jean-Emile Dartois; Jalil Boukhobza; Olivier Barais; Laurent d'Orazio

Communication Dans Un Congrès Année : 2020

ReLeaSER: A Reinforcement Learning Strategy for Optimizing Utilization Of Ephemeral Cloud Resources

(1) , (2) , (3) , (4) , (5)

1
2
3
4
5

Mohamed Handaoui

Fonction : Auteur
PersonId : 182965
IdHAL : handaoui-mohamed
ORCID : 0000-0002-4353-194X

Domaine Hypermedia (IRT b<>com)

Jean-Emile Dartois

Fonction : Auteur
PersonId : 179284
IdHAL : jean-emile-dartois
ORCID : 0000-0002-2050-8472
IdRef : 253124328

Institut de Recherche Technologique b-com

Jalil Boukhobza

Fonction : Auteur
PersonId : 1840
IdHAL : jalil-boukhobza
ORCID : 0000-0002-2194-4006
IdRef : 09772582X

Lab-STICC_UBO_CACS_MOCS

Olivier Barais

Fonction : Auteur
PersonId : 1972
IdHAL : olivierbarais
ORCID : 0000-0002-4551-8562
IdRef : 094608946

Diversity-centric Software Engineering

Laurent d'Orazio

Fonction : Auteur
PersonId : 1054026

Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes

Résumé

Cloud data center capacities are over-provisioned to handle demand peaks and hardware failures which leads to low resources' utilization. One way to improve resource utilization and thus reduce the total cost of ownership is to offer unused resources (referred to as ephemeral resources) at a lower price. However, reselling resources needs to meet the expectations of its customers in terms of Quality of Service. The goal is so to maximize the amount of reclaimed resources while avoiding SLA penalties. To achieve that, cloud providers have to estimate their future utilization to provide availability guarantees. The prediction should consider a safety margin for resources to react to unpredictable workloads. The challenge is to find the safety margin that provides the best trade-off between the amount of resources to reclaim and the risk of SLA violations. Most state-of-the-art solutions consider a fixed safety margin for all types of metrics (e.g., CPU, RAM). However, a unique fixed margin does not consider various workloads variations over time which may lead to SLA violations or/and poor utilization. In order to tackle these challenges, we propose ReLeaSER, a Reinforcement Learning strategy for optimizing the ephemeral resources' utilization in the cloud. ReLeaSER dynamically tunes the safety margin at the host-level for each resource metric. The strategy learns from past prediction errors (that caused SLA violations). Our solution reduces significantly the SLA violation penalties on average by 2.7x and up to 3.4x. It also improves considerably the CPs' potential savings by 27.6% on average and up to 43.6%.

Mots clés

Cloud Ephemeral Resources Resource Optimization SLA Safety Margin Reinforcement Learning

Domaines

Informatique [cs]

Fichier principal

2020003050.pdf (598.49 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Handaoui Mohamed : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02989286

Soumis le : jeudi 5 novembre 2020-10:08:44

Dernière modification le : mercredi 10 janvier 2024-15:18:03

Archivage à long terme le : samedi 6 février 2021-18:35:49

Dates et versions

hal-02989286 , version 1 (05-11-2020)

Identifiants

HAL Id : hal-02989286 , version 1

Citer

Mohamed Handaoui, Jean-Emile Dartois, Jalil Boukhobza, Olivier Barais, Laurent d'Orazio. ReLeaSER: A Reinforcement Learning Strategy for Optimizing Utilization Of Ephemeral Cloud Resources. CloudCom 2020 - 12th IEEE International Conference on Cloud Computing Technology and Science, Dec 2020, Bangkok, Thailand. pp.1-9. ⟨hal-02989286⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BREST INSTITUT-TELECOM UNIV-RENNES1 PRES_CLERMONT CNRS INRIA INSA-RENNES ENSSAT IRISA LAB-STICC_UBO LIMOS ENIB LAB-STICC CENTRALESUPELEC INRIA2 UR1-MATH-STIC BCOM_HYPERMEDIA UR1-UFR-ISTIC UNIV-RENNES IBNM UR1-MATH-NUM CYBERSCHOOL CLERMONT-AUVERGNE-INP

141 Consultations

102 Téléchargements

ReLeaSER: A Reinforcement Learning Strategy for Optimizing Utilization Of Ephemeral Cloud Resources

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager