Learning fast is painful

Theo Bouganim; Andrea Araldo; Antoine Lavignotte; Nessim Oussedik; Gabriel Guez

Preprint, Working Paper. Year: 2020


Theo Bouganim

Role: Author

Télécom SudParis

Institut Polytechnique de Paris

Andrea Araldo

Role: Author
PersonId: 175764
IdHAL: andrea-araldo
ORCID: 0000-0002-5448-6646
IdRef: 197090443

Institut Polytechnique de Paris

Département Réseaux et Services de Télécommunications

Méthodes et modèles pour les réseaux

Antoine Lavignotte

Role: Author
PersonId: 740564
IdHAL: antoinelavignotte
ORCID: 0000-0001-8463-012X
IdRef: 18612564X

Institut Polytechnique de Paris

Département Réseaux et Services de Télécommunications

Nessim Oussedik

Role: Author

Télécom SudParis

Institut Polytechnique de Paris

Gabriel Guez

Role: Author

Télécom SudParis

Institut Polytechnique de Paris

Abstract

We study data-driven resource allocation in Multi-Tenant Edge Computing: a Network Operator (NO) owns resources at the Edge and dynamically allocates them to third-party application Service Providers (SPs). The NO's objective is to reduce its operational cost. Since the SPs' traffic is encrypted, the NO's allocation strategy relies solely on the amount of traffic it measures. In this exploratory work, we address this problem via Reinforcement Learning (RL). RL has mainly been intended to be trained in simulation before being applied in real scenarios. We instead employ RL online, training it directly while it optimizes resource allocation. An important factor, which we call perturbation cost, emerges in this case: to learn how to optimize a system, we need to perturb it and measure its reaction. While this perturbation cost has no physical meaning when RL is trained in simulation, it cannot be ignored when it is paid by the real system. We explore the trade-off between perturbing the system heavily to learn faster how to optimize the allocation, and learning more slowly to reduce the perturbation cost. In our case study, the allocated resource is storage. We show simulation results and make the entire code available as open source.
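The trade-off described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm or code: it is a minimal tabular Q-learning loop under assumptions made up for illustration. Two SPs share `total_slots` storage slots, the operational cost follows a hypothetical diminishing-returns curve with added noise, and every slot moved between SPs incurs a hypothetical `move_cost`. The parameter `step_size` controls how strongly each action perturbs the system.

```python
import random


def miss_cost(slots_sp1, total_slots, rng):
    """Hypothetical noisy operational cost observed by the NO.

    Assumption for illustration: SP1 carries 70% of the traffic, SP2 30%,
    and each SP's miss ratio decays as 1 / (slots + 1).
    """
    slots_sp2 = total_slots - slots_sp1
    cost = 0.7 / (slots_sp1 + 1) + 0.3 / (slots_sp2 + 1)
    return cost + rng.gauss(0.0, 0.01)


def train_online(step_size, total_slots=20, steps=2000, epsilon=0.1,
                 alpha=0.1, gamma=0.9, move_cost=0.01, seed=0):
    """Toy online Q-learning; returns (final slots for SP1,
    cumulative operational cost, cumulative perturbation cost)."""
    rng = random.Random(seed)
    actions = (-step_size, 0, step_size)  # slots moved towards SP1
    q = {}  # (state, action) -> estimated value, missing entries count as 0
    state = total_slots // 2
    total_operational = 0.0
    total_perturbation = 0.0
    for _ in range(steps):
        valid = [a for a in actions if 0 <= state + a <= total_slots]
        if rng.random() < epsilon:
            action = rng.choice(valid)  # explore
        else:
            action = max(valid, key=lambda a: q.get((state, a), 0.0))  # exploit
        next_state = state + action
        operational = miss_cost(next_state, total_slots, rng)
        perturbation = move_cost * abs(action)  # paid by the live system
        reward = -(operational + perturbation)
        next_valid = [a for a in actions if 0 <= next_state + a <= total_slots]
        best_next = max(q.get((next_state, a), 0.0) for a in next_valid)
        old = q.get((state, action), 0.0)
        q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
        total_operational += operational
        total_perturbation += perturbation
        state = next_state
    return state, total_operational, total_perturbation


if __name__ == "__main__":
    for step in (1, 4):
        print(step, train_online(step_size=step))
```

Running the script with different `step_size` values prints the two cumulative cost terms separately, so the operational cost and the perturbation cost can be compared for small and large perturbations. All numeric values above are placeholders, not figures from the paper.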

Domains

Networking and Internet Architecture [cs.NI]; Artificial Intelligence [cs.AI]

Main file

Learning_Fast_is_Painful(7).pdf (266.31 KB)

Origin: Files produced by the author(s)

Contributor: Andrea Araldo

https://hal.science/hal-02542133

Submitted on: Tuesday, April 14, 2020, 14:44:53

Last modified on: Wednesday, June 21, 2023, 11:42:05

Dates and versions

hal-02542133, version 1 (14-04-2020)

hal-02542133, version 2 (30-08-2020)

Identifiers

HAL Id: hal-02542133, version 1

Cite

Theo Bouganim, Andrea Araldo, Antoine Lavignotte, Nessim Oussedik, Gabriel Guez. Learning fast is painful: reinforcement learning for edge computing allocation. 2020. ⟨hal-02542133v1⟩


188 views

96 downloads
