Formalisation of metamorph Reinforcement Learning

Report (Technical Report) Year: 2018

Abstract

This technical report describes the formalisation of a particular Reinforcement Learning (RL) situation that we call "metamorph" (mRL). In this situation, the signature of the learner agent, i.e. its set of input, output and feedback slots, can change over the course of learning. RL can be viewed as signal processing, because the learner agent transforms the input and feedback signals it is continuously fed into output signals. The following formalisation is therefore concerned with the description of signals and the transformation of one signal into another. Since the signature of the agent is expected to change, we are also concerned with defining what a "signature" and a "signature change" are. In the first part, we describe the mRL learning context, i.e. how the metamorph agent is embedded into its environment and interacts with it. In the second part, we describe one generic example of a metamorph learner agent: a dynamical computational graph that could theoretically be used to control the agent. In the last part, we reformulate the classical problem of RL, i.e. "maximizing feedback", in terms of this formalised mRL.
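As a rough illustration of the notions mentioned in the abstract, the sketch below models an agent signature as a set of input, output and feedback slots, together with a "signature change" that adds or removes slots mid-learning. This is a hypothetical Python sketch for intuition only; the names (`Signature`, `change_signature`, the slot labels) are not taken from the report and do not reproduce its formalism.

```python
from dataclasses import dataclass

# Hypothetical illustration only: structure and names are assumptions,
# not the report's actual formalisation.
@dataclass(frozen=True)
class Signature:
    """The agent's interface at a given time: its input, output and feedback slots."""
    inputs: frozenset
    outputs: frozenset
    feedbacks: frozenset

def change_signature(sig: Signature, *, add_inputs=(), drop_inputs=(),
                     add_outputs=(), drop_outputs=()) -> Signature:
    """Return a new signature with some slots added or removed (a 'signature change')."""
    return Signature(
        inputs=(sig.inputs | frozenset(add_inputs)) - frozenset(drop_inputs),
        outputs=(sig.outputs | frozenset(add_outputs)) - frozenset(drop_outputs),
        feedbacks=sig.feedbacks,
    )

if __name__ == "__main__":
    sig = Signature(inputs=frozenset({"camera"}),
                    outputs=frozenset({"wheel_left", "wheel_right"}),
                    feedbacks=frozenset({"reward"}))
    # Mid-learning, the agent gains a sensor and loses an actuator.
    sig2 = change_signature(sig, add_inputs={"lidar"}, drop_outputs={"wheel_right"})
    print(sig2)
```

In this reading, a classical RL agent corresponds to the special case where the signature is fixed for the whole learning process, while an mRL agent must keep learning across such changes.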

Dates and versions

hal-01924642, version 1 (05-12-2018)

Identifiers

  • HAL Id: hal-01924642, version 1

Cite

Iago Bonnici, Abdelkader Gouaich, Fabien Michel. Formalisation of metamorph Reinforcement Learning. [Technical Report] LIRMM (UM, CNRS). 2018. ⟨hal-01924642⟩