Emergent Proximo-Distal Maturation through Adaptive Exploration

Freek Stulp; Pierre-Yves Oudeyer

Communication Dans Un Congrès Année : 2012

Emergent Proximo-Distal Maturation through Adaptive Exploration

(1, 2) , (1, 2)

1
2

Freek Stulp

Fonction : Auteur
PersonId : 1420
IdHAL : freek-stulp
IdRef : 177920629

Flowing Epigenetic Robots and Systems

École Nationale Supérieure de Techniques Avancées

Pierre-Yves Oudeyer

Fonction : Auteur
PersonId : 6675
IdHAL : pyoudeyer
ORCID : 0000-0002-9404-7613
IdRef : 081674481

Flowing Epigenetic Robots and Systems

École Nationale Supérieure de Techniques Avancées

Résumé

Life-long robot learning in the high-dimensional real world requires guided and structured exploration mechanisms. In this developmental context, we investigate here the use of the recently proposed PI2-CMAES episodic reinforcement learning algorithm, which is able to learn high-dimensional motor tasks through adaptive control of exploration. By studying PI2-CMAES in a reaching task on a simulated arm, we observe two developmental properties. First, we show how PI2-CMAES autonomously and continuously tunes the global exploration/exploitation trade-off, allowing it to re-adapt to changing tasks. Second, we show how PI2-CMAES spontaneously self-organizes a maturational structure whilst exploring the degrees-of-freedom (DOFs) of the motor space. In particular, it automatically demonstrates the so-called proximo-distal maturation observed in humans: after first freezing distal DOFs while exploring predominantly the most proximal DOF, it progressively frees exploration in DOFs along the proximo-distal body axis. These emergent properties suggest the use of PI2-CMAES as a general tool for studying reinforcement learning of skills in life-long developmental learning contexts.

Domaines

Robotique [cs.RO]

Freek Stulp : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00789393

Soumis le : lundi 18 février 2013-10:59:36

Dernière modification le : mercredi 15 mars 2023-08:50:07

Dates et versions

hal-00789393 , version 1 (18-02-2013)

Identifiants

HAL Id : hal-00789393 , version 1

Citer

Freek Stulp, Pierre-Yves Oudeyer. Emergent Proximo-Distal Maturation through Adaptive Exploration. International Conference on Development and Learning (ICDL), 2012, United States. pp.0-0. ⟨hal-00789393⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENSTA INRIA ENSTA_U2IS INRIA2

79 Consultations

0 Téléchargements

Emergent Proximo-Distal Maturation through Adaptive Exploration

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager