Stage-Wise Learning of Reaching Using Little Prior Knowledge

François de La Bourdonnaye; Céline Teulière; Jochen Triesch; Thierry Chateau

doi:10.3389/frobt.2018.00110

Article Dans Une Revue Frontiers in Robotics and AI Année : 2018

Stage-Wise Learning of Reaching Using Little Prior Knowledge

(1) , (1) , (2) , (1)

1
2

François de La Bourdonnaye

Fonction : Auteur

Institut Pascal

Céline Teulière

Fonction : Auteur
PersonId : 8681
IdHAL : cteuliere
IdRef : 149645163

Institut Pascal

Jochen Triesch

Fonction : Auteur

Frankfurt Institute for Advanced Studies

Thierry Chateau

Fonction : Auteur
PersonId : 8056
IdHAL : thierry-chateau
IdRef : 154402176

Institut Pascal

Résumé

In some manipulation robotics environments, because of the difficulty of precisely modeling dynamics and computing features which describe well the variety of scene appearances, hand-programming a robot behavior is often intractable. Deep reinforcement learning methods partially alleviate this problem in that they can dispense with hand-crafted features for the state representation and do not need pre-computed dynamics. However, they often use prior information in the task definition in the form of shaping rewards which guide the robot toward goal state areas but require engineering or human supervision and can lead to sub-optimal behavior. In this work we consider a complex robot reaching task with a large range of initial object positions and initial arm positions and propose a new learning approach with minimal supervision. Inspired by developmental robotics, our method consists of a weakly-supervised stage-wise procedure of three tasks. First, the robot learns to fixate the object with a 2-camera system. Second, it learns hand-eye coordination by learning to fixate its end-effector. Third, using the knowledge acquired in the previous steps, it learns to reach the object at different positions and from a large set of initial robot joint angles. Experiments in a simulated environment show that our stage-wise framework yields similar reaching performances, compared with a supervised setting without using kinematic models, hand-crafted features, calibration parameters or supervised visual modules.

Mots clés

deep reinforcement learning weakly-supervised stage-wise learning manipulation robotics hierarchical learning

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

frobt-05-00110.pdf (2.07 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Céline Teulière : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01892940

Soumis le : mercredi 19 décembre 2018-12:25:46

Dernière modification le : samedi 22 avril 2023-04:30:10

Archivage à long terme le : mercredi 20 mars 2019-19:25:24

Dates et versions

hal-01892940 , version 1 (19-12-2018)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-01892940 , version 1
DOI : 10.3389/frobt.2018.00110

Citer

François de La Bourdonnaye, Céline Teulière, Jochen Triesch, Thierry Chateau. Stage-Wise Learning of Reaching Using Little Prior Knowledge. Frontiers in Robotics and AI, 2018, 5, ⟨10.3389/frobt.2018.00110⟩. ⟨hal-01892940⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PRES_CLERMONT CNRS INSTITUT_PASCAL

72 Consultations

151 Téléchargements

Stage-Wise Learning of Reaching Using Little Prior Knowledge

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager