Learning to combine primitive skills: A step towards versatile robotic manipulation

Robin Strudel; Alexander Pashevich; Igor Kalevatykh; Ivan Laptev; Josef Sivic; Cordelia Schmid

doi:10.1109/ICRA40945.2020.9196619

Communication Dans Un Congrès Année : 2020

Learning to combine primitive skills: A step towards versatile robotic manipulation

(1, 2) , (3) , (1, 2) , (1, 2) , (1, 2) , (3)

1
2
3

Robin Strudel

Fonction : Auteur

Models of visual object recognition and scene understanding

Université Paris Sciences et Lettres

Alexander Pashevich

Fonction : Auteur
PersonId : 1040983

Apprentissage de modèles à partir de données massives

Igor Kalevatykh

Fonction : Auteur

Models of visual object recognition and scene understanding

Université Paris Sciences et Lettres

Ivan Laptev

Fonction : Auteur
PersonId : 865349

Models of visual object recognition and scene understanding

Université Paris Sciences et Lettres

Josef Sivic

Fonction : Auteur

Models of visual object recognition and scene understanding

Université Paris Sciences et Lettres

Cordelia Schmid

Fonction : Auteur
PersonId : 831154

Apprentissage de modèles à partir de données massives

Résumé

Manipulation tasks such as preparing a meal or assembling furniture remain highly challenging for robotics and vision. Traditional task and motion planning (TAMP) methods can solve complex tasks but require full state observability and are not adapted to dynamic scene changes. Recent learning methods can operate directly on visual inputs but typically require many demonstrations and/or task-specific reward engineering. In this work we aim to overcome previous limitations and propose a reinforcement learning (RL) approach to task planning that learns to combine primitive skills. First, compared to previous learning methods, our approach requires neither intermediate rewards nor complete task demonstrations during training. Second, we demonstrate the versatility of our vision-based task planning in challenging settings with temporary occlusions and dynamic scene changes. Third, we propose an efficient training of basic skills from few synthetic demonstrations by exploring recent CNN architectures and data augmentation. Notably, while all of our policies are learned on visual inputs in simulated environments, we demonstrate the successful transfer and high success rates when applying such policies to manipulation tasks on a real UR5 robotic arm.

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV] Autres [stat.ML] Apprentissage [cs.LG]

Fichier principal

ICRA_2020.pdf (2.37 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Alexander Pashevich : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02274969

Soumis le : mercredi 30 septembre 2020-18:07:17

Dernière modification le : vendredi 19 avril 2024-16:18:58

Archivage à long terme le : lundi 4 janvier 2021-08:55:53

Dates et versions

hal-02274969 , version 1 (30-09-2020)

Identifiants

HAL Id : hal-02274969 , version 1
ARXIV : 1908.00722
DOI : 10.1109/ICRA40945.2020.9196619

Citer

Robin Strudel, Alexander Pashevich, Igor Kalevatykh, Ivan Laptev, Josef Sivic, et al.. Learning to combine primitive skills: A step towards versatile robotic manipulation. ICRA 2020 - IEEE International Conference on Robotics and Automation, May 2020, Paris / Virtuel, France. ⟨10.1109/ICRA40945.2020.9196619⟩. ⟨hal-02274969⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI INRIA2 GENCI LJK-GI-THOTH PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR PRAIRIE-IA UR1-MATH-NUM

379 Consultations

318 Téléchargements

Learning to combine primitive skills: A step towards versatile robotic manipulation

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager