Transfer Reinforcement Learning with Shared Dynamics

Romain Laroche; Merwan Barlier

Communication Dans Un Congrès Année : 2017

Transfer Reinforcement Learning with Shared Dynamics

(1) , (1, 2)

1
2

Romain Laroche

Fonction : Auteur
PersonId : 1011042

Orange Labs [Issy les Moulineaux]

Merwan Barlier

Fonction : Auteur
PersonId : 972744

Orange Labs [Issy les Moulineaux]

Sequential Learning

Résumé

This article addresses a particular Transfer Reinforcement Learning (RL) problem: when dynamics do not change from one task to another, and only the reward function does. Our method relies on two ideas, the first one is that transition samples obtained from a task can be reused to learn on any other task: an immediate reward estimator is learnt in a supervised fashion and for each sample, the reward entry is changed by its reward estimate. The second idea consists in adopting the optimism in the face of uncertainty principle and to use upper bound reward estimates. Our method is tested on a navigation task, under four Transfer RL experimental settings: with a known reward function, with strong and weak expert knowledge on the reward function, and with a completely unknown reward function. It is also evaluated in a Multi-Task RL experiment and compared with the state-of-the-art algorithms. Results reveal that this method constitutes a major improvement for transfer/multi-task problems that share dynamics.

Domaines

Intelligence artificielle [cs.AI] Apprentissage [cs.LG]

Fichier principal

aaai-multi-task(1).pdf (1.4 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Merwan Barlier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01548649

Soumis le : mercredi 9 août 2017-16:34:44

Dernière modification le : mercredi 24 janvier 2024-09:54:23

Archivage à long terme le : mercredi 17 janvier 2018-20:24:08

Dates et versions

hal-01548649 , version 1 (09-08-2017)

Licence

Identifiants

HAL Id : hal-01548649 , version 1

Citer

Romain Laroche, Merwan Barlier. Transfer Reinforcement Learning with Shared Dynamics. AAAI-17 - Thirty-First AAAI Conference on Artificial Intelligence, Feb 2017, San Francisco, United States. pp.7. ⟨hal-01548649⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

305 Consultations

103 Téléchargements

Transfer Reinforcement Learning with Shared Dynamics

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager