EgoMap: Projective mapping and structured egocentric memory for Deep RL

Tasks involving localization, memorization and planning in partially observable 3D environments are an ongoing challenge in Deep Reinforcement Learning. We present EgoMap, a spatially structured neural memory architecture. EgoMap augments a deep reinforcement learning agent's performance in 3D environments on challenging tasks with multi-step objectives. The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map. The map is updated with ego-motion measurements through a differentiable affine transform. We show this architecture outperforms both standard recurrent agents and state of the art agents with structured memory. We demonstrate that incorporating these inductive biases into an agent's architecture allows for stable training with reward alone, circumventing the expense of acquiring and labelling expert trajectories. A detailed ablation study demonstrates the impact of key aspects of the architecture and through extensive qualitative analysis, we show how the agent exploits its structured internal memory to achieve higher performance.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Intelligence artificielle [cs.AI]

Christian Wolf : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02864146

Soumis le : mercredi 10 juin 2020-21:34:07

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-02864146 , version 1 (10-06-2020)

Identifiants

HAL Id : hal-02864146 , version 1
ARXIV : 2002.02286
DOI : 10.1007/978-3-030-67661-2_31

Citer

Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf. EgoMap: Projective mapping and structured egocentric memory for Deep RL. ECML-PKDD 2020 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2020, ghent, Belgium. pp.1-12, ⟨10.1007/978-3-030-67661-2_31⟩. ⟨hal-02864146⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INRIA2 CITI INSA-GROUPE UDL

237 Consultations

0 Téléchargements