DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

Theo Jaunet; Romain Vuillemot; Christian Wolf

doi:10.1111/cgf.13962

Article Dans Une Revue Computer graphics Forum (Proc. Eurovis) Année : 2020

DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

(1) , (1) , (2)

1
2

Theo Jaunet

Fonction : Auteur
PersonId : 175615
IdHAL : theo-jaunet
ORCID : 0000-0003-3081-5123

Situated Interaction, Collaboration, Adaptation and Learning

Romain Vuillemot

Fonction : Auteur
PersonId : 2912
IdHAL : romain-vuillemot
ORCID : 0000-0003-1447-6926
IdRef : 155739948

Situated Interaction, Collaboration, Adaptation and Learning

Christian Wolf

Fonction : Auteur
PersonId : 3860
IdHAL : christian-wolf
ORCID : 0000-0001-9766-3211
IdRef : 083311696

Extraction de Caractéristiques et Identification

Résumé

We present DRLViz, a visual analytics interface to interpret the internal memory of an agent (e.g. a robot) trained using deep reinforcement learning. This memory is composed of large temporal vectors updated when the agent moves in an environment and is not trivial to understand due to the number of dimensions, dependencies to past vectors, spatial/temporal correlations, and co-correlation between dimensions. It is often referred to as a black box as only inputs (images) and outputs (actions) are intelligible for humans. Using DRLViz, experts are assisted to interpret decisions using memory reduction interactions, and to investigate the role of parts of the memory when errors have been made (e.g. wrong direction). We report on DRLViz applied in the context of video games simulators (ViZDoom) for a navigation scenario with item gathering tasks. We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explainability in the field of visual analytics.

Domaines

Intelligence artificielle [cs.AI] Interface homme-machine [cs.HC]

Fichier principal

DRLViz_preprint.pdf (5.45 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Christian Wolf : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02864138

Soumis le : lundi 19 octobre 2020-11:16:47

Dernière modification le : mercredi 27 mars 2024-09:16:03

Archivage à long terme le : mercredi 20 janvier 2021-18:30:32

Dates et versions

hal-02864138 , version 1 (19-10-2020)

Identifiants

HAL Id : hal-02864138 , version 1
ARXIV : 1909.02982
DOI : 10.1111/cgf.13962

Citer

Theo Jaunet, Romain Vuillemot, Christian Wolf. DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning. Computer graphics Forum (Proc. Eurovis), 2020, ⟨10.1111/cgf.13962⟩. ⟨hal-02864138⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INSA-GROUPE UDL EC_LYON_STRICT

114 Consultations

72 Téléchargements

DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager