Journal article in Neural Networks, 2022

Explaining Aha! moments in artificial agents through IKE-XAI: Implicit Knowledge Extraction for eXplainable AI

Abstract

During the learning process, a child develops a mental representation of the task he or she is learning. A machine learning algorithm also develops a latent representation of the task it learns. We investigate the development of an artificial agent's knowledge construction through the analysis of its behavior, i.e., its sequences of moves while learning to perform the Tower of Hanoi (TOH) task. The TOH is a well-known task in experimental contexts for studying problem-solving processes as well as one of the fundamental processes of children's knowledge construction about their world. We position ourselves in the field of explainable reinforcement learning for developmental robotics, at the crossroads of cognitive modeling and explainable AI. Our main contribution is a three-step methodology named Implicit Knowledge Extraction with eXplainable Artificial Intelligence (IKE-XAI) to extract the implicit knowledge, in the form of an automaton, encoded by an artificial agent during its learning. We showcase this technique to solve and explain the TOH task when researchers only have access to moves that represent observational behavior, as in human-machine interaction. Therefore, to extract the knowledge acquired by the agent at different stages of its training, our approach combines: first, a Q-learning agent that learns to perform the TOH task; second, a trained recurrent neural network that encodes an implicit representation of the TOH task; and third, an XAI process using a post-hoc implicit rule extraction algorithm to extract finite state automata. We propose using graph representations as visual and explicit explanations of the behavior of the Q-learning agent. Our experiments show that the IKE-XAI approach helps in understanding the development of the Q-learning agent's behavior by providing a global explanation of its knowledge evolution during learning. IKE-XAI also allows researchers to identify the agent's Aha! moment by determining from what moment the knowledge representation stabilizes and the agent no longer learns.
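
The abstract describes a three-step pipeline. As a rough illustration of its first step only, the sketch below shows a tabular Q-learning agent learning the 3-disk TOH while logging its move sequences; the logged sequences (here called episodes_log) are the kind of observational data that the second (RNN) and third (automaton extraction) steps would consume. This is a minimal sketch under assumed settings (reward of +100 at the goal and -1 per move, illustrative hyperparameters, helper names such as legal_moves and step are hypothetical), not the authors' implementation.

# Minimal sketch of the first IKE-XAI stage (assumed settings, not the paper's code):
# a tabular Q-learning agent for the 3-disk Tower of Hanoi that logs move sequences.
import random
from collections import defaultdict

N_DISKS = 3
PEGS = (0, 1, 2)
START = (0,) * N_DISKS   # state = peg of each disk, disk 0 is the smallest
GOAL = (2,) * N_DISKS    # all disks moved to peg 2

def legal_moves(state):
    """Return (src, dst) pairs moving a topmost disk onto an empty peg or a larger disk."""
    moves = []
    for src in PEGS:
        on_src = [d for d in range(N_DISKS) if state[d] == src]
        if not on_src:
            continue
        top = min(on_src)                      # smallest disk index = topmost disk
        for dst in PEGS:
            if dst == src:
                continue
            on_dst = [d for d in range(N_DISKS) if state[d] == dst]
            if not on_dst or top < min(on_dst):
                moves.append((src, dst))
    return moves

def step(state, move):
    src, dst = move
    top = min(d for d in range(N_DISKS) if state[d] == src)
    new_state = tuple(dst if d == top else p for d, p in enumerate(state))
    reward = 100.0 if new_state == GOAL else -1.0   # assumed reward shaping
    return new_state, reward, new_state == GOAL

Q = defaultdict(float)                   # Q[(state, move)] -> estimated value
alpha, gamma, epsilon = 0.5, 0.9, 0.1    # assumed hyperparameters
episodes_log = []                        # move sequences later fed to the RNN step

for episode in range(2000):
    state, trajectory, done = START, [], False
    while not done and len(trajectory) < 200:
        moves = legal_moves(state)
        if random.random() < epsilon:
            move = random.choice(moves)                        # explore
        else:
            move = max(moves, key=lambda m: Q[(state, m)])     # exploit
        next_state, reward, done = step(state, move)
        best_next = 0.0 if done else max(Q[(next_state, m)] for m in legal_moves(next_state))
        Q[(state, move)] += alpha * (reward + gamma * best_next - Q[(state, move)])
        trajectory.append(move)
        state = next_state
    episodes_log.append(trajectory)

# Greedy rollout with the learned Q-table; the optimal solution for 3 disks is 7 moves.
state, greedy_path = START, []
while state != GOAL and len(greedy_path) < 50:
    move = max(legal_moves(state), key=lambda m: Q[(state, m)])
    state, _, _ = step(state, move)
    greedy_path.append(move)
print("greedy solution length:", len(greedy_path))

In the paper's pipeline, snapshots of such move sequences taken at different training stages are what the recurrent network encodes and from which finite state automata are then extracted post hoc.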
Main file
2021_Explaining_Aha_moments_in_artificial_agents_through_IKE_XAI (9).pdf (5.27 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03794946, version 1 (03-10-2022)

Identifiers

Cite

Ikram Chraibi Kaadoud, Adrien Bennetot, Barbara Mawhin, Vicky Charisi, Natalia Díaz-Rodríguez. Explaining Aha! moments in artificial agents through IKE-XAI: Implicit Knowledge Extraction for eXplainable AI. Neural Networks, 2022, 155, pp.95-118. ⟨10.1016/j.neunet.2022.08.002⟩. ⟨hal-03794946⟩