A General Framework for Personalising Post Hoc Explanations through User Knowledge Integration

Adulam Jeyasothy; Thibault Laugel; Marie-Jeanne Lesot; Christophe Marsala; Marcin Detyniecki

doi:10.1016/j.ijar.2023.108944

Article Dans Une Revue International Journal of Approximate Reasoning Année : 2023

A General Framework for Personalising Post Hoc Explanations through User Knowledge Integration

(1) , (2) , (1) , (1) , (2, 3)

1
2
3

Adulam Jeyasothy

Fonction : Auteur

Learning, Fuzzy and Intelligent systems

Thibault Laugel

Fonction : Auteur

AXA France

Marie-Jeanne Lesot

Fonction : Auteur

Learning, Fuzzy and Intelligent systems

Christophe Marsala

Fonction : Auteur

Learning, Fuzzy and Intelligent systems

Marcin Detyniecki

Fonction : Auteur

AXA France

Polska Akademia Nauk = Polish Academy of Sciences = Académie polonaise des sciences

Résumé

The field of XAI aims at providing explanations about the behavior of AI methods to a user. In particular, local post-hoc interpretability approaches aim at generating explanations for a particular prediction of a trained machine learning model. It is generally recognized that such explanations should be adapted to each user: integrating user knowledge and taking into account the user specificity allows to provide personalized explanations and to improve the explanation understandability. Yet these elements appear to be rarely taken into account, and only in specific configurations. In this paper, we propose a general framework to allow this integration of user knowledge in post-hoc interpretability methods, relying on the addition of a compatibility term in the cost function. We instantiate the proposed formalization in two scenarios, varying in the explanation form they propose, in the case where the available user knowledge provides information about the data features. As a result, two new explainability methods are proposed, respectively named Knowledge Integration in Counterfactual Explanation (KICE) and Knowledge Integration in Surrogate Model (KISM). These methods are experimentally studied on several benchmark data sets to characterize the explanations they generate as compared to reference methods.

Mots clés

eXplainable Artificial Intelligence XAI user knowledge compatibility counterfactual explanation surrogate model explanations local feature importance

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

IJAR2023.pdf (583.76 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

adulam jeyasothy : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04337026

Soumis le : mardi 12 décembre 2023-09:01:49

Dernière modification le : jeudi 4 janvier 2024-22:26:03

Dates et versions

hal-04337026 , version 1 (12-12-2023)

Identifiants

HAL Id : hal-04337026 , version 1
DOI : 10.1016/j.ijar.2023.108944

Citer

Adulam Jeyasothy, Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki. A General Framework for Personalising Post Hoc Explanations through User Knowledge Integration. International Journal of Approximate Reasoning, 2023, 160, pp.108944. ⟨10.1016/j.ijar.2023.108944⟩. ⟨hal-04337026⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

24 Consultations

17 Téléchargements

A General Framework for Personalising Post Hoc Explanations through User Knowledge Integration

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager