Explaining Black Boxes on Sequential Data using Weighted Automata - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Explaining Black Boxes on Sequential Data using Weighted Automata

Résumé

Understanding how a learned black box works is of crucial interest for the future of Machine Learning. In this paper, we pioneer the question of the global interpretability of learned black box models that assign numerical values to symbolic sequential data. To tackle that task, we propose a spectral algorithm for the extraction of weighted automata (WA) from such black boxes. This algorithm does not require the access to a dataset or to the inner representation of the black box: the inferred model can be obtained solely by querying the black box, feeding it with inputs and analyzing its outputs. Experiments using Recurrent Neural Networks (RNN) trained on a wide collection of 48 synthetic datasets and 2 real datasets show that the obtained approximation is of great quality.
Fichier principal
Vignette du fichier
Explaining_black_box_on_sequential_data.pdf (1.64 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01888514 , version 1 (05-10-2018)

Identifiants

  • HAL Id : hal-01888514 , version 1

Citer

Stéphane Ayache, Rémi Eyraud, Noé Goudian. Explaining Black Boxes on Sequential Data using Weighted Automata. 14th International Conference on Grammatical Inference, Sep 2018, Wrocław,, Poland. ⟨hal-01888514⟩
105 Consultations
62 Téléchargements

Partager

Gmail Facebook X LinkedIn More