Distillation of Weighted Automata from Recurrent Neural Networks using a Spectral Approach
Journal article in Machine Learning, 2020


Abstract

This paper is an attempt to bridge the gap between deep learning and grammatical inference. It provides an algorithm to extract a (stochastic) formal language from any recurrent neural network trained for language modelling. In detail, the algorithm uses the already trained network as an oracle (and thus does not require access to the inner representation of the black box) and applies a spectral approach to infer a weighted automaton. As weighted automata compute linear functions, they are computationally more efficient than neural networks, so the approach is in the spirit of knowledge distillation. We detail experiments on 62 data sets (both synthetic and from real-world applications) that allow an in-depth study of the abilities of the proposed algorithm. The results show that the weighted automata (WA) we extract are good approximations of the RNN, validating the approach. Moreover, we show how the process provides interesting insights into the behavior of RNNs learned on data, enlarging the scope of this work to that of explainability of deep learning models.
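The spectral approach the abstract refers to can be sketched as follows: query the trained network on concatenations of prefixes and suffixes to fill a Hankel matrix, take a truncated SVD to estimate its rank, and read off the initial, final, and transition weights of a WA. The sketch below is a minimal illustration under stated assumptions, not the paper's implementation: `rnn_oracle` is a hypothetical stand-in for the black-box RNN (here a simple rank-1 stochastic language so the recovery is exact), and the names `wa_eval`, `strings_up_to` are illustrative.

```python
import numpy as np
from itertools import product

ALPHABET = ["a", "b"]

def rnn_oracle(w):
    # Stand-in for the trained RNN language model queried as a black box.
    # Toy stochastic language: stop with prob 1/2, else emit a or b w.p. 1/4.
    return 0.5 * 0.25 ** len(w)

def strings_up_to(n):
    # All strings over ALPHABET of length <= n, including the empty string.
    out = [""]
    for k in range(1, n + 1):
        out += ["".join(t) for t in product(ALPHABET, repeat=k)]
    return out

P = S = strings_up_to(2)  # prefix and suffix basis (must contain the empty string)

# Hankel blocks filled by oracle queries: H[u, v] = f(uv), H_sigma[u, v] = f(u sigma v)
H = np.array([[rnn_oracle(p + s) for s in S] for p in P])
Hs = {a: np.array([[rnn_oracle(p + a + s) for s in S] for p in P]) for a in ALPHABET}
hP = np.array([rnn_oracle(p) for p in P])  # column for the empty suffix
hS = np.array([rnn_oracle(s) for s in S])  # row for the empty prefix

U, D, Vt = np.linalg.svd(H)
r = int((D > 1e-10).sum())      # numerical rank = number of WA states
V = Vt[:r].T                    # |S| x r

HV_pinv = np.linalg.pinv(H @ V)
alpha = hS @ V                              # initial weight vector (1 x r)
beta = HV_pinv @ hP                         # final weight vector   (r x 1)
A = {a: HV_pinv @ Hs[a] @ V for a in ALPHABET}  # one transition matrix per symbol

def wa_eval(w):
    # Evaluate the extracted WA: f(x1..xn) = alpha . A[x1] ... A[xn] . beta
    v = alpha
    for c in w:
        v = v @ A[c]
    return float(v @ beta)
```

Since the toy oracle has a Hankel matrix of rank 1 and the basis contains the empty string, the extracted one-state WA reproduces the oracle exactly; on a real RNN the same construction yields an approximation whose quality depends on the basis and the rank cutoff.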
Main file: 2009.13101.pdf (4.15 MB). Origin: files produced by the author(s).

Dates and versions

hal-03270667, version 1 (25-06-2021)

Identifiers

Cite

Rémi Eyraud, Stéphane Ayache. Distillation of Weighted Automata from Recurrent Neural Networks using a Spectral Approach. Machine Learning, 2020, ⟨10.1007/s10994-021-05948-1⟩. ⟨hal-03270667⟩