Deep active learning with simulated rationales for text classification

Paul Guélorget; Bruno Grilheres; Titus Zaharia

doi:10.1007/978-3-030-59830-3_32

Communication Dans Un Congrès Année : 2020

Deep active learning with simulated rationales for text classification

(1, 2) , (2) , (1, 3, 4)

1
2
3
4

Paul Guélorget

Fonction : Auteur

Institut Polytechnique de Paris

Airbus Defence and Space [Les Mureaux]

Bruno Grilheres

Fonction : Auteur

Airbus Defence and Space [Les Mureaux]

Titus Zaharia

Fonction : Auteur
PersonId : 751841
IdHAL : titus-zaharia

Institut Polytechnique de Paris

Département Advanced Research And Techniques For Multidimensional Imaging Systems

ARMEDIA

Résumé

Neural networks have become a preferred tool for text classification tasks, demonstrating state of the art performances when trained on a large set of labeled data. However, in an early active learning setup, the scarcity of the ground-truth labels available severely penalizes the generalization capability of the neural network. In order to overcome such limitations, in this paper, we introduce a new learning strategy, which consist of inserting in the early stages of the learning process some additional, local and salient knowledge, presented under the form of simulated, human like rationales. We show how such knowledge can be automatically extracted from documents by analyzing the class activation maps of a convolutional neural network. The experimental results obtained demonstrate that the exploitation of such rationales permits to significantly speed-up the learning process, with a spectacular increase of the accuracy rates, starting from a very reduced number of documents (10–20).

Mots clés

Deep neural networks Active learning Rationales Class activation maps Text classification

Domaines

Traitement du signal et de l'image [eess.SP]

Titus Zaharia : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03126814

Soumis le : lundi 1 février 2021-09:52:32

Dernière modification le : vendredi 10 mars 2023-15:42:56

Dates et versions

hal-03126814 , version 1 (01-02-2021)

Identifiants

HAL Id : hal-03126814 , version 1
DOI : 10.1007/978-3-030-59830-3_32

Citer

Paul Guélorget, Bruno Grilheres, Titus Zaharia. Deep active learning with simulated rationales for text classification. ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence:, Oct 2020, Zhongshan (online), China. pp.363-379, ⟨10.1007/978-3-030-59830-3_32⟩. ⟨hal-03126814⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM TELECOM-SUDPARIS IP_PARIS

45 Consultations

0 Téléchargements

Deep active learning with simulated rationales for text classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager