Deep active learning with simulated rationales for text classification - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Deep active learning with simulated rationales for text classification

Résumé

Neural networks have become a preferred tool for text classification tasks, demonstrating state of the art performances when trained on a large set of labeled data. However, in an early active learning setup, the scarcity of the ground-truth labels available severely penalizes the generalization capability of the neural network. In order to overcome such limitations, in this paper, we introduce a new learning strategy, which consist of inserting in the early stages of the learning process some additional, local and salient knowledge, presented under the form of simulated, human like rationales. We show how such knowledge can be automatically extracted from documents by analyzing the class activation maps of a convolutional neural network. The experimental results obtained demonstrate that the exploitation of such rationales permits to significantly speed-up the learning process, with a spectacular increase of the accuracy rates, starting from a very reduced number of documents (10–20).
Fichier non déposé

Dates et versions

hal-03126814 , version 1 (01-02-2021)

Identifiants

Citer

Paul Guélorget, Bruno Grilheres, Titus Zaharia. Deep active learning with simulated rationales for text classification. ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence:, Oct 2020, Zhongshan (online), China. pp.363-379, ⟨10.1007/978-3-030-59830-3_32⟩. ⟨hal-03126814⟩
45 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More