Skip to Main content Skip to Navigation
Journal articles

Offline versus Online Representation Learning of Documents Using External Knowledge

Lynda Tamine 1 Laure Soulier 2 Gia-Hung Nguyen 1 Nathalie Souf 3 
1 IRIT-IRIS - Recherche d’Information et Synthèse d’Information
IRIT - Institut de recherche en informatique de Toulouse
3 IRIT-SIG - Systèmes d’Informations Généralisées
IRIT - Institut de recherche en informatique de Toulouse
Abstract : An intensive recent research work investigated the combined use of hand-curated knowledge resources and corpus-driven resources to learn effective text representations. The overall learning process could be run by online revising the learning objective or by offline refining an original learned representation. The differentiated impact of each of the learning approaches on the quality of the learned representations has not been studied so far in the literature. This article focuses on the design of comparable offline vs. online knowledge-enhanced document representation learning models and the comparison of their effectiveness using a set of standard IR and NLP downstream tasks. The results of quantitative and qualitative analyses show that (1) offline vs. online learning approaches have dissimilar result trends regarding the task as well as the dataset distribution counts with regard to domain application; (2) while considering external knowledge resources is undoubtedly beneficial, the way used to express relational constraints could affect semantic inference effectiveness. The findings of this work present opportunities for the design of future representation learning models, but also for providing insights about the evaluation of such models.
Document type :
Journal articles
Complete list of metadata
Contributor : Laure Soulier Connect in order to contact the contributor
Submitted on : Thursday, October 3, 2019 - 3:10:39 PM
Last modification on : Monday, July 4, 2022 - 9:32:48 AM



Lynda Tamine, Laure Soulier, Gia-Hung Nguyen, Nathalie Souf. Offline versus Online Representation Learning of Documents Using External Knowledge. ACM Transactions on Information Systems, Association for Computing Machinery, 2019, 37 (4), pp.42:1 - 42:34. ⟨10.1145/3349527⟩. ⟨hal-02304815⟩



Record views