On Ranking Relevant Entities in Heterogeneous Networks Using a Language-Based Model

Abstract : A new challenge, accessing multiple relevant entities, arises from the availability of linked heterogeneous data. In this article, we address more specifically the problem of accessing relevant entities, such as publications and authors within a bibliographic network, given an information need. We propose a novel algorithm, called BibRank, that estimates a joint relevance of documents and authors within a bibliographic network. This model ranks each type of entity using a score propagation algorithm with respect to the query topic and the structure of the underlying bi-type information entity network. Evidence sources, namely content-based and network-based scores, are both used to estimate the topical similarity between connected entities. For this purpose, authorship relationships are analyzed through a language model-based score on the one hand and on the other hand, non topically related entities of the same type are detected through marginal citations. The article reports the results of experiments using the Bibrank algorithm for an information retrieval task. The CiteSeerX bibliographic data set forms the basis for the topical query automatic generation and evaluation. We show that a statistically significant improvement over closely related ranking models is achieved.
Liste complète des métadonnées

Cited literature [39 references]  Display  Hide  Download

Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Thursday, February 19, 2015 - 3:50:22 PM
Last modification on : Friday, April 12, 2019 - 4:23:01 PM
Document(s) archivé(s) le : Wednesday, May 20, 2015 - 11:00:23 AM


Files produced by the author(s)




Laure Soulier, Lamjed Ben Jabeur, Lynda Tamine, Wahiba Bahsoun. On Ranking Relevant Entities in Heterogeneous Networks Using a Language-Based Model. Journal of the Association for Information Science and Technology, ASIS&T/Wiley, 2013, vol. 64 (n° 3), pp. 500-515. ⟨10.1002/asi.22762⟩. ⟨hal-01118673⟩



Record views


Files downloads