Hybrid Molecule-based Information Retrieval

Abstract : The increased availability of interdependent heterogeneous data generated from different sources is fostering the incorporation of semantic knowledge-based graphs and ontologies in information management and search applications. Most of the existing Information Retrieval systems mainly focus on the semantic analysis of the information contained in heterogeneous data. In their results, they provide documents as query answers without considering (i) detailed information regarding relevant granularity levels of the documents, and most importantly (ii) dependencies between the documents or parts of the documents. To overcome these limitations, we propose a graph-based search and ranking algorithm within a generic framework that retrieves the data in the form of a novel augmented data structure for query answers, which we call hybrid molecules. The latter consist of well-defined subgraphs representing relevant contextual information regarding domain-specific information coupled with structural information related to the document. This improves the search results and reduces users’ efforts in tracking and interpreting them. Experiments conducted on real world data corpus using projects from the building construction industry validate the effectiveness of our approach.
Complete list of metadatas

Contributor : Sébastien Laborie <>
Submitted on : Monday, April 29, 2019 - 3:49:18 PM
Last modification on : Thursday, May 2, 2019 - 10:32:56 AM


Files produced by the author(s)




Nathalie Charbel, Christian Sallaberry, Sébastien Laborie, Richard Chbeir. Hybrid Molecule-based Information Retrieval. The 34th ACM/SIGAPP Symposium On Applied Computing (ACM SAC 2019), Apr 2019, Limassol, Cyprus. pp.808-815, ⟨10.1145/3297280.3297358⟩. ⟨hal-02077488⟩



Record views


Files downloads