Hybrid Molecule-based Information Retrieval

Abstract: The increased availability of interdependent heterogeneous data generated from different sources is fostering the incorporation of semantic knowledge-based graphs and ontologies in information management and search applications. Most existing Information Retrieval systems focus on the semantic analysis of the information contained in heterogeneous data. In their results, they return documents as query answers without considering (i) detailed information regarding relevant granularity levels of the documents, and most importantly (ii) dependencies between the documents or parts of the documents. To overcome these limitations, we propose a graph-based search and ranking algorithm within a generic framework that retrieves the data in the form of a novel augmented data structure for query answers, which we call hybrid molecules. These consist of well-defined subgraphs representing relevant domain-specific contextual information coupled with structural information related to the document. This improves the search results and reduces users' effort in tracking and interpreting them. Experiments conducted on a real-world data corpus using projects from the building construction industry validate the effectiveness of our approach.

https://hal.archives-ouvertes.fr/hal-02077488
Contributor: Sébastien Laborie
Submitted on: Monday, April 29, 2019 - 3:49:18 PM
Last modification on: Thursday, May 2, 2019 - 10:32:56 AM

File

charbel2019-accepted.pdf
Files produced by the author(s)

Citation

Nathalie Charbel, Christian Sallaberry, Sébastien Laborie, Richard Chbeir. Hybrid Molecule-based Information Retrieval. The 34th ACM/SIGAPP Symposium On Applied Computing (ACM SAC 2019), Apr 2019, Limassol, Cyprus. pp.808-815, ⟨10.1145/3297280.3297358⟩. ⟨hal-02077488⟩
