Hybrid Molecule-based Information Retrieval

The increased availability of interdependent heterogeneous data generated from different sources is fostering the incorporation of semantic knowledge-based graphs and ontologies in information management and search applications. Most of the existing Information Retrieval systems mainly focus on the semantic analysis of the information contained in heterogeneous data. In their results, they provide documents as query answers without considering (i) detailed information regarding relevant granularity levels of the documents, and most importantly (ii) dependencies between the documents or parts of the documents. To overcome these limitations, we propose a graph-based search and ranking algorithm within a generic framework that retrieves the data in the form of a novel augmented data structure for query answers, which we call hybrid molecules. The latter consist of well-defined subgraphs representing relevant contextual information regarding domain-specific information coupled with structural information related to the document. This improves the search results and reduces users’ efforts in tracking and interpreting them. Experiments conducted on real world data corpus using projects from the building construction industry validate the effectiveness of our approach.

Mots clés

Tightly Coupled Semantic Graphs, Hybrid Molecules, CCS CONCEPTS • Information systems → Information retrieval Enterprise search

Document representation, Ontologies

Domaines

Multimédia [cs.MM] Web Recherche d'information [cs.IR]

Fichier principal

charbel2019-accepted.pdf (979.58 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sébastien Laborie : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02077488

Soumis le : lundi 29 avril 2019-15:49:18

Dernière modification le : lundi 7 novembre 2022-17:24:33

Dates et versions

hal-02077488 , version 1 (29-04-2019)

Identifiants

HAL Id : hal-02077488 , version 1
DOI : 10.1145/3297280.3297358

Citer

Nathalie Charbel, Christian Sallaberry, Sébastien Laborie, Richard Chbeir. Hybrid Molecule-based Information Retrieval. The 34th ACM/SIGAPP Symposium On Applied Computing (ACM SAC 2019), Apr 2019, Limassol, Cyprus. pp.808-815, ⟨10.1145/3297280.3297358⟩. ⟨hal-02077488⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PAU LIUPPA

77 Consultations

153 Téléchargements