An Approach to Manage Semantic Heterogeneity in Unstructured P2P Information Retrieval Systems

Thomas Cerqueus 1 Sylvie Cazalens 2 Philippe Lamarre 3
1 GDD - Gestion de Données Distribuées [Nantes]
LINA - Laboratoire d'Informatique de Nantes Atlantique
3 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In unstructured information retrieval P2P systems, semantic heterogeneity comes from the use of different ontologies. Semantic interoperability refers to the ability of peers to communicate with each others. We take into account these notions separately, as raising two different problems. Hence we propose two independent and complementary solutions. The GoOD-TA protocol aims at reducing heterogeneity through ontology-driven topology adaptation. DiQuESh is a top-k algorithm for distributed information retrieval that is intended to ensure interoperability. This distinction enables highlighting their respective benefits on the IR performances and leads to a modular architecture. For our experiments we obtained a set of actively used real-world ontologies through the NCBO BioPortal. We implemented GoOD-TA and DiQuESH in Java and used the PeerSim simulator. We first show that GoOD-TA nicely reduces the semantic heterogeneity related to the system topology, handles the evolution of peers' descriptors, and is suitable for dynamic systems. Then, GoOD-TA and DiQuESh are run simultaneously, with a significant increase of precision and recall. This enables to identify the indirect contribution of heterogeneity reduction obtained with GoOD-TA to improving interoperability.
Complete list of metadatas

Cited literature [26 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00731848
Contributor : Thomas Cerqueus <>
Submitted on : Thursday, September 13, 2012 - 4:02:16 PM
Last modification on : Wednesday, November 20, 2019 - 2:46:50 AM
Long-term archiving on : Friday, December 14, 2012 - 3:58:03 AM

File

Main.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-00731848, version 1

Citation

Thomas Cerqueus, Sylvie Cazalens, Philippe Lamarre. An Approach to Manage Semantic Heterogeneity in Unstructured P2P Information Retrieval Systems. IEEE International Conference on Peer-to-Peer Computing, Sep 2012, Tarragona, Spain. pp.178. ⟨hal-00731848⟩

Share

Metrics

Record views

973

Files downloads

375