4464 articles – 13152 Notices  [english version]
HAL : hal-00483535, version 1

Fiche détaillée  Récupérer au format
International Conference on Management of Emergent Digital EcoSystems, Lyon : France (2009)
Efficient super-peer-based queries routing
Anis Ismail 1, Mohamed Quafafou 1, Gilles Nachouki 2, Mohammad Hajjar
(27/10/2009)

Peer-to-peer (P2P) Data-sharing systems now generate a significant portion of internet traffic. P2P systems have emerged as a popular way to share huge volumes of data. Requirements for widely distributed information systems supporting virtual organizations have given rise to a new category of P2P systems called schema-based. In such systems each peer is a database management system in itself, exposing its own schema. In such settings, the main objective is the efficient search across peer databases by processing each incoming query without overly consuming bandwidth. The usability of these systems depends on effective techniques to find and retrieve data; however, efficient and effective routing of content-based queries is an emerging problem in P2P networks. In this paper, we propose an architecture, based on (super-)peers, and we focus on query routing. Our approach considers that (super-)Peers having similar interests are grouped together for an efficient query routing method. In such groups, called knowledge-super-peers (KSP), super-peers submit queries that are often processed by members of this group. A KSP is a specific super-peer which contains knowledge about: 1. its super-peers and 2. the others super-peers. Knowledge is extracted by using data mining techniques (e.g. decision tree algorithms) starting from queries of peers that transit on the network. The advantage of this distributed knowledge is that, it avoids to making semantic mapping, between heterogeneous data sources owned by (super-)peers, each time the system decides to route query to other (super-)peers. The set of KSP improves the robustness in queries routing mechanism and scalability in P2P Network. Compared with a baseline approach, our proposal shows a better performance with respect to important criteria such as response time, precision and recall.
1 :  Laboratoire des Sciences de l'Information et des Systèmes (LSIS)
CNRS : UMR6168 – Arts et Métiers ParisTech – Université Paul Cézanne - Aix-Marseille III – Université de la Méditerranée - Aix-Marseille II – Université de Provence - Aix-Marseille I – Université Sud Toulon Var
2 :  Laboratoire d'Informatique de Nantes Atlantique (LINA)
CNRS : UMR6241 – Université de Nantes – École Nationale Supérieure des Mines - Nantes
Informatique/Base de données

Informatique/Recherche d'information