Preprints, Working Papers, ...

SPT – Summary Prefix Tree: An over DHT Indexing Data Structure for Efficient Superset Search

Abstract : This paper presents the summary prefix tree (SPT), a trie data structure that supports efficient superset searches over a DHT. Each document is summarized by a Bloom filter, which SPT then uses to index the document. SPT implements a hybrid lookup procedure that is well adapted to sparse indexing keys such as Bloom filters. We also propose a mapping function that mitigates the skewness of SPT caused by the sparsity of Bloom filters, especially when they contain only a few words. To perform superset searches, SPT maintains on each node a local view of the global tree. The main contributions are the following. First, the approximation of the superset relationship among keyword sets by the descendant relationship among Bloom filters. Second, the use of a summary prefix tree, a trie indexing data structure, for keyword-based search over a DHT. Third, a hybrid lookup procedure that exploits the sparsity of Bloom filters to offer good performance. Finally, an algorithm that exploits SPT to efficiently find descriptions that are supersets of the query keywords.
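
The first contribution rests on a general property of Bloom filters: if a query keyword set is contained in a document keyword set, every bit set in the query filter is also set in the document filter. The Python sketch below illustrates only that property. It is not the authors' SPT: the filter width M, the hash count K, the salted SHA-256 hashing, and all function names are assumptions made for illustration, and a linear scan stands in for the trie and DHT traversal, which this record does not describe.

```python
import hashlib

M = 64  # filter width in bits (assumed value)
K = 3   # number of hash functions (assumed value)


def bit_positions(word, m=M, k=K):
    """Derive k bit positions for a word from salted SHA-256 digests."""
    positions = []
    for i in range(k):
        digest = hashlib.sha256(f"{i}:{word}".encode("utf-8")).digest()
        positions.append(int.from_bytes(digest[:8], "big") % m)
    return positions


def bloom(words, m=M, k=K):
    """Summarize a keyword set as an integer used as a bit vector."""
    bits = 0
    for word in words:
        for pos in bit_positions(word, m, k):
            bits |= 1 << pos
    return bits


def may_be_superset(doc_filter, query_filter):
    """True when every bit of the query filter is set in the document filter."""
    return (doc_filter & query_filter) == query_filter


def superset_search(documents, query_words):
    """Return (candidates, exact) document ids for a superset query.

    candidates: ids whose filter covers the query filter (may contain
                false positives, never misses a true superset).
    exact:      candidates confirmed against the real keyword sets.
    """
    query = set(query_words)
    query_filter = bloom(query)
    candidates = [
        doc_id
        for doc_id, words in documents.items()
        if may_be_superset(bloom(words), query_filter)
    ]
    exact = [doc_id for doc_id in candidates if query <= set(documents[doc_id])]
    return candidates, exact


if __name__ == "__main__":
    docs = {
        "d1": {"dht", "trie", "bloom"},
        "d2": {"dht", "index"},
        "d3": {"bloom", "filter"},
    }
    candidates, exact = superset_search(docs, {"dht"})
    print("candidates:", sorted(candidates))
    print("exact:", sorted(exact))
```

The filter test can report a document that does not actually contain all query keywords, which is why the sketch confirms candidates against the stored keyword sets; it can never miss a true superset.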

https://hal.archives-ouvertes.fr/hal-01757074
Contributor : Bassirou Ngom
Submitted on : Tuesday, April 3, 2018 - 1:15:05 PM
Last modification on : Friday, July 5, 2019 - 3:26:03 PM

File

spt-arima.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01757074, version 1

Citation

Bassirou Ngom, Mesaac Makpangou, Samba Ndiaye. SPT – Summary Prefix Tree: An over DHT Indexing Data Structure for Efficient Superset Search. 2018. ⟨hal-01757074⟩
