Schema-independent Querying for Heterogeneous Collections in NoSQL Document Stores - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Information Systems Année : 2019

Schema-independent Querying for Heterogeneous Collections in NoSQL Document Stores

Résumé

NoSQL document stores are well-tailored to efficiently load and manage massive collections of heterogeneous documents without any prior structural validation. However, this flexibility becomes a serious challenge when querying heterogeneous documents, and hence the user has to build complex queries or reformulate existing queries whenever new schemas are introduced in a collection. In this paper we propose a novel approach, based on formal foundations, for building schema-independent queries which are designed to query multi-structured documents. We present a query enrichment mechanism that consults a pre-constructed dictionary. This dictionary binds each possible path in the documents to all its corresponding absolute paths in all the documents. We automate the process of query reformulation via a set of rules that reformulate most document store operators, such as select, project, unnest, aggregate and lookup. We then produce queries across multi-structured documents which are compatible with the native query engine of the underlying document store. To evaluate our approach, we conducted experiments on synthetic datasets. Our results show that the induced overhead can be acceptable when compared to the efforts needed to restructure the data or the time required to execute several queries corresponding to the different schemas inside the collection.
Fichier principal
Vignette du fichier
S0306437918302990.pdf (773.64 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02976616 , version 1 (25-10-2021)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

Citer

Hamdi Ben Hamadou, Faiza Ghozzi, André Péninou, Olivier Teste. Schema-independent Querying for Heterogeneous Collections in NoSQL Document Stores. Information Systems, 2019, 85, pp.48-67. ⟨10.1016/j.is.2019.04.005⟩. ⟨hal-02976616⟩
85 Consultations
148 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More