Skip to Main content Skip to Navigation
Conference papers

Towards schema-independent querying on document data stores

Abstract : Documents are a pervasive semi-structured data model in today's web and internet of things applications where the data structure is rapidly evolving over time. NoSQL documents stores are well tailored to efficiently load and manage large amounts of heterogeneous documents without any prior structure validations or constraints. However, this flexibility becomes a serious challenge while querying data from a heterogeneous collection of documents. Hence, it is mandatory to modify existing queries or add new ones whenever new structures are introduced in the collection. In this paper, we propose a novel approach to enable transparent querying over a heterogeneous collection of documents. We offer an automatic query enrichment mechanism that benefits from a pre-materialized dictionary gathering different possible underlying document structures. The query enrichment is automated via query operators rewriting algorithms. Also, we refer to a set of experiments to evaluate the performances of our approach over a synthetic datas
Document type :
Conference papers
Complete list of metadata

Cited literature [24 references]  Display  Hide  Download
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Tuesday, November 5, 2019 - 12:39:46 PM
Last modification on : Wednesday, June 9, 2021 - 10:00:32 AM
Long-term archiving on: : Friday, February 7, 2020 - 10:11:38 AM


Publisher files allowed on an open archive


  • HAL Id : hal-02348159, version 1
  • OATAO : 22348


Hamdi Ben Hamadou, Faïza Ghozzi Jedidi, André Péninou, Olivier Teste. Towards schema-independent querying on document data stores. 20th International Workshop On Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2018), Mar 2018, Vienna, Austria. pp.1-10. ⟨hal-02348159⟩



Record views


Files downloads