Conference paper, year: 2018

Modeling Data Lake Metadata with a Data Vault

Iuri Nogueira
  • Role: Author
  • PersonId: 1025027
Maram Romdhane
  • Role: Author
  • PersonId: 1025028

Abstract

With the rise of big data, business intelligence had to find solutions to manage data of even greater volume and variety than data warehouses, which proved ill-suited, could handle. Data lakes answer these needs from a storage point of view, but require managing adequate metadata to guarantee efficient access to data. Starting from a multidimensional metadata model that was designed for an industrial heritage data lake and that lacks schema evolvability, we propose in this paper to use ensemble modeling, and more precisely a data vault, to address this issue. To illustrate the feasibility of this approach, we instantiate our metadata conceptual model into relational and document-oriented logical and physical models. We also compare the physical models in terms of metadata storage and query response time.
Main file
vaultlake-ideas2018.pdf (396.55 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01788036, version 1 (10-07-2018)

License

Attribution

Identifiers

Cite

Iuri Nogueira, Maram Romdhane, Jérôme Darmont. Modeling Data Lake Metadata with a Data Vault. 22nd International Database Engineering & Applications Symposium (IDEAS 2018), Jun 2018, Villa San Giovanni, Italy. pp.253-261. ⟨hal-01788036⟩
201 views
5101 downloads
