Conference paper, Year: 2019

On Complex Value Relations in Hive

Abstract

In this paper, we raise the question of how data architects model their data for processing in Apache Hive. This well-known SQL-on-Hadoop engine supports complex value relations, where attribute types need not be atomic. In fact, this feature appears to be one of Hive's prominent selling points, e.g., in Hive reference books. In an empirical study, we analyze Hive schemas in open source repositories. We examine to what extent practitioners make use of complex value relations and, accordingly, whether they write queries over complex types. Understanding which features are actively used will help in making the right decisions when setting up benchmarks for SQL-on-Hadoop engines, as well as in choosing which query operators to optimize.
Main file
19mobid.pdf (344.61 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02290732, version 1 (17-09-2019)

Identifiers

  • HAL Id: hal-02290732, version 1

Cite

Matthieu Pilven, Stefanie Scherzinger, Laurent d'Orazio. On Complex Value Relations in Hive. International Workshop on Modeling and Management of Big Data, Nov 2019, Salvador, Bahia, Brazil. ⟨hal-02290732⟩