SQL Query Completion for Data Exploration

Marie Le Guilly 1 Jean-Marc Petit 1 Vasile-Marian Scuturici 1
1 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : Within the big data tsunami, relational databases and SQL are still there and remain mandatory in most of cases for accessing data. On the one hand, SQL is easy-to-use by non specialists and allows to identify pertinent initial data at the very beginning of the data exploration process. On the other hand, it is not always so easy to formulate SQL queries: nowadays, it is more and more frequent to have several databases available for one application domain, some of them with hundreds of tables and/or attributes. Identifying the pertinent conditions to select the desired data, or even identifying relevant attributes is far from trivial. To make it easier to write SQL queries, we propose the notion of SQL query completion: given a query, it suggests additional conditions to be added to its WHERE clause. This completion is semantic, as it relies on the data from the database, unlike current completion tools that are mostly syntactic. Since the process can be repeated over and over again – until the data analyst reaches her data of interest –, SQL query completion facilitates the exploration of databases. SQL query completion has been implemented in a SQL editor on top of a database management system. For the evaluation, two questions need to be studied: first, does the completion speed up the writing of SQL queries? Second , is the completion easily adopted by users? A thorough experiment has been conducted on a group of 70 computer science students divided in two groups (one with the completion and the other one without) to answer those questions. The results are positive and very promising.
Type de document :
Rapport
[Research Report] INSA Lyon. 2018
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01703346
Contributeur : Marie Le Guilly <>
Soumis le : mercredi 7 février 2018 - 17:36:06
Dernière modification le : jeudi 22 février 2018 - 13:06:09

Fichiers

main2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01703346, version 1
  • ARXIV : 1802.02872

Citation

Marie Le Guilly, Jean-Marc Petit, Vasile-Marian Scuturici. SQL Query Completion for Data Exploration. [Research Report] INSA Lyon. 2018. 〈hal-01703346〉

Partager

Métriques

Consultations de la notice

59

Téléchargements de fichiers

5