Enacting Data Science Pipelines for Exploring Graphs: From Libraries to Studios

his paper proposes a study of existing environments used to enact data science pipelines applied to graphs. Data science pipelines are a new form of queries combining classic graph operations with ar- artificial intelligence graph analytics operations. A pipeline defines a data flow consisting of tasks for querying, exploring and analysing graphs. The results of pipelines tasks can be subgraphs or they can be aggregation and statistic numbers describing graphs. Different environments and systems can be used for enacting pipelines. They range from graph NoSQL stores, programming languages extended with libraries providing graph processing and analytics functions, to full machine learning and artificial intelligence studios. The paper describes these environments and compares the design principles that they promote for enacting data science pipelines intended to query, process and explore graphs.

Mots clés

data science queries data analytics graph data flow graph analytics machine learning studio

Domaines

Base de données [cs.DB]

Genoveva Vargas-Solar : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03002550

Soumis le : jeudi 12 novembre 2020-19:36:58

Dernière modification le : jeudi 4 avril 2024-18:25:08

Dates et versions

hal-03002550 , version 1 (12-11-2020)

Identifiants

HAL Id : hal-03002550 , version 1
DOI : 10.1007/978-3-030-55814-7_23

Citer

Genoveva Vargas-Solar, José-Luis Zechinelli-Martini, Javier A Espinosa-Oviedo. Enacting Data Science Pipelines for Exploring Graphs: From Libraries to Studios. 24th European Conference on Advances in Databases and Information Systems, Aug 2020, Lyon, France. pp.271-280, ⟨10.1007/978-3-030-55814-7_23⟩. ⟨hal-03002550⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS UNIV-COMPIEGNE LIG LIG_SIDCH

46 Consultations

0 Téléchargements