Enacting Data Science Pipelines for Exploring Graphs: From Libraries to Studios - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Enacting Data Science Pipelines for Exploring Graphs: From Libraries to Studios

Résumé

his paper proposes a study of existing environments used to enact data science pipelines applied to graphs. Data science pipelines are a new form of queries combining classic graph operations with ar- artificial intelligence graph analytics operations. A pipeline defines a data flow consisting of tasks for querying, exploring and analysing graphs. The results of pipelines tasks can be subgraphs or they can be aggregation and statistic numbers describing graphs. Different environments and systems can be used for enacting pipelines. They range from graph NoSQL stores, programming languages extended with libraries providing graph processing and analytics functions, to full machine learning and artificial intelligence studios. The paper describes these environments and compares the design principles that they promote for enacting data science pipelines intended to query, process and explore graphs.
Fichier non déposé

Dates et versions

hal-03002550 , version 1 (12-11-2020)

Identifiants

Citer

Genoveva Vargas-Solar, José-Luis Zechinelli-Martini, Javier A Espinosa-Oviedo. Enacting Data Science Pipelines for Exploring Graphs: From Libraries to Studios. 24th European Conference on Advances in Databases and Information Systems, Aug 2020, Lyon, France. pp.271-280, ⟨10.1007/978-3-030-55814-7_23⟩. ⟨hal-03002550⟩
46 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More