Exploiting NoSQL Graph Databases and In Memory Architectures for Extracting Graph Structural Data Summaries - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems Année : 2017

Exploiting NoSQL Graph Databases and In Memory Architectures for Extracting Graph Structural Data Summaries

Résumé

NoSQL graph databases have been introduced in recent years for dealing with large collections of graph-based data. Scientific data and social networks are among the best examples of the dramatic increase of the use of such structures. NoSQL repositories allow the management of large amounts of data in order to store and query them. Such data are not structured with a predefined schema as relational databases could be. They are rather composed by nodes and relationships of a certain type. For instance, a node can represent a Person and a relationship Friendship. Retrieving the structure of the graph database is thus of great help to users, for example when they must know how to query the data or to identify relevant data sources for recommender systems. For this reason, this paper introduces methods to retrieve structural summaries. Such structural summaries are extracted at different levels of information from the NoSQL graph database. The expression of the mining queries is facilitated by the use of two frame-works: Fuzzy4S allowing to define fuzzy operators and operations with Scala; Cypherf allowing the use of fuzzy operators and operations in the declarative queries over NoSQL graph databases. We show that extracting such summaries can be impossible with the NoSQL query engines because of the data volume and the complexity of the task of automatic knowledge extraction. A novel method based on in memory architectures is thus introduced. This paper provides the definitions of the summaries with the methods to automatically extract them from NoSQL graph databases only and with the help of in-memory architectures. The benefit of our proposition is demonstrated by experimental results.
Fichier principal
Vignette du fichier
lirmm-01381083v1.pdf (1.36 Mo) Télécharger le fichier
Loading...

Dates et versions

lirmm-01381083 , version 1 (01-11-2019)

Identifiants

Citer

Arnaud Castelltort, Anne Laurent. Exploiting NoSQL Graph Databases and In Memory Architectures for Extracting Graph Structural Data Summaries. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2017, 25 (1), pp.81-109. ⟨10.1142/S0218488517500040⟩. ⟨lirmm-01381083⟩
335 Consultations
286 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More