Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases

Résumé

When real datasets are difficult to obtain for tasks such as system analysis, or algorithm evaluation, synthetic datasets are commonly used. Techniques for generating such datasets often generate random data for single-table datasets. Such datasets are often inapplicable when it comes to evaluating data mining or machine learning algorithms dealing with relational data. To address this, our earlier works have dealt with the task of generating relational datasets from Probabilistic Relational Models (PRMs), a framework for dealing with prob-abilistic uncertainties in relational domains. In this article, we extend this work by proposing to use more efficient data sampling algorithms, and by using a spatial extension of PRMs to generate synthetic spatial datasets. We also present our experimental analysis on three different data sampling algorithms applicable in our method, and the quality of the datasets generated by them.
Fichier principal
Vignette du fichier
08406645.pdf (1.11 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01761901 , version 1 (09-04-2020)

Identifiants

Citer

Rajani Chulyadyo, Philippe Leray. Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases. Research Challenges in Information Science (RCIS) 2018, 12th International Conference on, May 2018, Nantes, France. pp.1-12, ⟨10.1109/RCIS.2018.8406645⟩. ⟨hal-01761901⟩
242 Consultations
233 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More