Conference papers

Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases

Abstract : When real datasets are difficult to obtain for tasks such as system analysis or algorithm evaluation, synthetic datasets are commonly used. Techniques for generating such datasets often produce random data for a single table. Such datasets are of little use for evaluating data mining or machine learning algorithms that deal with relational data. To address this, our earlier work dealt with generating relational datasets from Probabilistic Relational Models (PRMs), a framework for dealing with probabilistic uncertainties in relational domains. In this article, we extend that work by proposing more efficient data sampling algorithms and by using a spatial extension of PRMs to generate synthetic spatial datasets. We also present an experimental analysis of three data sampling algorithms applicable to our method and of the quality of the datasets they generate.

https://hal.archives-ouvertes.fr/hal-01761901
Contributor : Rajani Chulyadyo
Submitted on : Thursday, April 9, 2020 - 5:40:43 PM
Last modification on : Wednesday, June 24, 2020 - 4:19:29 PM

File

08406645.pdf
Files produced by the author(s)

Citation

Rajani Chulyadyo, Philippe Leray. Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases. Research Challenges in Information Science (RCIS) 2018, 12th International Conference on, May 2018, Nantes, France. pp.1-12, ⟨10.1109/RCIS.2018.8406645⟩. ⟨hal-01761901⟩
