Approximate NORTA simulations for virtual sample generation - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Expert Systems with Applications Année : 2017

Approximate NORTA simulations for virtual sample generation

Résumé

We introduce an approximate variant of the NORTA method which aims at generating structured data from a given prior sample. The technique accommodates for any combinations of marginals (especially continuous/discrete mixtures) and a wide range of correlation structures. We focus on the interesting case where the sample includes categorical data, both ordered and unordered. We provide an application in the financial industry through a test of our iterative Newton-like algorithm on a dataset comprising the results of a questionnaire. We show that the sampled data, similarly to the NORTA technique, matches both the marginal and correlation structures of the original dataset closely. Consequently, analyses such as decision tree modeling or Support Vector Machine classification and regression, can be carried out on the new, much larger, sample without altering the core properties of the original sample.
Fichier non déposé

Dates et versions

hal-02000704 , version 1 (30-01-2019)

Identifiants

Citer

Guillaume Coqueret. Approximate NORTA simulations for virtual sample generation. Expert Systems with Applications, 2017, 73, pp.69-81. ⟨10.1016/j.eswa.2016.12.027⟩. ⟨hal-02000704⟩
63 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More