Data-driven probability concentration and sampling on manifold - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Computational Physics Année : 2016

Data-driven probability concentration and sampling on manifold

Résumé

A new methodology is proposed for generating realizations of a random vector with values in a finite-dimensional Euclidean space that are statistically consistent with a dataset of observations of this vector. The probability distribution of this random vector, while a-priori not known, is presumed to be concentrated on an unknown subset of the Euclidean space. A random matrix is introduced whose columns are independent copies of the random vector and for which the number of columns is the number of data points in the dataset. The approach is based on the use of (i) the multidimensional kernel-density estimation method for estimating the probability distribution of the random matrix, (ii) a MCMC method for generating realizations for the random matrix, (iii) the diffusion-maps approach for discovering and characterizing the geometry and the structure of the dataset, and (iv) a reduced-order representation of the random matrix, which is constructed using the diffusion-maps vectors associated with the first eigenvalues of the transition matrix relative to the given dataset. The convergence aspects of the proposed methodology are analyzed and a numerical validation is explored through three applications of increasing complexity. The proposed method is found to be robust to noise levels and data complexity as well as to the intrinsic dimension of data and the size of experimental datasets. Both the methodology and the underlying mathematical framework presented in this paper contribute new capabilities and perspectives at the interface of uncertainty quantification, statistical data analysis, stochastic modeling and associated statistical inverse problems.
Fichier principal
Vignette du fichier
publi-2016-JCP-soize-ghanem-preprint.pdf (1.61 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01325279 , version 1 (02-06-2016)

Identifiants

Citer

Christian Soize, Roger Ghanem. Data-driven probability concentration and sampling on manifold. Journal of Computational Physics, 2016, 321, pp.242-258. ⟨10.1016/j.jcp.2016.05.044⟩. ⟨hal-01325279⟩
181 Consultations
628 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More