Average of Recentered Parallel MCMC for Big Data - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Average of Recentered Parallel MCMC for Big Data

Résumé

In big data context, traditional MCMC methods, such as Metropolis-Hastings algorithms and hybrid Monte Carlo, scale poorly because of their need to evaluate the likelihood over the whole data set at each iteration. In order to resurrect MCMC methods, numerous approaches belonging to two categories: divide-and-conquer and subsampling, are proposed. In this article, we study the parallel MCMC and propose a new combination method in the divide-and-conquer framework. Compared with some parallel MCMC methods, such as consensus Monte Carlo, Weierstrass Sampler, instead of sampling from subposteriors, our method runs MCMC on rescaled subposteriors, but share the same computation cost in the parallel stage. We also give the mathematical justification of our method and show its performance in several models. Besides, even though our new methods is proposed in parametric framework, it can been applied to non-parametric cases without difficulty.

Dates et versions

hal-01968782 , version 1 (03-01-2019)

Identifiants

Citer

Christian Robert, Changye Wu, Christian P. Robert. Average of Recentered Parallel MCMC for Big Data. 2019. ⟨hal-01968782⟩
301 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More