, Chapter 18, pp.93-97
Monte Carlo simulation and clustering for customer segmentation in business organization, 3rd International Conference on Science and Technology-Computer, pp.104-109, 2017. ,
Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems, Ann. Statist, vol.2, issue.6, pp.1152-1174, 1974. ,
MapReduce: Simplified Data Processing on Large Clusters, Commun. ACM, vol.51, pp.107-113, 2008. ,
Determining the k in k-means with MapReduce, EDBT/ICDT Workshops, pp.19-28, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01525708
Maximum likelihood from incomplete data via the EM algorithm, Journal of the royal statistical society. Series B (methodological, pp.1-38, 1977. ,
Fast clustering using MapReduce, Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp.681-689, 2011. ,
Estimating normal means with a Dirichlet process prior, J. Amer. Statist. Assoc, vol.89, pp.268-277, 1994. ,
Bayesian density estimation and inference using mixtures, Journal of the american statistical association, vol.90, pp.577-588, 1995. ,
Pitfalls in the use of parallel inference for the Dirichlet process, Proceedings of the 31st International Conference on Machine Learning, pp.208-216, 2014. ,
, Bayesian Data Analysis, 2004.
Parallel gibbs sampling: From colored fields to thin junction trees, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp.324-332, 2011. ,
A Survey of Outlier Detection Methodologies, Artificial Intelligence Review, vol.22, pp.85-126, 2004. ,
An introduction to statistical learning, vol.112, 2013. ,
Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling, Statist. Sci, pp.50-67, 2005. ,
Parallel markov chain monte carlo for dirichlet process mixtures, Workshop on Big Learning, NIPS, 2012. ,
Machine learning for Big Data analytics in plants, Trends in Plant Science, vol.19, pp.798-808, 0112. ,
Inconsistency of Pitman-Yor process mixtures for the number of components, The Journal of Machine Learning Research, vol.15, pp.3333-3370, 2014. ,
Markov chain sampling methods for Dirichlet process mixture models, Journal of computational and graphical statistics, vol.9, pp.249-265, 2000. ,
Distributed algorithms for topic models, Journal of Machine Learning Research, vol.10, pp.1801-1828, 2009. ,
A fast version of the k-means classification algorithm for astronomical applications, Astronomy & Astrophysics, vol.565, 2014. ,
A constructive definition of Dirichlet priors, Statistica sinica, pp.639-650, 1994. ,
The Hadoop distributed filesystem: Balancing portability and performance, 2010. ,
Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance, J. Mach. Learn. Res, vol.11, pp.2837-2854, 2010. ,
Scalable Estimation of Dirichlet Process Mixture Models on Distributed Data, Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.4632-4639, 2017. ,
Parallel Markov chain Monte Carlo for nonparametric mixture models, International Conference on Machine Learning, pp.98-106, 2013. ,
, Spark: Cluster Computing with Working Sets. In HotCloud, 2010.