An Extension of PLSA for Document Clustering

Abstract : In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to co-cluster documents and terms simultaneously. We show on three datasets that our extended model produces statistically significant improvements with respect to two clustering measures over the original PLSA and the multinomial mixture MM models.
Document type :
Conference papers
Complete list of metadatas
Contributor : Lip6 Publications <>
Submitted on : Tuesday, April 12, 2016 - 3:19:09 PM
Last modification on : Thursday, March 21, 2019 - 1:09:12 PM



Young-Min Kim, Jean-François Pessiot, Massih-Reza Amini, Patrick Gallinari. An Extension of PLSA for Document Clustering. 17th ACM Conference on Information and Knowledge Management (CIKM 2008), Oct 2008, Napa Valley, CA, United States. pp.1345-1346, ⟨10.1145/1458082.1458271⟩. ⟨hal-01301612⟩



Record views