Conference papers

Multiple Imputation and Multidimensional Scaling Applied to a k-means Method

Abstract : The effects of missing data (MD) and imputation methods (IM) in cluster analysis have been studied in Silva (2005) and Silva et al. (2006) for some hierarchical classification methods and partition methods, in the case of variable clustering. As in Silva et al. (2006), the partition method is the following: we start by finding a dissimilarity matrix between variables; a multidimensional scaling (MDS) technique, PROXSCAL, provides components which are used as inputs to a k-means method. In this communication, when there are MD, we evaluate the effect of IM combined with the PROXSCAL MDS procedure (Commandeur and Heiser (1993)): for a data matrix with missing data, m imputations are performed; m dissimilarity matrices are then obtained, one from each imputed matrix; PROXSCAL without constraints over these m dissimilarity matrices provides components; k-means is performed on these components; and finally the resulting partition is compared with the original one, i.e. the one obtained with the complete data, by means of the Rand index as in Youness and Saporta (2004) and an affinity coefficient as in Sousa (2006). The simulation study consists of generating different patterns of partitions from twenty-five variables following multinormal distributions. As in Silva (2005), data are deleted in increasing proportions to create MD patterns, and several IM are compared.
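
The final comparison step mentioned in the abstract, the Rand index between the partition obtained after imputation and the one obtained from the complete data, can be illustrated with a minimal sketch. This is not the authors' code: the function name `rand_index` and the two example label lists are hypothetical, and the sketch uses the plain (unadjusted) Rand index, i.e. the proportion of pairs of variables on which the two partitions agree.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Plain Rand index between two partitions of the same objects.

    Each argument is a sequence of cluster labels, one per object
    (here, one per variable being clustered). The index is the share of
    object pairs that are either grouped together in both partitions or
    separated in both partitions.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both partitions must label the same objects")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        raise ValueError("at least two objects are required")
    agreements = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agreements / len(pairs)


# Hypothetical example: six variables, three clusters.
reference = [0, 0, 0, 1, 1, 2]   # partition from the complete data
imputed = [0, 0, 1, 1, 1, 2]     # partition after imputation (made up)

print(rand_index(reference, imputed))
```

A value of 1 means the two partitions are identical up to a relabelling of the clusters; lower values indicate that imputation changed which variables are grouped together.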

https://hal.archives-ouvertes.fr/hal-01125564
Contributor : Laboratoire Cedric
Submitted on : Friday, March 6, 2015 - 11:14:13 AM
Last modification on : Monday, March 30, 2020 - 2:27:11 PM

Identifiers

  • HAL Id : hal-01125564, version 1

Citation

Helena Bacelar-Nicolau, Gilbert Saporta, Ana Lorga da Silva. Multiple Imputation and Multidimensional Scaling Applied to a k-means Method. COMPSTAT'08, Aug 2008, Porto, Portugal. ⟨hal-01125564⟩