Missing Data in Hierarchical Classification of Variables, a simulation study - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

Missing Data in Hierarchical Classification of Variables, a simulation study

Résumé

Here we develop from a first work the effect of missing data in hierarchical classification of variables according to the following factors: amount of missing data, imputation techniques, similarity coefficient, and aggregation criterion. We have used two methods of imputation, a regression method using an OLS method and an EM algorithm. For the similarity matrices we have used the basic affinity coefficient and the Pearson's correlation coefficient. As aggregation criteria we apply average linkage, single linkage and complete linkage methods. To compare the structure of the hierarchical classifications the Spearman's coefficient between the associated ultrametrics has been used. We present here simulation experiments in five multivariate normal cases.
Fichier principal
Vignette du fichier
RC481.pdf (129.67 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01124802 , version 1 (26-03-2020)

Identifiants

  • HAL Id : hal-01124802 , version 1

Citer

Ana Lorga da Silva, Helena Bacelar-Nicolau, Gilbert Saporta. Missing Data in Hierarchical Classification of Variables, a simulation study. IFCS 2002, International Federation of Classification Societies, Jul 2002, Krakow, Poland. pp.121-128. ⟨hal-01124802⟩

Collections

CNAM CEDRIC-CNAM
91 Consultations
63 Téléchargements

Partager

Gmail Facebook X LinkedIn More