Extension of model-based classification for binary data when training and test populations differ - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2008

Extension of model-based classification for binary data when training and test populations differ

Julien Jacques
Christophe Biernacki
  • Fonction : Auteur
  • PersonId : 853117

Résumé

Standard discriminant analysis supposes that both the training sample and the test sample are issued from the same population. When these samples arise from populations differing from their descriptive parameters, a generalization of discriminant analysis consists in adapting the classification rule related to the training population to another rule related to the test population, by estimating a link between both populations. This paper extends an existing work available in a multinormal context to the case of binary data. To raise the major challenge which consists in defining a link between the two binary populations, it is supposed that binary data result from the discretization of latent Gaussian data. Estimation method and robustness study are presented, and two applications in a biological context illustrate this work.
Fichier principal
Vignette du fichier
ArticleADG-Preprint.pdf (240.66 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00316080 , version 1 (02-09-2008)
hal-00316080 , version 2 (02-03-2009)
hal-00316080 , version 3 (17-03-2009)

Identifiants

  • HAL Id : hal-00316080 , version 1

Citer

Julien Jacques, Christophe Biernacki. Extension of model-based classification for binary data when training and test populations differ. 2008. ⟨hal-00316080v1⟩
125 Consultations
165 Téléchargements

Partager

Gmail Facebook X LinkedIn More