A Bayesian reassessment of nearest-neighbour classification - Archive ouverte HAL Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2007

A Bayesian reassessment of nearest-neighbour classification

Résumé

The k-nearest-neighbour procedure is a well-known method used in supervised classification. While it has been superseded by more recent methods developed in machine learning, it remains an essential tool for classifiers. This paper proposes a reassessment of this approach as a statistical technique derived from a proper probabilistic model; in particular, we modify the assessment made in a previous analysis of this method undertaken by Holmes and Adams (2002, 2003) where the underlying probabilistic model is not completely well-defined. Once clear probabilistic bases of the k-nearest-neighbour procedure are established, we proceed to the derivation of practical computational tools to conduct Bayesian inference on the parameters of the corresponding model. In particular, we assess the difficulties inherent to pseudo-likelihood and to path sampling approximations of a missing normalising constant, and propose a perfect sampling strategy to implement a correct MCMC sampler associated with our model. Illustrations of the performance of the corresponding Bayesian classifier are provided for two benchmark datasets, demonstrating in particular the limitations of the pseudo-likelihood approximation in this set-up.
Fichier principal
Vignette du fichier
marin-robert-titterington.pdf (543.16 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

inria-00143783 , version 1 (26-04-2007)
inria-00143783 , version 2 (08-05-2007)
inria-00143783 , version 3 (03-03-2008)
inria-00143783 , version 4 (03-03-2008)

Identifiants

  • HAL Id : inria-00143783 , version 1

Citer

Jean-Michel Marin, Christian Robert, Mike Titterington. A Bayesian reassessment of nearest-neighbour classification. [Research Report] RR-6173, 2007, pp.28. ⟨inria-00143783v1⟩

Collections

INRIA-RRRT
354 Consultations
848 Téléchargements

Partager

Gmail Facebook X LinkedIn More