Journal article in Journal of the American Statistical Association. Year: 2009

A Bayesian reassessment of nearest-neighbour classification

Abstract

The k-nearest-neighbor (knn) procedure is a well-known deterministic method used in supervised classification. This article proposes a reassessment of this approach as a statistical technique derived from a proper probabilistic model; in particular, we modify the assessment found in Holmes and Adams, and evaluated by Manocha and Girolami, where the underlying probabilistic model is not completely well defined. Once provided with a clear probabilistic basis for the knn procedure, we derive computational tools for Bayesian inference on the parameters of the corresponding model. In particular, we assess the difficulties inherent to both pseudo-likelihood and path sampling approximations of an intractable normalizing constant. We implement a correct MCMC sampler based on perfect sampling. When perfect sampling is not available, we instead use a Gibbs sampling approximation. Illustrations of the performance of the corresponding Bayesian classifier are provided for benchmark datasets, demonstrating in particular the limitations of the pseudo-likelihood approximation in this setting.
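
To make the contrast drawn in the abstract concrete, here is a minimal Python sketch; it is not the authors' implementation. `knn_predict` is the usual deterministic majority vote. `knn_class_probs` is an assumed illustration of a probabilistic reading, in which the probability of class g for a new point is taken proportional to exp(beta * n_g / k), where n_g counts the k nearest neighbours in class g. The abstract does not state the form of the model, so this functional form, the parameter name `beta`, and all function names are assumptions made for illustration only.

```python
import math
from collections import Counter


def k_nearest(train_x, query, k):
    """Indices of the k training points closest to `query` (Euclidean distance)."""
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    return order[:k]


def knn_predict(train_x, train_y, query, k):
    """Deterministic knn: majority vote among the labels of the k nearest points."""
    votes = Counter(train_y[i] for i in k_nearest(train_x, query, k))
    # Ties are broken in favour of the smallest label so the result is reproducible.
    return min(votes, key=lambda g: (-votes[g], g))


def knn_class_probs(train_x, train_y, query, k, beta, classes):
    """Assumed probabilistic variant: P(y = g) proportional to exp(beta * n_g / k).

    `beta` is a hypothetical strength parameter; beta = 0 gives uniform
    probabilities, and large beta approaches the deterministic vote.
    """
    counts = Counter(train_y[i] for i in k_nearest(train_x, query, k))
    weights = {g: math.exp(beta * counts[g] / k) for g in classes}
    total = sum(weights.values())  # normalisation over classes only
    return {g: w / total for g, w in weights.items()}


# Toy data: two well-separated clusters with labels 0 and 1.
train_x = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
train_y = [0, 0, 0, 1, 1, 1]
query = (0.5, 0.5)

label = knn_predict(train_x, train_y, query, k=3)
probs = knn_class_probs(train_x, train_y, query, k=3, beta=2.0, classes=[0, 1])
flat = knn_class_probs(train_x, train_y, query, k=3, beta=0.0, classes=[0, 1])
```

The normalisation in this sketch runs over the classes of a single new point, so it is cheap to compute. The intractable normalizing constant discussed in the abstract concerns the model's parameters and is not attempted here.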
Main file
RR-6173.pdf (1.99 MB)
Origin: Files produced by the author(s)

Dates and versions

inria-00143783, version 1 (26-04-2007)
inria-00143783, version 2 (08-05-2007)
inria-00143783, version 3 (03-03-2008)
inria-00143783, version 4 (03-03-2008)

Cite

Lionel Cucala, Jean-Michel Marin, Christian Robert, Mike Titterington. A Bayesian reassessment of nearest-neighbour classification. Journal of the American Statistical Association, 2009, 104 (485), pp.263-273. ⟨10.1198/jasa.2009.0125⟩. ⟨inria-00143783v4⟩