Boosting Nearest Neighbors for the Efficient Estimation of Posteriors

Abstract : It is widely acknowledged that mainstream boosting algorithms like AdaBoost do not perform well at estimating class conditional probabilities. In this paper, we analyze, in the light of this problem, a recent algorithm, unn, which leverages nearest neighbors while minimizing a convex loss. Our contribution is threefold. First, we show that there exists a subclass of surrogate losses, elsewhere called balanced, whose minimization yields simple and statistically efficient estimators for Bayes posteriors. Second, we show explicit convergence rates towards these estimators for unn, for any such surrogate loss, under a Weak Learning Assumption which parallels that of classical boosting results. Third and last, we provide experiments and comparisons on synthetic and real datasets, including the challenging SUN computer vision database. Results clearly show that boosting nearest neighbors may provide highly accurate estimators, sometimes more than a hundred times more accurate than those of other contenders like support vector machines.

https://hal.archives-ouvertes.fr/hal-00702771
Contributor : Wafa Bel Haj Ali
Submitted on : Thursday, May 31, 2012 - 11:55:41 AM
Last modification on : Thursday, February 7, 2019 - 3:08:39 PM
Document(s) archived on : Friday, November 30, 2012 - 12:51:17 PM

File

ecml12-dnbnb-sub.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00702771, version 1

Citation

Roberto d'Ambrosio, Richard Nock, Wafa Bel Haj Ali, Frank Nielsen, Michel Barlaud. Boosting Nearest Neighbors for the Efficient Estimation of Posteriors. ECML-PKDD 2012, Sep 2012, Bristol, United Kingdom. pp.16. ⟨hal-00702771⟩
