HAL : hal-00186391, version 1

Learning from Noisy Data using Hyperplane Sampling and Sample Averages
Guillaume Stempfel 1, Liva Ralaivola 1, François Denis 1
(03/05/2007)

We present a new classification algorithm capable of learning from data corrupted by class-dependent uniform classification noise. The algorithm produces a linear classifier and extends directly to kernels. It samples random hyperplanes, which are used to build new training examples whose correct classes are known; a linear classifier (e.g. an SVM) is then learned from these examples and returned by the algorithm. The new examples are sample averages of the available data, computed over regions of the space defined by the random hyperplanes and the target hyperplane. We provide a statistical analysis of the properties of these sample averages, together with results from numerical simulations on synthetic datasets. These simulations show that the linear and kernelized versions of our algorithm are effective for learning from both noise-free and noisy data.
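To make the noise model in the abstract concrete, the following is a minimal sketch of how a synthetic dataset with class-dependent uniform classification noise could be generated. It is not the authors' code and does not implement their algorithm. It assumes the usual reading of this noise model: each label is flipped independently of the point's position, with a rate that depends only on its true class. The function names, the parameters `eta_pos` and `eta_neg`, and the choice of a target hyperplane through the origin are all hypothetical choices made for illustration.

```python
import random


def make_noisy_dataset(n, dim, eta_pos, eta_neg, seed=0):
    """Draw n points in [-1, 1]^dim, label them with a random target
    hyperplane through the origin, then flip labels at class-dependent rates.

    eta_pos / eta_neg are assumed flip probabilities for the +1 / -1 class.
    Returns (w, points, clean_labels, noisy_labels).
    """
    rng = random.Random(seed)
    # Normal vector of the (hypothetical) target hyperplane.
    w = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    points, clean, noisy = [], [], []
    for _ in range(n):
        x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        # True class: side of the target hyperplane the point falls on.
        y = 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0 else -1
        # The flip rate depends only on the true class, not on x.
        flip_rate = eta_pos if y == 1 else eta_neg
        y_noisy = -y if rng.random() < flip_rate else y
        points.append(x)
        clean.append(y)
        noisy.append(y_noisy)
    return w, points, clean, noisy


def empirical_flip_rates(clean, noisy):
    """Return the observed flip rate for the +1 class and for the -1 class."""
    pos_total = sum(1 for y in clean if y == 1)
    neg_total = len(clean) - pos_total
    pos_flips = sum(1 for y, z in zip(clean, noisy) if y == 1 and z != y)
    neg_flips = sum(1 for y, z in zip(clean, noisy) if y == -1 and z != y)
    return pos_flips / max(1, pos_total), neg_flips / max(1, neg_total)


if __name__ == "__main__":
    w, X, y, y_noisy = make_noisy_dataset(5000, 2, eta_pos=0.3, eta_neg=0.1)
    print(empirical_flip_rates(y, y_noisy))
```

On a large sample, the observed flip rates should be close to the requested `eta_pos` and `eta_neg`. A learner only ever sees `noisy`; the clean labels are kept here solely to check the simulation.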
1 :  Laboratoire d'informatique Fondamentale de Marseille (LIF)
CNRS : UMR6166 – Université de la Méditerranée - Aix-Marseille II – Université de Provence - Aix-Marseille I
Computer Science/Machine Learning
Classification Noise – Linear Classifier – Kernels
Files attached to this document:
PDF: quad.pdf (804.6 KB)