Comparison of two topological approaches for dealing with noisy labeling - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Neurocomputing Année : 2015

Comparison of two topological approaches for dealing with noisy labeling

Résumé

This paper focuses on the detection of likely mislabeled instances in a learning dataset. In order to detect potentially mislabeled samples, two solutions are considered which are both based on the same framework of topological graphs. The first is a statistical approach based on Cut Edges Weighted statistics (CEW) in the neighborhood graph. The second solution is a Relaxation Technique (RT) that optimizes a local criterion in the neighborhood graph. The evaluations by ROC curves show good results since almost 90% of the mislabeled instances are retrieved for a cost of less than 20% of false positive. The removal of samples detected as mislabeled by our approaches generally leads to an improvement of the performances of classical machine learning algorithms.
Fichier principal
Vignette du fichier
Neurocomputing__SL_FM_FR_DAZ_V5.pdf (660.15 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01524431 , version 1 (18-05-2017)

Identifiants

Citer

Fabien A Rico, Fabrice Muhlenbach, Djamel Abdelkader Zighed, Stéphane A Lallich. Comparison of two topological approaches for dealing with noisy labeling. Neurocomputing, 2015, 160, pp.3 - 17. ⟨10.1016/j.neucom.2014.10.087⟩. ⟨hal-01524431⟩
60 Consultations
151 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More