Editing training data for multi-label classification with the k-nearest neighbor rule

Sawsan Kanj; Fahed Abdallah; Thierry Denoeux; Kifah Tout

doi:10.1007/s10044-015-0452-8

Journal article, Pattern Analysis and Applications, Year: 2016


Sawsan Kanj

Role: Author

Heuristique et Diagnostic des Systèmes Complexes [Compiègne]

Fahed Abdallah

Role: Author
PersonId: 866468

Heuristique et Diagnostic des Systèmes Complexes [Compiègne]

Thierry Denoeux

Role: Author
PersonId: 2983
IdHAL: tdenoeux
ORCID: 0000-0002-0660-5436
IdRef: 058663495

Laboratoire d'Excellence "Maîtrise des Systèmes de Systèmes Technologiques"

Heuristique et Diagnostic des Systèmes Complexes [Compiègne]

Kifah Tout

Role: Author

Azm Center for Biotechnology Research

Abstract

Multi-label classification allows instances to belong to several classes at once. It has received significant attention in machine learning and has found many real-world applications in recent years, such as text categorization, automatic video annotation, and functional genomics, resulting in the development of many multi-label classification methods. From the labelled examples in the training dataset, a multi-label method extracts information in order to output a function that predicts the labels of unlabelled data. Because of problems such as errors in the input vectors or in their labels, this information may be wrong and may cause the multi-label algorithm to fail. In this paper, we propose a simple algorithm for overcoming these problems by editing the existing training dataset and using the edited set with different multi-label classification methods. Evaluation on benchmark datasets demonstrates the usefulness and effectiveness of our approach.
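The abstract does not spell out the editing rule, so the following is only a generic illustration of what editing a multi-label training set with a k-nearest neighbor rule could look like, not the authors' algorithm. It assumes a Wilson-style pass: each training instance is compared with a per-label majority vote of its k nearest neighbors and is dropped when the Hamming loss between its own labels and that vote exceeds a threshold. The function names, the Euclidean distance, the `max_loss` threshold, and the toy data are all hypothetical.

```python
import math


def hamming_loss(y_true, y_pred):
    """Fraction of label positions where two binary label vectors differ."""
    return sum(a != b for a, b in zip(y_true, y_pred)) / len(y_true)


def knn_label_vote(X, Y, i, k):
    """Per-label majority vote over the k nearest neighbors of instance i.

    Neighbors are ranked by Euclidean distance; instance i itself is excluded.
    """
    others = sorted((j for j in range(len(X)) if j != i),
                    key=lambda j: math.dist(X[i], X[j]))
    neighbors = others[:k]
    n_labels = len(Y[i])
    # A label is predicted when strictly more than half the neighbors carry it.
    return [1 if 2 * sum(Y[j][l] for j in neighbors) > len(neighbors) else 0
            for l in range(n_labels)]


def edit_training_set(X, Y, k=3, max_loss=0.5):
    """Return the indices of the instances kept after one editing pass.

    max_loss is an assumed threshold: an instance is dropped when its labels
    disagree with the neighbor vote on more than this fraction of labels.
    """
    keep = []
    for i in range(len(X)):
        vote = knn_label_vote(X, Y, i, k)
        if hamming_loss(Y[i], vote) <= max_loss:
            keep.append(i)
    return keep


# Toy data (hypothetical): two clusters, two labels.
# Instance 3 sits in the first cluster but carries the second cluster's labels.
X = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
     (1.0, 1.0), (1.1, 1.0), (1.0, 1.1)]
Y = [[1, 0], [1, 0], [1, 0], [0, 1],
     [0, 1], [0, 1], [0, 1]]

kept = edit_training_set(X, Y, k=3, max_loss=0.5)
print(kept)  # [0, 1, 2, 4, 5, 6]
```

In this sketch the edited set is just a list of retained indices, so it can be passed to any multi-label classifier afterwards, in the spirit of the abstract's remark that the edited set is used with different classification methods.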

Keywords

prototype selection; edition; multi-label classification; k-nearest neighbors rule

Domains

Artificial intelligence [cs.AI]

Main file

paperPAAA.pdf (526.09 KB)

Origin: Files produced by the author(s)

Contributor: Thierry Denoeux

https://hal.science/hal-01294269

Submitted on: Tuesday, March 29, 2016, 05:00:29

Last modified on: Wednesday, February 7, 2024, 15:42:07

Long-term archiving on: Monday, November 14, 2016, 07:34:46

Dates and versions

hal-01294269, version 1 (29-03-2016)

Identifiers

HAL Id: hal-01294269, version 1
DOI: 10.1007/s10044-015-0452-8

Cite

Sawsan Kanj, Fahed Abdallah, Thierry Denoeux, Kifah Tout. Editing training data for multi-label classification with the k-nearest neighbor rule. Pattern Analysis and Applications, 2016, 19 (1), pp.145-161. ⟨10.1007/s10044-015-0452-8⟩. ⟨hal-01294269⟩


Collections

CNRS UNIV-COMPIEGNE HEUDIASYC DI MS2T

159 views

3604 downloads
