
Editing training data for multi-label classification with the k-nearest neighbor rule

Abstract: Multi-label classification allows an instance to belong to several classes at once. It has received significant attention in machine learning and has found many real-world applications in recent years, such as text categorization, automatic video annotation and functional genomics, which has led to the development of many multi-label classification methods. From the labelled examples in the training dataset, a multi-label method extracts information in order to output a function that predicts the labels of unlabelled data. Because of problems such as errors in the input vectors or in their labels, this information may be wrong and may cause the multi-label algorithm to fail. In this paper, we propose a simple algorithm for overcoming these problems by editing the existing training dataset and then using the edited set with different multi-label classification methods. Evaluation on benchmark datasets demonstrates the usefulness and effectiveness of our approach.
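
The abstract does not spell out the editing algorithm itself. Purely as an illustration of what "editing a training set with the k-nearest neighbor rule" can look like in a multi-label setting, the sketch below assumes a Wilson-style edited-nearest-neighbor scheme: each training instance has its label vector predicted by a per-label majority vote among its k nearest other instances, and it is dropped when the Hamming loss between that prediction and its recorded labels exceeds a threshold. The function names, the majority-vote predictor, the Euclidean distance and the threshold value are all assumptions made for this example; they are not taken from the paper and should not be read as the authors' method.

```python
from math import dist  # Euclidean distance, Python 3.8+


def knn_label_vote(X, Y, i, k):
    """Predict the label vector of sample i by a per-label majority vote
    among its k nearest *other* training samples (hypothetical helper)."""
    others = (j for j in range(len(X)) if j != i)
    neighbors = sorted(others, key=lambda j: dist(X[i], X[j]))[:k]
    n_labels = len(Y[i])
    # A label is predicted as present if more than half of the neighbors carry it.
    return [1 if 2 * sum(Y[j][l] for j in neighbors) > k else 0
            for l in range(n_labels)]


def hamming_loss(y_true, y_pred):
    """Fraction of label positions on which the two binary vectors disagree."""
    return sum(a != b for a, b in zip(y_true, y_pred)) / len(y_true)


def edit_training_set(X, Y, k=3, threshold=0.5):
    """Return the training set with suspicious instances removed.

    An instance is kept only if the Hamming loss between its recorded
    labels and its neighbors' vote is at most `threshold` (an assumed
    cut-off chosen for this illustration).
    """
    keep = [i for i in range(len(X))
            if hamming_loss(Y[i], knn_label_vote(X, Y, i, k)) <= threshold]
    return [X[i] for i in keep], [Y[i] for i in keep]
```

Under these assumptions, the edited pair returned by `edit_training_set` could then be passed to any multi-label classifier in place of the original training data, which mirrors the workflow described in the abstract.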
Document type: Journal article

Cited literature: 13 references

https://hal.archives-ouvertes.fr/hal-01294269
Contributor: Thierry Denoeux
Submitted on: Tuesday, March 29, 2016 - 5:00:29 AM
Last modification on: Wednesday, July 15, 2020 - 11:52:04 AM
Long-term archiving on: Monday, November 14, 2016 - 7:34:46 AM

File

paperPAAA.pdf
Files produced by the author(s)

Citation

Sawsan Kanj, Fahed Abdallah, Thierry Denoeux, Kifah Tout. Editing training data for multi-label classification with the k-nearest neighbor rule. Pattern Analysis and Applications, Springer Verlag, 2016, 19 (1), pp.145-161. ⟨10.1007/s10044-015-0452-8⟩. ⟨hal-01294269⟩
