Characterizing Forest of Fuzzy Decision Trees Errors - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Characterizing Forest of Fuzzy Decision Trees Errors

Résumé

Forest of fuzzy decision trees are known to be a powerful machine learning tool in terms of good classification rate. The idea of this paper is to focus on the misclassified examples of this method to improve it. To this aim, the preliminary step consists in understanding what are the sources of misclassifications. Concretely, we propose to characterize misclassified examples by finding different types of errors or different groups of errors. We modify J. Forest’s algorithm of supervised segmentation to find these groups. Our general process is the following: first, we learn a forest of fuzzy decision trees on a learning dataset and we validate it on a validation dataset. We class the examples on two categories: those who are well classified by the majority of the trees and those who are badly classified by the majority of the trees. Then, this new dataset is characterized by supervised segmentation algorithm. For instance, we can obtain that the badly classified examples can be distinguished in two groups: those who can be interpreted as outliers and those who are close to the frontiers of decision. We apply our general process on several datasets of UCI machine learning repository to show its benefits.
Fichier non déposé

Dates et versions

hal-01286230 , version 1 (10-03-2016)

Identifiants

  • HAL Id : hal-01286230 , version 1

Citer

Christophe Marsala, Maria Rifqi. Characterizing Forest of Fuzzy Decision Trees Errors. The 4th International Conference of the ERCIM WG on COMPUTING & STATISTICS (ERCIM'11), Dec 2011, London, United Kingdom. ⟨hal-01286230⟩
70 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More