On the selection of decision trees in Random Forests

Simon Bernard; Laurent Heutte; Sébastien Adam

doi:10.1109/IJCNN.2009.5178693

Communication Dans Un Congrès Année : 2009

On the selection of decision trees in Random Forests

(1) , (1) , (1)

Simon Bernard

Fonction : Auteur
PersonId : 21744
IdHAL : bernasim
ORCID : 0000-0003-0200-4294
IdRef : 143005707

Equipe Apprentissage

Laurent Heutte

Fonction : Auteur
PersonId : 171701
IdHAL : laurent-heutte
ORCID : 0000-0003-4740-9770
IdRef : 143005863

Equipe Apprentissage

Sébastien Adam

Fonction : Auteur

Equipe Apprentissage

Résumé

In this paper we present a study on the random forest (RF) family of ensemble methods. In a ldquoclassicalrdquo RF induction process a fixed number of randomized decision trees are inducted to form an ensemble. This kind of algorithm presents two main drawbacks : (i) the number of trees has to be fixed a priori (ii) the interpretability and analysis capacities offered by decision tree classifiers are lost due to the randomization principle. This kind of process in which trees are independently added to the ensemble, offers no guarantee that all those trees will cooperate effectively in the same committee. This statement rises two questions: are there any decision trees in a RF that provide the deterioration of ensemble performance? If so, is it possible to form a more accurate committee via removal of decision trees with poor performance? The answer to these questions is tackled as a classifier selection problem. We thus show that better subsets of decision trees can be obtained even using a sub-optimal classifier selection method. This proves that ldquoclassicalrdquo RF induction process, for which randomized trees are arbitrary added to the ensemble, is not the best approach to produce accurate RF classifiers. We also show the interest in designing RF by adding trees in a more dependent way than it is traditionally done in ldquoclassicalrdquo RF induction algorithms.

Domaines

Apprentissage [cs.LG]

Fichier principal

ijcnn09.pdf (135.64 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sébastien Adam : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00436355

Soumis le : jeudi 26 novembre 2009-14:22:02

Dernière modification le : vendredi 22 décembre 2023-15:16:05

Archivage à long terme le : mardi 16 octobre 2012-14:55:57

Dates et versions

hal-00436355 , version 1 (26-11-2009)

Identifiants

HAL Id : hal-00436355 , version 1
DOI : 10.1109/IJCNN.2009.5178693

Citer

Simon Bernard, Laurent Heutte, Sébastien Adam. On the selection of decision trees in Random Forests. IEEE International Joint Conference on Neural Networks (IJCNN), Jun 2008, Atlanta, United States. pp.302-307, ⟨10.1109/IJCNN.2009.5178693⟩. ⟨hal-00436355⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSA-ROUEN LITIS COMUE-NORMANDIE UNIROUEN UNILEHAVRE INSA-GROUPE

2046 Consultations

2303 Téléchargements

On the selection of decision trees in Random Forests

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager