Conference paper, Year: 2009

Influence of Hyperparameters on Random Forest Accuracy

Simon Bernard
Laurent Heutte

Abstract

In this paper we present our work on the Random Forest (RF) family of classification methods. Our goal is to go one step further in the understanding of RF mechanisms by studying the parametrization of the reference algorithm Forest-RI. In this algorithm, a randomization principle is used during the tree induction process: K features are randomly selected at each node, and the best split is chosen among them. The strength of randomization in the tree induction is thus governed by the hyperparameter K, which plays an important role in building accurate RF classifiers. We have focused our experimental study on this hyperparameter and on its influence on classification accuracy. For that purpose, we have evaluated the Forest-RI algorithm on several machine learning problems with different settings of K, in order to understand how it affects RF performance. We show that the default values of K traditionally used in the literature are generally near-optimal, except in some cases where they are all significantly sub-optimal. Additional experiments were therefore conducted on those datasets; they highlight the crucial role played by feature relevancy in finding the optimal setting of K.
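
The node-level randomization described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the helper names (`gini`, `best_split_among_k`), the use of Gini impurity as the split criterion, the midpoint thresholds, and the toy data are all assumptions made for illustration, and only the Python standard library is used.

```python
import random
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (assumed split criterion)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split_among_k(X, y, k, rng):
    """Draw k features at random, then return the best split among them.

    X is a list of rows (lists of numbers), y the matching class labels.
    Returns (weighted_impurity, feature_index, threshold), or None if no
    drawn feature admits a split.
    """
    n_features = len(X[0])
    candidates = rng.sample(range(n_features), k)  # the randomization step
    best = None
    for f in candidates:
        values = sorted(set(row[f] for row in X))
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [label for row, label in zip(X, y) if row[f] <= t]
            right = [label for row, label in zip(X, y) if row[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best


# Hypothetical toy data: feature 0 separates the classes, 1 and 2 are noise.
X = [[0.1, 5.0, 1.0], [0.2, 1.0, 2.0], [0.8, 4.0, 1.5], [0.9, 2.0, 0.5]]
y = ["a", "a", "b", "b"]

rng = random.Random(0)
# With k equal to the number of features, every feature is examined, so the
# relevant feature is always found and no randomization remains.
full = best_split_among_k(X, y, k=3, rng=rng)
# With k = 1, the split depends entirely on which single feature was drawn.
single = best_split_among_k(X, y, k=1, rng=rng)
```

In this sketch, a small K makes each node's split depend heavily on the random draw, while K equal to the total number of features removes the randomization altogether. When only a few features are relevant, as in the toy data, a small K will often miss them, which is in the spirit of the link between feature relevancy and the optimal K that the abstract reports.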
Main file
mcs09.pdf (110.83 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00436358, version 1 (26-11-2009)

Cite

Simon Bernard, Laurent Heutte, Sébastien Adam. Influence of Hyperparameters on Random Forest Accuracy. International Workshop on Multiple Classifier Systems (MCS), Jun 2009, Reykjavik, Iceland. pp.171-180, ⟨10.1007/978-3-642-02326-2_18⟩. ⟨hal-00436358⟩
