Analysis of a Random Forests Model - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2010

Analysis of a Random Forests Model

Résumé

Random forests are a scheme proposed by Leo Breiman in the 00's for building a predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we offer an in-depth analysis of a random forests model suggested by Breiman in 2004, which is very close to the original algorithm. We show in particular that the procedure is consistent and adapts to sparsity, in the sense that its rate of convergence depends only on the number of strong features and not on how many noise variables are present.
Fichier principal
Vignette du fichier
article2.pdf (548.56 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00476545 , version 1 (27-04-2010)
hal-00476545 , version 2 (25-10-2011)
hal-00476545 , version 3 (26-03-2012)

Identifiants

Citer

Gérard Biau. Analysis of a Random Forests Model. 2010. ⟨hal-00476545v1⟩
452 Consultations
439 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More