Preprints, Working Papers, ...

Arbres CART et Forêts aléatoires, Importance et sélection de variables [CART Trees and Random Forests, Variable Importance and Selection]

Abstract : This article covers two algorithms proposed by Leo Breiman: CART (Classification And Regression Trees), introduced in the first half of the 1980s, and random forests, which emerged in the early 2000s. For each topic, the goal is to provide a presentation, a theoretical guarantee, an example, and some variants and extensions. After a preamble, the introduction recalls the objectives of classification and regression problems before retracing some predecessors of random forests. A section is then devoted to CART trees, followed by a presentation of random forests. Next, a variable selection procedure based on permutation variable importance is proposed. Finally, the adaptation of random forests to the Big Data context is sketched.

https://hal.archives-ouvertes.fr/hal-01387654
Contributor : Robin Genuer
Submitted on : Tuesday, October 25, 2016 - 10:21:27 PM
Last modification on : Tuesday, March 24, 2020 - 4:10:24 PM

Files

genuer_poggi.chap_JES2016.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01387654, version 1
  • ARXIV : 1610.08203

Citation

Robin Genuer, Jean-Michel Poggi. Arbres CART et Forêts aléatoires, Importance et sélection de variables. 2016. ⟨hal-01387654v1⟩

Metrics

Record views : 122
File downloads : 1014