`python-m5p` - M5 Prime regression trees in python, compliant with scikit-learn - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

`python-m5p` - M5 Prime regression trees in python, compliant with scikit-learn

Sylvain Marié

Résumé

Regression trees are powerful Machine Learning models capable of both flexibility in modeling as well as interpretability when the tree is not too deep. The M5 algorithm was introduced by Quinlan in 1992 under the name "model tree" ; the algorithm is derived from classic regression trees (e.g. CART, Breiman et al., 1984), adding the possibility to prune the tree and use linear regression models at leaves. The goal is to reduce the number of branches and leaves in the tree, making it ultimately more interpretable and smooth. The M5 algorithm was improved by Wang & Witten in 1997, under the name M5 Prime (acronym M5' or M5P). The algorithm gained popularity in particular a dozen years later with the Weka Machine Learning toolbox, providing a java-based implementation. `python-m5p` is an implementation of the M5P algorithm compliant with scikit-learn.
220826 python-m5p - Sylvain MARIE 1.1.pdf (857.76 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03762155 , version 1 (26-08-2022)

Identifiants

  • HAL Id : hal-03762155 , version 1

Citer

Sylvain Marié. `python-m5p` - M5 Prime regression trees in python, compliant with scikit-learn. PyCon.DE & PyData Berlin, 2022, Apr 2022, Berlin, Germany. ⟨hal-03762155⟩
168 Consultations
176 Téléchargements

Partager

Gmail Facebook X LinkedIn More