Keep the Decision Tree and Estimate the Class Probabilities using its Decision Boundary

Isabelle Alvarez; Stephan Bernard; Guillaume Deffuant

Conference paper, Year: 2007

Isabelle Alvarez

Role: Author
PersonId: 10684
IdHAL: isabelle-alvarez
ORCID: 0000-0002-5268-8666
IdRef: 169009394

DECISION

Stephan Bernard

Role: Author
PersonId: 185013
IdHAL: stephan-bernard
ORCID: 0000-0001-9694-1443

Guillaume Deffuant

Role: Author
PersonId: 737683
IdHAL: guillaume-deffuant
ORCID: 0000-0001-6265-9300
IdRef: 128705337

Abstract

This paper proposes a new method to estimate the class membership probability of the cases classified by a decision tree. When the data are numerical, the method provides smooth class probability estimates that vary from case to case, without any modification of the tree. It applies a posteriori and does not use additional training cases. It relies on the distance to the decision boundary induced by the decision tree. This distance is computed on the training sample and is then used as the input of a very simple one-dimensional kernel-based density estimator, which provides an estimate of the class membership probability. This geometric method gives good results even with pruned trees, so the intelligibility of the tree is fully preserved.
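
The pipeline described in the abstract (distance to the decision boundary, then a one-dimensional kernel estimator) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the toy axis-parallel tree flattened into `LEAVES`, the Euclidean metric, the Gaussian kernel, the `bandwidth` value, the toy training sample, and all function names. The sketch also simplifies the target: it estimates the probability that the tree's prediction is correct at a given distance, and should not be read as the authors' exact estimator.

```python
import math

INF = math.inf

# Hypothetical axis-parallel tree, flattened into its leaf boxes:
# (lower bounds, upper bounds, predicted class). A case x falls in a
# leaf when lower < x[i] <= upper holds on every axis i.
LEAVES = [
    ((-INF, -INF), (0.5, INF), 0),   # x0 <= 0.5
    ((0.5, -INF), (INF, 0.5), 0),    # x0 > 0.5 and x1 <= 0.5
    ((0.5, 0.5), (INF, INF), 1),     # x0 > 0.5 and x1 > 0.5
]

# Toy training sample (assumed); two cases near the boundary are misclassified.
TRAIN_X = [(0.1, 0.1), (0.2, 0.9), (0.45, 0.9), (0.9, 0.9),
           (0.55, 0.6), (0.9, 0.2), (0.52, 0.55)]
TRAIN_Y = [0, 0, 1, 1, 0, 0, 1]


def predict(x):
    """Return the class of the leaf box that contains x."""
    for lo, hi, label in LEAVES:
        if all(l < v <= h for l, v, h in zip(lo, x, hi)):
            return label
    raise ValueError("x is not covered by any leaf")


def dist_to_box(x, lo, hi):
    """Euclidean distance from x to the axis-parallel box [lo, hi]."""
    return math.sqrt(sum(max(l - v, 0.0, v - h) ** 2
                         for l, v, h in zip(lo, x, hi)))


def boundary_distance(x):
    """Distance from x to the nearest leaf that predicts another class."""
    label = predict(x)
    return min(dist_to_box(x, lo, hi)
               for lo, hi, other in LEAVES if other != label)


def prob_prediction_correct(x, train_x, train_y, bandwidth=0.1):
    """Kernel-weighted share of correctly classified training cases
    whose boundary distance is close to that of x."""
    d = boundary_distance(x)
    num = den = 0.0
    for xi, yi in zip(train_x, train_y):
        u = (d - boundary_distance(xi)) / bandwidth
        w = math.exp(-0.5 * u * u)          # Gaussian kernel (assumed)
        num += w * (predict(xi) == yi)
        den += w
    return num / den if den > 0.0 else float("nan")


if __name__ == "__main__":
    for query in [(0.9, 0.95), (0.53, 0.56)]:
        print(query, predict(query), boundary_distance(query),
              prob_prediction_correct(query, TRAIN_X, TRAIN_Y))
```

In this sketch the tree itself is never modified: the leaves are only read to compute distances, and the estimate changes smoothly as a case moves away from the boundary instead of being constant within a leaf.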

Keywords

DECISION TREE, LEARNING, KERNEL METHOD

LISC

Domains

Computer Science [cs]

https://hal.science/hal-01335029

Submitted on: Tuesday, June 21, 2016, 15:59:21

Last modified on: Tuesday, April 11, 2023, 15:16:28

Dates and versions

hal-01335029, version 1 (21-06-2016)

Identifiers

HAL Id: hal-01335029, version 1
IRSTEA: PUB00020610

Cite

Isabelle Alvarez, Stephan Bernard, Guillaume Deffuant. Keep the Decision Tree and Estimate the Class Probabilities using its Decision Boundary. The 20th International Joint Conference on Artificial Intelligence 2007, Jan 2007, Hyderabad, India. pp.654-659. ⟨hal-01335029⟩

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES
