Ranking Forests - HAL open archive
Preprint, Working Paper. Year: 2010

Ranking Forests

Abstract

This paper examines how the aggregation and feature randomization principles underlying the RANDOM FOREST algorithm [1], originally proposed for classification and regression, can be adapted to bipartite ranking in order to improve the performance of scoring rules produced by the TREERANK algorithm [2], a recently developed tree induction method specifically tailored to this global learning problem. Since TREERANK may be viewed as a recursive implementation of a cost-sensitive version of the popular classification algorithm CART [3], with a cost that depends locally on the data lying within the node to split, various strategies can be considered for "randomizing" the features involved in the tree-growing stage. In parallel, several ways of combining or averaging ranking trees may be used, including techniques inspired by rank aggregation methods recently popularized in Web applications. Ranking procedures based on such approaches are called RANKING FORESTS. Beyond preliminary theoretical background, results of experiments on simulated data are provided as evidence of their statistical performance.
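The two ingredients named in the abstract, feature randomization and aggregation of ranking trees, can be illustrated with a minimal sketch. It is not the paper's procedure: depth-one stumps chosen to maximize empirical AUC stand in for full TREERANK trees, and averaging mid-ranks (a Borda-style rule) is only one of several possible aggregation schemes. All function names, parameters, and the synthetic data are assumptions made for illustration.

```python
import random


def auc(scores, labels):
    """Empirical AUC: share of (positive, negative) pairs ordered correctly; ties count 1/2."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def mid_ranks(scores):
    """Ranks starting at 1 for the lowest score; tied scores share their average rank."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    r = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = avg
        i = j + 1
    return r


def fit_stump(X, y, feature_ids):
    """Assumed stand-in for one ranking tree: the single split with the best empirical AUC."""
    best = (0.5, feature_ids[0], float("-inf"), 1)  # (auc, feature, threshold, sign)
    for f in feature_ids:
        for t in sorted({row[f] for row in X}):
            a = auc([1.0 if row[f] > t else 0.0 for row in X], y)
            # Reversing the orientation of the split turns an AUC of a into 1 - a.
            for sign, val in ((1, a), (-1, 1.0 - a)):
                if val > best[0]:
                    best = (val, f, t, sign)
    return best[1:]


def score_stump(stump, X):
    f, t, sign = stump
    return [sign * (1.0 if row[f] > t else 0.0) for row in X]


def fit_forest(X, y, n_trees=25, n_features=2, seed=0):
    """Bootstrap the data and draw a random feature subset for each base ranker."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    forest = []
    while len(forest) < n_trees:
        idx = [rng.randrange(n) for _ in range(n)]
        yb = [y[i] for i in idx]
        if len(set(yb)) < 2:  # a bootstrap sample needs both classes
            continue
        feats = rng.sample(range(d), n_features)
        forest.append(fit_stump([X[i] for i in idx], yb, feats))
    return forest


def forest_scores(forest, X):
    """Aggregate by averaging, for each point, its mid-rank under every base ranker."""
    per_tree = [mid_ranks(score_stump(s, X)) for s in forest]
    return [sum(r[i] for r in per_tree) / len(per_tree) for i in range(len(X))]


def make_data(n=60, d=5, seed=1):
    """Synthetic data (assumption): only the first two features carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(0.8 * label if j < 2 else 0.0, 1.0) for j in range(d)])
        y.append(label)
    return X, y


X, y = make_data()
forest = fit_forest(X, y)
print("training AUC of the aggregated scoring rule:", round(auc(forest_scores(forest, X), y), 3))
```

Averaging ranks rather than raw scores puts every base ranker on the same scale before combining, which is the reason a rank aggregation rule is used in this sketch; plain score averaging would be an equally valid choice to compare against.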
Main file
Forest_IEEE_sub.pdf (453.73 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00452577, version 1 (02-02-2010)

Identifiers

  • HAL Id: hal-00452577, version 1

Cite

Stéphan Clémençon. Ranking Forests. 2010. ⟨hal-00452577⟩
161 views
315 downloads
