Journal Article, Journal of Machine Learning Research, Year: 2007

A Stochastic Algorithm for Feature Selection in Pattern Recognition

Laurent Younes
  • Role: Author
  • PersonId : 875034

Abstract

We introduce a new model addressing feature selection from a large dictionary of variables that can be computed from a signal or an image. Features are extracted according to an efficiency criterion, on the basis of specified classification or recognition tasks. This is done by estimating a probability distribution P on the complete dictionary, which distributes its mass over the more efficient, or informative, components. We implement a stochastic gradient descent algorithm, using the probability as a state variable and optimizing a multi-task goodness-of-fit criterion for classifiers based on variables randomly chosen according to P. We then generate classifiers from the optimal distribution of weights learned on the training set. The method is first tested on several pattern recognition problems, including face detection, handwritten digit recognition, spam classification and micro-array analysis. We then compare our approach with other step-wise algorithms such as random forests or recursive feature elimination.
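The idea in the abstract, treating the distribution P over features as the state of a stochastic gradient descent, can be illustrated with a small sketch. This is a simplified illustration under stated assumptions, not the authors' exact algorithm: the nearest-centroid classifier, the score-function gradient estimate with a running baseline, the uniform mixing weight `mix`, the step size `lr`, and the helper `make_toy_data` are all hypothetical choices made for this example.

```python
import numpy as np


def project_to_simplex(v):
    """Euclidean projection of a vector onto the probability simplex."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u) - 1.0
    idx = np.arange(1, len(v) + 1)
    rho = np.nonzero(u - css / idx > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)


def subset_error(X_tr, y_tr, X_va, y_va, feats):
    """Validation error of a nearest-centroid classifier (an assumed stand-in
    for the paper's classifiers) restricted to the drawn features."""
    A, B = X_tr[:, feats], X_va[:, feats]
    mu0 = A[y_tr == 0].mean(axis=0)
    mu1 = A[y_tr == 1].mean(axis=0)
    d0 = ((B - mu0) ** 2).sum(axis=1)
    d1 = ((B - mu1) ** 2).sum(axis=1)
    pred = (d1 < d0).astype(int)
    return float((pred != y_va).mean())


def learn_feature_distribution(X_tr, y_tr, X_va, y_va, k=3, steps=500,
                               lr=0.002, mix=0.1, seed=0):
    """Projected stochastic gradient descent on P (illustrative only)."""
    rng = np.random.default_rng(seed)
    n_feat = X_tr.shape[1]
    P = np.full(n_feat, 1.0 / n_feat)   # start from the uniform distribution
    baseline = 0.5                      # running average of the error
    for _ in range(steps):
        # Draw k features i.i.d. according to P and score the classifier.
        feats = rng.choice(n_feat, size=k, replace=True, p=P)
        err = subset_error(X_tr, y_tr, X_va, y_va, feats)
        # Score-function estimate of d E[err] / dP_j: err * count_j / P_j.
        counts = np.bincount(feats, minlength=n_feat)
        grad = np.zeros(n_feat)
        drawn = counts > 0
        grad[drawn] = (err - baseline) * counts[drawn] / P[drawn]
        baseline = 0.9 * baseline + 0.1 * err
        # Gradient step, then projection back onto the simplex.
        P = project_to_simplex(P - lr * grad)
        # Assumed safeguard: mix with uniform so every feature stays reachable.
        P = (1.0 - mix) * P + mix / n_feat
        P /= P.sum()
    return P


def make_toy_data(n=200, n_feat=20, n_informative=3, seed=0):
    """Hypothetical data: only the first n_informative columns carry signal."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, 2, size=2 * n)
    X = rng.normal(size=(2 * n, n_feat))
    X[:, :n_informative] += 2.0 * y[:, None]
    return X[:n], y[:n], X[n:], y[n:]


if __name__ == "__main__":
    X_tr, y_tr, X_va, y_va = make_toy_data()
    P = learn_feature_distribution(X_tr, y_tr, X_va, y_va)
    print(np.round(P, 3))
```

On toy data of this kind one would hope to see P place more mass on the informative columns than on the noise columns, although the estimate is noisy and this is not guaranteed for every seed or step size. A final classifier could then be built from features drawn according to the learned P, in the spirit of the abstract.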
Main file
gadat_younes_jmlr_2007.pdf (422.98 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-00714862, version 1 (05-07-2012)

Identifiers

  • HAL Id: hal-00714862, version 1

Cite

Sébastien Gadat, Laurent Younes. A Stochastic Algorithm for Feature Selection in Pattern Recognition. Journal of Machine Learning Research, 2007, 8, pp.509-547. ⟨hal-00714862⟩
