Advances in Feature Selection with Mutual Information

Michel Verleysen; Fabrice Rossi; Damien François

doi:10.1007/978-3-642-01805-3_4

Chapitre D'ouvrage Année : 2009

Advances in Feature Selection with Mutual Information

(1) , (2) , (3)

1
2
3

Michel Verleysen

Fonction : Auteur

Dispositifs Intégrés et Circuits Electroniques Machine Learning Group

Fabrice Rossi

Fonction : Auteur correspondant
PersonId : 77
IdHAL : fabrice-rossi
ORCID : 0000-0003-4638-1286
IdRef : 22611385X

Connectez-vous pour contacter l'auteur

Laboratoire Traitement et Communication de l'Information

Damien François

Fonction : Auteur

Centre for Systems Engineering and Applied Mechanics

Résumé

The selection of features that are relevant for a prediction or classification problem is an important problem in many domains involving high-dimensional data. Selecting features helps fighting the curse of dimensionality, improving the performances of prediction or classification methods, and interpreting the application. In a nonlinear context, the mutual information is widely used as relevance criterion for features and sets of features. Nevertheless, it suffers from at least three major limitations: mutual information estimators depend on smoothing parameters, there is no theoretically justified stopping criterion in the feature selection greedy procedure, and the estimation itself suffers from the curse of dimensionality. This chapter shows how to deal with these problems. The two first ones are addressed by using resampling techniques that provide a statistical basis to select the estimator parameters and to stop the search procedure. The third one is addressed by modifying the mutual information criterion into a measure of how features are complementary (and not only informative) for the problem at hand.

Mots clés

Feature Selection Mutual Information Resampling Forward selection

Domaines

Apprentissage [cs.LG] Théorie de l'information [cs.IT] Théorie de l'information et codage [math.IT]

Fichier principal

paperMV.pdf (209.15 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Fabrice Rossi : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00413154

Soumis le : jeudi 3 septembre 2009-12:37:03

Dernière modification le : lundi 22 avril 2024-13:43:39

Archivage à long terme le : mardi 15 juin 2010-23:07:50

Dates et versions

hal-00413154 , version 1 (03-09-2009)

Identifiants

HAL Id : hal-00413154 , version 1
ARXIV : 0909.0635
DOI : 10.1007/978-3-642-01805-3_4

Citer

Michel Verleysen, Fabrice Rossi, Damien François. Advances in Feature Selection with Mutual Information. Villmann, Th.; Biehl, M.; Hammer, B.; Verleysen, M. Similarity-Based Clustering, Springer Berlin / Heidelberg, pp.52-69, 2009, Lecture Notes in Computer Science, ⟨10.1007/978-3-642-01805-3_4⟩. ⟨hal-00413154⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI

138 Consultations

134 Téléchargements

Advances in Feature Selection with Mutual Information

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager