High-Dimensional Non-Linear Variable Selection through Hierarchical Kernel Learning - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2009

High-Dimensional Non-Linear Variable Selection through Hierarchical Kernel Learning

Résumé

We consider the problem of high-dimensional non-linear variable selection for supervised learning. Our approach is based on performing linear selection among exponentially many appropriately defined positive definite kernels that characterize non-linear interactions between the original variables. To select efficiently from these many kernels, we use the natural hierarchical structure of the problem to extend the multiple kernel learning framework to kernels that can be embedded in a directed acyclic graph; we show that it is then possible to perform kernel selection through a graph-adapted sparsity-inducing norm, in polynomial time in the number of selected kernels. Moreover, we study the consistency of variable selection in high-dimensional settings, showing that under certain assumptions, our regularization framework allows a number of irrelevant variables which is exponential in the number of observations. Our simulations on synthetic datasets and datasets from the UCI repository show state-of-the-art predictive performance for non-linear regression problems.
Fichier principal
Vignette du fichier
hkl_hal.pdf (579.61 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00413473 , version 1 (04-09-2009)

Identifiants

Citer

Francis Bach. High-Dimensional Non-Linear Variable Selection through Hierarchical Kernel Learning. 2009. ⟨hal-00413473⟩
265 Consultations
754 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More