Collaborative Sliced Inverse Regression

Alessandro Chiancone 1, 2 Stephane Girard 2 Jocelyn Chanussot 3, 4
1 GIPSA-SAIGA - SAIGA
GIPSA-DA - Département Automatique, GIPSA-DIS - Département Images et Signal
2 MISTIS - Modelling and Inference of Complex and Structured Stochastic Systems
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
4 GIPSA-SIGMAPHY - SIGMAPHY
GIPSA-DIS - Département Images et Signal
Abstract: In multidimensional data analysis, one has to deal with a dataset X made of n points in dimension p. When n and p are simultaneously large, classical statistical analysis methods and models fail. Supervised and unsupervised dimensionality reduction techniques are widely used to preprocess high-dimensional data while retaining the information needed to solve the original problem. In the regression context, Sliced Inverse Regression (SIR) has proven to achieve good results in retrieving a basis of the so-called effective dimension reduction (e.d.r.) space, i.e. the smallest space containing the information needed to correctly regress the function. Recently, many papers have focused on the complex structure of real data, showing that the data are often organized in subspaces. Kuentz & Saracco (2009) proposed to cluster X and apply SIR in each cluster to better satisfy the so-called linearity condition. Our hypothesis is that the e.d.r. space is not unique across the data and that different clusters can be assigned to different e.d.r. spaces. We introduce a novel technique to identify the number of e.d.r. spaces, based on a weighted distance between the different spaces. First we cluster the data (in our simulation study we used standard k-means), then we apply SIR independently in each cluster. A greedy merging algorithm is proposed to assign each cluster to its e.d.r. space, taking into account the size of the cluster on which SIR is performed. Our approach is illustrated on simulated data from a Gaussian mixture model. This work is funded by LabEx Persyval.
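The per-cluster step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes NumPy, uses known cluster labels instead of k-means, and compares one-dimensional directions with a plain squared cosine as a stand-in for the weighted distance mentioned above (whose exact form is not given here). The names `sir_directions`, `proximity` and `simulate_cluster`, and the cubic link function, are hypothetical choices made for the example.

```python
import numpy as np

def sir_directions(X, y, n_slices=10, n_dirs=1):
    """Basic SIR: return n_dirs unit-norm e.d.r. direction estimates (columns)."""
    n, p = X.shape
    mu = X.mean(axis=0)
    Sigma = np.cov(X, rowvar=False)
    # Slice the sorted response into groups of roughly equal size.
    slices = np.array_split(np.argsort(y), n_slices)
    # Weighted covariance of the slice means of X around the global mean.
    Gamma = np.zeros((p, p))
    for idx in slices:
        d = X[idx].mean(axis=0) - mu
        Gamma += (len(idx) / n) * np.outer(d, d)
    # Solve Gamma b = lambda Sigma b through the Cholesky factor Sigma = L L^T.
    L = np.linalg.cholesky(Sigma)
    M = np.linalg.solve(L, np.linalg.solve(L, Gamma).T)  # L^{-1} Gamma L^{-T}
    _, V = np.linalg.eigh((M + M.T) / 2)                 # ascending eigenvalues
    B = np.linalg.solve(L.T, V[:, ::-1][:, :n_dirs])     # back to original scale
    return B / np.linalg.norm(B, axis=0)

def proximity(a, b):
    """Squared cosine between two directions (assumed stand-in measure)."""
    a, b = np.ravel(a), np.ravel(b)
    return float(np.dot(a, b) ** 2 / (np.dot(a, a) * np.dot(b, b)))

def simulate_cluster(rng, beta, n=500, shift=0.0):
    """Gaussian cluster whose response depends on X only through X @ beta."""
    X = rng.standard_normal((n, beta.size)) + shift
    y = (X @ beta) ** 3 + 0.1 * rng.standard_normal(n)
    return X, y

# Two clusters with different (here orthogonal) true directions.
rng = np.random.default_rng(0)
beta1 = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
beta2 = np.array([0.0, 1.0, 0.0, 0.0, 0.0])
X1, y1 = simulate_cluster(rng, beta1)
X2, y2 = simulate_cluster(rng, beta2, shift=5.0)

# SIR is applied independently in each cluster.
b1_hat = sir_directions(X1, y1)
b2_hat = sir_directions(X2, y2)
print("cluster 1 vs truth:", proximity(b1_hat, beta1))
print("cluster 2 vs truth:", proximity(b2_hat, beta2))
print("cluster 1 vs cluster 2:", proximity(b1_hat, b2_hat))
```

In this sketch, two clusters whose estimated directions score close to 1 would be natural candidates for sharing an e.d.r. space, while a score close to 0 suggests distinct spaces. How cluster sizes enter the weighting and how the greedy merging proceeds are not specified in this record and are left out.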
Document type:
Conference paper
Rencontres d'Astrostatistique, Nov 2014, Grenoble, France

https://hal.archives-ouvertes.fr/hal-01086931
Contributor: Stephane Girard
Submitted on: Tuesday, 25 November 2014 - 11:50:38
Last modified on: Saturday, 12 March 2016 - 20:22:04

Identifiers

  • HAL Id: hal-01086931, version 1

Citation

Alessandro Chiancone, Stephane Girard, Jocelyn Chanussot. Collaborative Sliced Inverse Regression. Rencontres d'Astrostatistique, Nov 2014, Grenoble, France. 〈hal-01086931〉
