Conference papers

Collaborative Sliced Inverse Regression

Alessandro Chiancone 1, 2 Stéphane Girard 2 Jocelyn Chanussot 3, 4
1 GIPSA-SAIGA - GIPSA - Signal et Automatique pour la surveillance, le diagnostic et la biomécanique
GIPSA-DA - Département Automatique, GIPSA-DIS - Département Images et Signal
2 MISTIS - Modelling and Inference of Complex and Structured Stochastic Systems
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology
4 GIPSA-SIGMAPHY - GIPSA - Signal Images Physique
GIPSA-DIS - Département Images et Signal
Abstract : In multidimensional data analysis, one has to deal with a dataset X made of n points in dimension p. When n and p are simultaneously large, classical statistical analysis methods and models fail. Supervised and unsupervised dimensionality reduction techniques are widely used to preprocess high-dimensional data while retaining the information useful to solve the original problem. In the regression context, Sliced Inverse Regression (SIR) has proven to achieve good results in retrieving a basis of the so-called effective dimension reduction (e.d.r.) space, i.e. the smallest space containing the information needed to correctly regress the function. Recently, many papers have focused on the complex structure of real data, showing that the data is often organized in subspaces. Kuentz & Saracco (2009) proposed to cluster X and apply SIR in each cluster to better fit the so-called linearity condition. Our hypothesis is that the e.d.r. space is not unique across the data and that different clusters can be assigned to different e.d.r. spaces. We introduce a novel technique to identify the number of e.d.r. spaces, based on a weighted distance between the different spaces. First we cluster the data (in our simulation study we used standard k-means), then we apply SIR independently in each cluster. A greedy merging algorithm is proposed to assign each cluster to its e.d.r. space, taking into account the size of the cluster on which SIR is performed. Our approach is illustrated on simulated data from a Gaussian mixture model. This work is funded by LabEx Persyval.
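The pipeline described in the abstract (SIR in each cluster, then a greedy merge of clusters whose estimated directions agree) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the weighted distance or the merging rule, so the function names (sir_direction, proximity, greedy_merge), the squared-cosine proximity, the merge threshold, and the choice of representing a merged group by the direction of its largest cluster are all assumptions made for illustration. The sketch assumes a one-dimensional e.d.r. space per cluster and omits the clustering step (for example k-means); greedy_merge takes the per-cluster directions and cluster sizes as input.

```python
import numpy as np

def sir_direction(X, y, n_slices=10):
    """Leading SIR direction: top eigenvector of Sigma^{-1} Gamma, where
    Gamma is the covariance of the slice means of the centered X."""
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    sigma = Xc.T @ Xc / n
    order = np.argsort(y)                      # slice the data along sorted y
    gamma = np.zeros((p, p))
    for idx in np.array_split(order, n_slices):
        m = Xc[idx].mean(axis=0)               # mean of X within the slice
        gamma += (len(idx) / n) * np.outer(m, m)
    vals, vecs = np.linalg.eig(np.linalg.solve(sigma, gamma))
    b = np.real(vecs[:, np.argmax(np.real(vals))])
    return b / np.linalg.norm(b)

def proximity(a, b):
    """Squared cosine between two directions (assumed measure):
    1 means collinear, 0 means orthogonal."""
    return float(a @ b) ** 2 / float((a @ a) * (b @ b))

def greedy_merge(directions, sizes, threshold=0.9):
    """Hypothetical greedy merge: repeatedly fuse the two closest groups
    until no pair reaches the threshold. Each group is represented by the
    direction estimated on its largest cluster (an assumed way of taking
    cluster size into account)."""
    groups = [[i] for i in range(len(directions))]

    def rep(group):
        return directions[max(group, key=lambda i: sizes[i])]

    while len(groups) > 1:
        best, pair = -1.0, None
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                s = proximity(rep(groups[i]), rep(groups[j]))
                if s > best:
                    best, pair = s, (i, j)
        if best < threshold:
            break
        i, j = pair
        groups[i] = groups[i] + groups[j]
        del groups[j]
    return groups
```

In use, one would call sir_direction on the points of each cluster, pass the resulting directions and the cluster sizes to greedy_merge, and read the number of returned groups as the estimated number of e.d.r. spaces. The threshold of 0.9 is arbitrary and would need tuning on real data.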
Document type :
Conference papers

Contributor : Stephane Girard
Submitted on : Tuesday, November 25, 2014 - 11:50:38 AM
Last modification on : Thursday, January 20, 2022 - 5:30:16 PM


  • HAL Id : hal-01086931, version 1


Alessandro Chiancone, Stéphane Girard, Jocelyn Chanussot. Collaborative Sliced Inverse Regression. Rencontres d'Astrostatistique, Nov 2014, Grenoble, France. ⟨hal-01086931⟩


