A sliced inverse regression for data stream

Abstract: In this article, we focus on data arriving sequentially by blocks in a stream. A semiparametric regression model involving a common EDR (Effective Dimension Reduction) direction is assumed in each block. Our goal is to estimate this direction each time a new block arrives. A simple direct approach consists of pooling all the observed blocks and estimating the EDR direction by the SIR (Sliced Inverse Regression) method. In practice, however, this approach has drawbacks: every block must be stored, and the running time grows with the sample size. To overcome these drawbacks, we propose an adaptive SIR estimator of the EDR direction based on the optimization of a quality measure. This approach is faster, both in computational complexity and in running time, and reduces data storage requirements. The consistency of our estimator is established and its asymptotic distribution is given. An extension to multiple-index models is proposed. A graphical tool is also provided to detect changes in the underlying model, i.e., drift in the EDR direction or aberrant blocks in the data stream. A simulation study illustrates the numerical behavior of our estimator. Finally, an application to real data concerning the estimation of physical properties of the Mars surface is presented.
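
The "simple direct approach" mentioned in the abstract (pool all blocks, then run SIR) can be sketched as follows. This is a minimal illustration of classical SIR only, not the adaptive estimator proposed in the article; the function name `sir_direction`, the `n_slices` parameter, and the equal-count slicing rule are assumptions made for the example.

```python
import numpy as np


def sir_direction(X, y, n_slices=10):
    """Estimate one EDR direction by classical SIR (illustrative sketch).

    X is an (n, p) array of covariates with p >= 2, y an (n,) response.
    Returns a unit-norm vector; its sign is arbitrary.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    x_bar = X.mean(axis=0)
    sigma = np.cov(X, rowvar=False, bias=True)  # covariance of X

    # Slice the sample by sorting y and cutting it into near-equal groups.
    order = np.argsort(y)
    gamma = np.zeros((p, p))
    for idx in np.array_split(order, n_slices):
        m_h = X[idx].mean(axis=0) - x_bar       # centered slice mean
        gamma += (len(idx) / n) * np.outer(m_h, m_h)

    # Solve gamma b = lambda sigma b through a Cholesky change of basis,
    # which turns it into an ordinary symmetric eigenproblem.
    chol = np.linalg.cholesky(sigma)            # sigma = chol @ chol.T
    a = np.linalg.solve(chol, gamma)            # chol^{-1} gamma
    m = np.linalg.solve(chol, a.T)              # chol^{-1} gamma chol^{-T}
    m = (m + m.T) / 2.0                         # remove rounding asymmetry
    _, eigvecs = np.linalg.eigh(m)              # eigenvalues in ascending order
    b = np.linalg.solve(chol.T, eigvecs[:, -1]) # map top eigenvector back
    return b / np.linalg.norm(b)
```

With blocks held as a list of `(X_t, y_t)` pairs, the pooled estimate would be `sir_direction(np.vstack([X_t for X_t, _ in blocks]), np.concatenate([y_t for _, y_t in blocks]))`. Recomputing this at every arrival requires keeping all past blocks in memory, which is the storage and running-time cost the abstract refers to.
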
Document type:
Journal article
Computational Statistics, Springer Verlag, 2014, 29, pp.1129-1152

Cited literature: 26 references

https://hal.archives-ouvertes.fr/hal-01139870
Contributor: Import Ws Irstea
Submitted on: Tuesday, 7 April 2015 - 11:43:34
Last modified on: Friday, 5 October 2018 - 18:50:13
Document(s) archived on: Tuesday, 18 April 2017 - 12:00:18

File

bx2014-pub00042031.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01139870, version 1
  • IRSTEA: PUB00042031

Citation

Marie Chavent, Stéphane Girard, V. Kuentz Simonet, Benoit Liquet, Thi Mong Ngoc Nguyen, et al. A sliced inverse regression for data stream. Computational Statistics, Springer Verlag, 2014, 29, pp.1129-1152. 〈hal-01139870〉
