Parameter selection for principal curves - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2011

Parameter selection for principal curves

Résumé

Principal curves are nonlinear generalizations of the notion of first principal component. Roughly, a principal curve is a parameterized curve in Rd which passes through the "middle" of a data cloud drawn from some unknown probability distribution. Depending on the definition, a principal curve relies on some unknown parameters (number of segments, length, turn. . . ) which have to be properly chosen to recover the shape of the data without interpolating. In the present paper, we consider the principal curve problem from an empirical risk minimization perspective and address the parameter selection issue using the point of view of model selection via penalization. We offer oracle inequalities and implement the proposed approaches to recover the hidden structures in both simulated and real-life data.
Fichier principal
Vignette du fichier
Parameter_selection_for_principal_curves.pdf (972.59 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00565540 , version 1 (13-02-2011)
hal-00565540 , version 2 (10-10-2011)

Identifiants

  • HAL Id : hal-00565540 , version 2

Citer

Gérard Biau, Aurélie Fischer. Parameter selection for principal curves. 2011. ⟨hal-00565540v2⟩
251 Consultations
656 Téléchargements

Partager

Gmail Facebook X LinkedIn More