High–Dimensional Sparse Matched Case–Control and Case–Crossover Data: A Review of Recent Works, Description of an R Tool and an Illustration of the Use in Epidemiological Studies

Marta Avalos 1, 2 Yves Grandvalet 3, 4 Hélène Pouyes 1 Ludivine Orriols 5 Emmanuel Lagarde 5, 1
2 SISTM - Statistics In System biology and Translational Medicine
Epidémiologie et Biostatistique [Bordeaux], Inria Bordeaux - Sud-Ouest
5 Prévention et prise en charge des traumatismes [Bordeaux]
Université Bordeaux Segalen - Bordeaux 2, Inria - Institut National de Recherche en Informatique et en Automatique, INSERM - Institut National de la Santé et de la Recherche Médicale : U897
Abstract : The conditional logistic regression model is the standard tool for the analysis of epidemiological studies in which one or more cases (the event of interest), are matched with one or more controls (not showing the event). These situations arise, for example, in matched case–control and case–crossover studies. In sparse and high-dimensional settings, penalized methods, such as the Lasso, have emerged as an alternative to conventional estimation and variable selection procedures. We describe the R package clogitLasso, which brings together algorithms to estimate parameters of conditional logistic models using sparsity-inducing penalties. Most individually matched designs are covered, and, beside Lasso, Elastic Net, adaptive Lasso and bootstrapped versions are available. Different criteria for choosing the regularization term are implemented, accounting for the dependency of data. Finally, stability is assessed by resampling methods. We previously review the recent works pertaining to clogitLasso. We also report the use in exploratory analysis of a large pharmacoepidemiological study.
Document type :
Book sections
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01099313
Contributor : Marta Avalos <>
Submitted on : Friday, January 2, 2015 - 7:07:04 PM
Last modification on : Monday, February 4, 2019 - 10:04:02 AM

Identifiers

  • HAL Id : hal-01099313, version 1

Citation

Marta Avalos, Yves Grandvalet, Hélène Pouyes, Ludivine Orriols, Emmanuel Lagarde. High–Dimensional Sparse Matched Case–Control and Case–Crossover Data: A Review of Recent Works, Description of an R Tool and an Illustration of the Use in Epidemiological Studies. Enrico Formenti, Roberto Tagliaferri, Ernst Wit. Computational Intelligence Methods for Bioinformatics and Biostatistics, 8452, Springer, pp.109-124, 2014, Lecture Notes in Computer Science, 978-3-319-09041-2. ⟨http://link.springer.com/book/10.1007%2F978-3-319-09042-9⟩. ⟨hal-01099313⟩

Share

Metrics

Record views

291