Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

An ensemble learning method for variable selection: application to high dimensional data and missing values

Abstract : Standard approaches for variable selection in linear models are not tailored to deal properly with high dimensional and incomplete data. Currently, methods dedicated to high dimensional data handle missing values by ad-hoc strategies, like complete case analysis or single imputation, while methods dedicated to missing values, mainly based on multiple imputation, do not discuss the imputation method to use with high dimensional data. Consequently, both approaches appear to be limited for many modern applications. With inspiration from ensemble methods, a new variable selection method is proposed. It extends classical variable selection methods such as stepwise, lasso or knockoff in the case of high dimensional data with or without missing data. Theoretical properties are studied and the practical interest is demonstrated through a simulation study. In the low dimensional case, the procedure improves the control of the error risks, especially type I error, even without missing values. With missing values, the method performs better than reference selection methods based on multiple imputation. Similar performances are obtained in the high-dimensional case with or without missing values.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [22 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02635566
Contributor : Vincent Audigier <>
Submitted on : Thursday, June 18, 2020 - 2:13:33 PM
Last modification on : Friday, June 19, 2020 - 9:51:54 AM

File

1808.06952.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02635566, version 1
  • ARXIV : 1808.06952

Citation

Avner Bar-Hen, Vincent Audigier. An ensemble learning method for variable selection: application to high dimensional data and missing values. 2020. ⟨hal-02635566⟩

Share

Metrics

Record views

16

Files downloads

6