R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society. Series B (Methodological), pp.267-288, 1996.

R. Francis and . Bach, Bolasso: model consistent lasso estimation through the bootstrap, Proceedings of the 25th international conference on Machine learning, pp.33-40, 2008.

R. F. Barber and E. J. Candès, Controlling the false discovery rate via knockoffs, The Annals of Statistics, vol.43, issue.5, pp.2055-2085, 2015.

R. F. Barber and E. J. Candes, A knockoff filter for high-dimensional selective inference, 2016.

F. E. Harrell, Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. Springer Series in Statistics, 2015.

L. Breiman, Heuristics of instability and stabilization in model selection. The annals of statistics, vol.24, pp.2350-2383, 1996.

N. Meinshausen and P. Bühlmann, Stability selection, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.72, issue.4, pp.417-473, 2010.

R. Genuer, J. Poggi, and C. Tuleau-malot, Variable selection using random forests, Pattern Recognition Letters, vol.31, issue.14, pp.2225-2236, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00755489

L. Wasserman and K. Roeder, High dimensional variable selection, Annals of statistics, vol.37, issue.5A, p.2178, 2009.

J. Schafer, Analysis of Incomplete Multivariate Data, 1997.

D. Rubin, Multiple Imputation for Non-Response in Survey, 1987.

R. Little and D. Rubin, Statistical Analysis with Missing Data. Wiley series in probability and statistics, 2002.

Y. Zhao and Q. Long, Variable selection in the presence of missing data: imputation-based methods, Wiley Interdisciplinary Reviews: Computational Statistics, vol.9, issue.5, p.1402

N. Städler and P. Bühlmann, Missing values: sparse inverse covariance estimation and an extension to sparse regression, Statistics and Computing, vol.22, issue.1, pp.219-235, 2012.

L. Po, M. Loh, and . Wainwright, High-dimensional regression with noisy and missing data: Provable guarantees with non-convexity, Advances in Neural Information Processing Systems, pp.2726-2734, 2011.

J. Galimard, S. Chevret, C. Protopopescu, and M. Resche-rigon, A multiple imputation approach for mnar mechanisms compatible with heckman's model, Statistics in Medicine, vol.35, issue.17, pp.2907-2920, 2016.

A. Kapelner and J. Bleich, Prediction with missing data via bayesian additive regression trees, Canadian Journal of Statistics, vol.43, issue.2, pp.224-239, 2015.

E. Patterson and M. Sesia, knockoff: The Knockoff Filter for Controlled Variable Selection, 2017.

V. Audigier, F. Husson, and J. Josse, Multiple imputation for continuous variables using a bayesian principal component analysis, Journal of Statistical Computation and Simulation, vol.86, issue.11, pp.2140-2156, 2016.
URL : https://hal.archives-ouvertes.fr/hal-00951915

Y. Liu, Y. Wang, Y. Feng, and M. M. Wall, Variable selection and prediction with incomplete high-dimensional data. The annals of applied statistics, vol.10, p.418, 2016.

. R-core-team, R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, 2019.

A. Alfons, C. Croux, and S. Gelper, Sparse least trimmed squares regression for analyzing high-dimensional large data sets. The Annals of Applied Statistics, pp.226-248, 2013.