Skip to Main content Skip to Navigation
New interface
Journal articles

Prediction of the Nash through Penalized Mixture of Logistic Regression Models

Marie Morvan 1 Emilie Devijver 2, 3 Madison Giacofci 1 Valérie Monbet 1, 4 
3 SVH - Statistique pour le Vivant et l’Homme
LJK - Laboratoire Jean Kuntzmann
4 SIMSMART - SIMulation pARTiculaire de Modèles Stochastiques
IRMAR - Institut de Recherche Mathématique de Rennes, Inria Rennes – Bretagne Atlantique
Abstract : In many medical problems, it is common to face heterogeneous data with unknown patients profiles leading to difficulties to build a good diagnosis model. In this paper, our aim is to build a suitable and interpretable diagnosis tool to predict the Non-Alcoholic Steatohepatitis (NASH), taking into account the structure and the dimension of the spectrometric data. Thus, we introduce a penalized mixture of logistic regression model that allows the prediction of a binary response. Parameters estimation is done using the EM algorithm. In the presence of a high number of covariates, estimation of the full covariance matrix and interpretation of the regression coefficients is not trivial. To highlight relevant covariates for the prediction and their links, we apply a penalization to the co-variance matrix and the regression coefficients. The estimated model depends on regularization parameters that allow to adjust the strength of the penalization. Automatic selection tools are used to choose the best model, namely with respect to the AIC criterion. A simulation study is performed to evaluate the proposed method, and the application on the NASH data set is presented. This model leads to better prediction performance than the competitive methods and provides useful tools to better understand the data.
Document type :
Journal articles
Complete list of metadata

Cited literature [31 references]  Display  Hide  Download
Contributor : Valérie Monbet Connect in order to contact the contributor
Submitted on : Saturday, December 28, 2019 - 10:33:07 AM
Last modification on : Tuesday, October 25, 2022 - 4:23:56 PM


Files produced by the author(s)



Marie Morvan, Emilie Devijver, Madison Giacofci, Valérie Monbet. Prediction of the Nash through Penalized Mixture of Logistic Regression Models. Annals of Applied Statistics, 2021, 15 (2), pp.952-970. ⟨10.1214/20-AOAS1409⟩. ⟨hal-02151564v2⟩



Record views


Files downloads