Breakpoint Model for Logistic Regression and Application to the detection of G x E Interactions

Abstract : Introduction : Complex diseases are known to be highly heterogeneous in nature. This heterogeneity can be due to various factors including genetic heterogeneity (eg: population stratification), phenotypic heterogeneity (ex: clinical diagnosis of schizophrenia), exposure heterogeneity to various environmental factors (eg: alcohol, drugs, pollution, etc.), and recruitment heterogeneity over time (the so-called « cohort-effect »). In the context of case-control studies, detecting and accounting for this heterogeneity can help to identify high-risk subgroups in the population and provide a better understanding of the disease. Method : In this context, we introduce a new way to detect and account any source of heterogeneity by introducing a breakpoint model for logistic regression. Our model is based on a constrained Hidden Markov Modelling using a constrained Markov model for the hidden segmentation and a logistic regression model for the observed part. Parameter training is performed by combining Forward-Backward recursion with the Expectation-Maximization algorithm. The model output includes both regression estimate in each segment and the full posterior distribution of the breakpoints. Results : We validate and illustrate the usefulness of our model both on simulated and realistic dataset. In particular, we show that if individuals are ordered according to some proximity space (eg: by increasing BMI (Body Mass Index)) we can use our model to detect interactions between genes and latent exposures by using a simple likelihood ratio testing framework. This last result seems particularly promising since it provides an unique way to distinguish between confounding factors (eg: sex for smoking) and genuine non-observed causal exposures.
Type de document :
Communication dans un congrès
International Biometric Conference, Jul 2016, Victoria, Canada
Liste complète des métadonnées
Contributeur : Flora Alarcon <>
Soumis le : jeudi 5 janvier 2017 - 14:26:20
Dernière modification le : mardi 10 octobre 2017 - 11:22:05


  • HAL Id : hal-01427253, version 1



Grégory Nuel, Flora Alarcon. Breakpoint Model for Logistic Regression and Application to the detection of G x E Interactions. International Biometric Conference, Jul 2016, Victoria, Canada. 〈hal-01427253〉



Consultations de la notice