Skip to Main content Skip to Navigation
Conference papers

Forests of latent tree models for the detection of genetic associations

Abstract : Together with the population aging concern, increasing health care costs require understanding the causal basis for common genetic diseases. The high dimensionality and complexity of genetic data hamper the detection of genetic associations. To alleviate the core risks (missing of the causal factor, spurious discoveries), machine learning offers an appealing alternative framework to standard statistical approaches. A novel class of probabilistic graphical models has recently been proposed - the forest of latent tree models - , to obtain a trade-off between faithful modeling of data dependences and tractability. In this paper, we evaluate the soundness of this modeling approach in an association genetics context. We have performed intensive tests, in various controlled conditions, on realistic simulated data. We have also tested the model on real data. Beside guaranteeing data dimension reduction through latent variables, the model is empirically proven able to capture indirect genetic associations with the disease, both on simulated and real data. Strong associations are evidenced between the disease and the ancestor nodes of the causal genetic marker node, in the forest. In contrast, very weak associations are obtained for other nodes.
Complete list of metadata
Contributor : Philippe Leray <>
Submitted on : Wednesday, November 2, 2011 - 10:58:17 AM
Last modification on : Friday, April 17, 2020 - 10:13:00 PM




Christine Sinoquet, Raphaël Mourad, Philippe Leray. Forests of latent tree models for the detection of genetic associations. International Conference on Bioinformatics Models, Methods and Algorithms (BIOINFORMATICS 2012), Feb 2012, Vilamoura, Portugal. pp.1-10, ⟨10.5220/0003703400050014⟩. ⟨hal-00637500⟩



Record views