Statistical methods of SNP data analysis with applications

Abstract : Various statistical methods important for genetic analysis are considered and developed. Namely, we concentrate on the multifactor dimensionality reduction, logic regression, random forests and stochastic gradient boosting. These methods and their new modifications, e.g., the MDR method with "independent rule", are used to study the risk of complex diseases such as cardiovascular ones. The roles of certain combinations of single nucleotide polymorphisms and external risk factors are examined. To perform the data analysis concerning the ischemic heart disease and myocardial infarction the supercomputer SKIF "Chebyshev" of the Lomonosov Moscow State University was employed.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [45 references]  Display  Hide  Download
Contributor : Alexander Bulinski <>
Submitted on : Friday, June 24, 2011 - 12:02:33 AM
Last modification on : Friday, January 10, 2020 - 6:02:02 PM
Long-term archiving on: Sunday, September 25, 2011 - 2:20:40 AM


Files produced by the author(s)


  • HAL Id : hal-00600143, version 1
  • ARXIV : 1106.4989


Alexander Bulinski, Oleg Butkovsky, Alexey Shashkin, Pavel Yaskov. Statistical methods of SNP data analysis with applications. 2011. ⟨hal-00600143⟩



Record views


Files downloads