Skip to Main content Skip to Navigation
Conference papers

The Pervasiveness of Machine Learning in Omics Science: Foundations, Methods and Applications

Abstract : Biology has become an enormously data-rich subject. Data is generated in many flavors and follows particularities of the omics perspective adopted along experimental studies. For instance, genomics is the field of study dealing with genomes and it is mostly associated with the static view (the genes and where they are placed along the genome). The dynamic view is brought from the transcriptomics perspective, so the gene expression and its regulation. Finally, interactomics is usually associated to gene products, proteins, and their interactions. However it could also be seen as a huge graph network with layers of interaction integrating distinct omics perspectives. Omics science applications of unsupervised and/or supervised machine learning (ML) techniques abound in the literature. In this tutorial, we discuss machine learning on omics data, putting the emphasis on (i) mapping and (ii) learning omics patterns. We consider three main omics data: genomics, transcriptomics and interactomics. For each perspective, we first provide, the biological problem, the data mapping (from a biological problem to a machine learning problem), the core ML methods employed and its implementation in the R language.
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download
Contributor : Nicolas Pasquier <>
Submitted on : Tuesday, June 14, 2016 - 2:39:42 PM
Last modification on : Tuesday, May 26, 2020 - 6:50:53 PM


  • HAL Id : hal-01330594, version 1


Ronnie Alves, Claude Pasquier, Nicolas Pasquier. The Pervasiveness of Machine Learning in Omics Science: Foundations, Methods and Applications. ECML/PKDD'2014 International Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases Tutorial T3, Sep 2014, Nancy, France. ⟨hal-01330594⟩



Record views


Files downloads