A consensus orthogonal partial least squares discriminant analysis (OPLS-DA) strategy for multiblock Omics data fusion - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Analytica Chimica Acta Année : 2013

A consensus orthogonal partial least squares discriminant analysis (OPLS-DA) strategy for multiblock Omics data fusion

Résumé

Omics approaches have proven their value to provide a broad monitoring of biological systems. However, as no single analytical technique is sufficient to reveal the full biochemical content of complex biological matrices or biofluids, the fusion of information from several data sources has become a decisive issue. Omics studies generate an increasing amount of massive data obtained from different analytical devices. These data are usually high dimensional and extracting knowledge from these multiple blocks is challenging. Appropriate tools are therefore needed to handle these datasets suitably. For that purpose, a generic methodology is proposed by combining the strengths of established data analysis strategies, i.e. multiple kernel learning and OPLS-DA to offer an efficient tool for the fusion of Omics data obtained from multiple sources. Three real case studies are proposed to assess the potential of the method. A first example illustrates the fusion of mass spectrometry-based metabolomic data acquired in both negative and positive electrospray ionisation modes, from leaf samples of the model plant Arabidopsis thaliana. A second dataset involves the classification of wine grape varieties based on polyphenolic extracts analysed by two-dimensional heteronuclear magnetic resonance spectroscopy. A third case study underlines the ability of the method to combine heterogeneous data from systems biology with the analysis of publicly available data related to NCI-60 cancer cell lines from different tissue origins, which include metabolomics, transcriptomics and proteomics. The fusion of Omics data from different sources is expected to provide a more complete view of biological systems. The proposed method was demonstrated as a relevant and widely applicable alternative to handle efficiently the inherent characteristics of multiple Omics data, such as very large numbers of noisy collinear variables.
Fichier non déposé

Dates et versions

hal-01252654 , version 1 (07-01-2016)

Identifiants

Citer

Julien Boccard, Douglas Rutledge. A consensus orthogonal partial least squares discriminant analysis (OPLS-DA) strategy for multiblock Omics data fusion. Analytica Chimica Acta, 2013, 769, pp.30-39. ⟨10.1016/j.aca.2013.01.022⟩. ⟨hal-01252654⟩
116 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More