Skip to Main content Skip to Navigation
Journal articles

Bioinformatics Methods to Select Prognostic Biomarker Genes from Large Scale Datasets: A Review

Abstract : With the increased availability of survival datasets, that comprise both molecular information (e.g., gene expression), and clinical information (e.g., patient survival), numerous genes are proposed as prognostic biomarkers. Despite efforts and money invested, very few of these biomarkers have been clinically validated and are used routinely. A high false discovery rate is assumed to be largely responsible for this, in particular as the number of tested genes is extremely high relative to the number of patients followed. Here, after describing the historical methodologies on which recent developments have often been based, this review describes studies that have been performed in the last few years. The concepts will be illustrated for a renal cancer dataset, and the corresponding scripts are provided (Supporting Information). These new developments belong to three main fields of applications. First, variable selection concerns various improvements to lasso penalization. Second, accurate definition of p-values and control of the false discovery rate have also been the subject of many studies. Third, the incorporation of biological knowledge, often through the form of networks or pathways, can be used as an a priori and/or to reduce dimensionality. These new and promising developments deserve benchmarking by independent groups not involved in their development, with various independent datasets. Further work on the methodologies is also still required.
Document type :
Journal articles
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-02078995
Contributor : Florent Chatelain Connect in order to contact the contributor
Submitted on : Monday, March 25, 2019 - 5:17:26 PM
Last modification on : Wednesday, November 3, 2021 - 5:11:36 AM

Identifiers

Citation

Rémy Jardillier, Florent Chatelain, Laurent Guyon. Bioinformatics Methods to Select Prognostic Biomarker Genes from Large Scale Datasets: A Review. Biotechnology Journal, Wiley-VCH Verlag, 2018, 13 (12), pp.1800103. ⟨10.1002/biot.201800103⟩. ⟨hal-02078995⟩

Share

Metrics

Record views

512