Journal article, Computational Statistics and Data Analysis, Year: 2008

A new genetic algorithm in proteomics: Feature selection for SELDI-TOF data

Abstract

Mass spectrometry of clinical specimens is used to identify diagnostic biomarkers, which requires a reliable method for both feature selection and classification. A novel method is proposed to find biomarkers in SELDI-TOF data in order to perform robust classification. Feature selection is based on a new genetic algorithm. For classification, a method based on decision stumps has been developed to account for the high variability in intensity. Moreover, because sample sizes are often small, it is more appropriate to use the decision stumps simultaneously than to build a complete tree. The thresholds of the decision stumps are determined within the same genetic algorithm. Finally, the method is generalized to more than two groups using pairwise coupling. The resulting algorithm was applied to two data sets: a publicly available one containing two groups, which allows comparison with other methods from the literature, and a new one containing three groups.
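
To make the idea of evolving features and stump thresholds together more concrete, here is a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify the chromosome encoding, the genetic operators, or the fitness function, so everything below is an assumption made for illustration. That includes the `(feature, threshold, direction)` gene, the majority vote over stumps, the elitist selection, the one-point crossover, and the synthetic data. The pairwise-coupling extension to more than two groups is not shown.

```python
import random

def stump_vote(x, chromosome):
    """Majority vote of decision stumps used simultaneously.
    Each gene is (feature index, threshold, direction); a stump votes for
    class 1 when (x[feature] > threshold) equals direction."""
    votes = sum(1 if (x[f] > t) == d else 0 for f, t, d in chromosome)
    return 1 if 2 * votes > len(chromosome) else 0

def fitness(chromosome, X, y):
    """Assumed fitness: plain training accuracy of the voted stumps."""
    return sum(stump_vote(x, chromosome) == label for x, label in zip(X, y)) / len(y)

def random_gene(X, rng):
    """Draw a feature, a threshold inside its observed range, and a direction."""
    f = rng.randrange(len(X[0]))
    values = [row[f] for row in X]
    return (f, rng.uniform(min(values), max(values)), rng.random() < 0.5)

def evolve(X, y, n_stumps=3, pop_size=30, generations=40, mutation_rate=0.2, seed=0):
    """Toy elitist GA: features and thresholds are selected in the same search.
    Requires n_stumps >= 2 (one-point crossover) and pop_size >= 4."""
    rng = random.Random(seed)
    pop = [[random_gene(X, rng) for _ in range(n_stumps)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda c: fitness(c, X, y), reverse=True)
        elite = pop[: pop_size // 2]           # keep the better half unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n_stumps)   # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: replace a gene by a fresh random one.
            child = [random_gene(X, rng) if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        pop = elite + children
    return max(pop, key=lambda c: fitness(c, X, y))

# Synthetic stand-in for spectra: 40 samples, 10 intensities,
# with features 2 and 7 shifted upward in class 1 (purely illustrative).
data_rng = random.Random(1)
X, y = [], []
for i in range(40):
    label = i % 2
    row = [data_rng.gauss(0, 1) for _ in range(10)]
    row[2] += 3 * label
    row[7] += 3 * label
    X.append(row)
    y.append(label)

best = evolve(X, y)
print(best, fitness(best, X, y))
```

Because a stump depends only on which side of a threshold an intensity falls, the vote is insensitive to how far the intensity lies from that threshold. This is one plausible reading of why stumps suit highly variable intensities. The accuracy printed here is measured on the training data, so it says nothing about generalization.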
Origin: Publisher files allowed on an open archive

Dates and versions

hal-02655748, version 1 (29-05-2020)

Cite

C. Reynes, Rodolphe Sabatier, Nicolas Molinari, S. Lehmann. A new genetic algorithm in proteomics: Feature selection for SELDI-TOF data. Computational Statistics and Data Analysis, 2008, 52 (9), pp.4380-4394. ⟨10.1016/j.csda.2008.02.025⟩. ⟨hal-02655748⟩