A Machine Learning Methodology for Enzyme Functional Classification Combining Structural and Protein Sequence Descriptors

Abstract : The massive expansion of the worldwide Protein Data Bank (PDB) provides new opportunities for computational approaches which can learn from available data and extrapolate the knowledge into new coming instances. The aim of this work is to apply machine learning in order to train prediction models using data acquired by costly experimental procedures and perform enzyme functional classification. Enzymes constitute key pharmacological targets and the knowledge on the chemical reactions they catalyze is very important for the development of potent molecular agents that will either suppress or enhance the function of the given enzyme, thus modulating a pathogenicity, an illness or even the phenotype. Classification is performed on two levels: (i) using structural information into a Support Vector Machines (SVM) classifier and (ii) based on amino acid sequence alignment and Nearest Neighbor (NN) classification. The classification accuracy is increased by fusing the two classifiers and reaches 93.4% on a large dataset of 39,251 proteins from the PDB database. The method is very competitive with respect to accuracy of classification into the 6 enzymatic classes, while at the same time its computational cost during prediction is very small.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [15 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01359157
Contributor : Evangelia Zacharaki <>
Submitted on : Thursday, September 1, 2016 - 11:19:12 PM
Last modification on : Thursday, February 7, 2019 - 5:29:15 PM

File

Amidi_IWBBIO2016.pdf
Files produced by the author(s)

Identifiers

Citation

Afshine Amidi, Shervine Amidi, Dimitrios Vlachakis, Nikos Paragios, Evangelia I. Zacharaki. A Machine Learning Methodology for Enzyme Functional Classification Combining Structural and Protein Sequence Descriptors. Bioinformatics and Biomedical Engineering, Apr 2016, Granada, Spain. pp.728-738, ⟨10.1007/978-3-319-31744-1_63⟩. ⟨hal-01359157⟩

Share

Metrics

Record views

722

Files downloads

829