Protein interaction hotspot identification using sequence-based frequency-derived features - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Biomedical Engineering Année : 2011

Protein interaction hotspot identification using sequence-based frequency-derived features

Résumé

Finding good descriptors, capable of discriminating hotspot residues from others, is still a challenge in many attempts to understand protein interaction. In this paper, descriptors issued from the analysis of amino acid sequences using digital signal processing (DSP) techniques are shown to be as good as those derived from protein tertiary structure and/or information on the complex. The simulation results show that our descriptors can be used separately to predict hotspots, via a random forest classifier, with an accuracy of 79% and a precision of 75%. They can also be used jointly with features derived from tertiary structures to boost the performance up to an accuracy of 82% and a precision of 80%.
Fichier non déposé

Dates et versions

hal-00609247 , version 1 (18-07-2011)

Identifiants

  • HAL Id : hal-00609247 , version 1

Citer

Quang Thang Nguyen, Ronan Fablet, Dominique Pastor. Protein interaction hotspot identification using sequence-based frequency-derived features. IEEE Transactions on Biomedical Engineering, 2011. ⟨hal-00609247⟩
167 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More