From principal component to direct coupling analysis of coevolution in proteins: Low-eigenvalue modes are needed for structure prediction - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2012

From principal component to direct coupling analysis of coevolution in proteins: Low-eigenvalue modes are needed for structure prediction

Résumé

Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predicting residue-residue contacts. In this paper, inspired by the statistical physics of disordered systems, we introduce the Hopfield-Potts model to naturally interpolate between these two approaches. The Hopfield-Potts model allows us to identify relevant 'patterns' of residues from the knowledge of the eigenmodes and eigenvalues of the residue-residue Pearson correlation matrix. We show how the computation of such statistical patterns makes it possible to accurately predict residue-residue contacts with a much smaller number of parameters than DCA. In addition, we show that low-eigenvalue correlation modes, discarded by PCA, are important to recover structural information: the corresponding patterns are highly localized, that is, they are concentrated in few sites, which we find to be in close contact on the three-dimensional protein fold. We also explain why these low-eigenvalue modes, in contrast to the standard principal components, are able to efficiently encode compensatory mutations between pairs of residues.
Fichier principal
Vignette du fichier
hopfield_potts_dec3.pdf (1.09 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00764377 , version 1 (12-12-2012)
hal-00764377 , version 2 (27-08-2013)

Identifiants

Citer

Simona Cocco, Remi Monasson, Martin Weigt. From principal component to direct coupling analysis of coevolution in proteins: Low-eigenvalue modes are needed for structure prediction. 2012. ⟨hal-00764377v1⟩
377 Consultations
440 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More