Analyzing protein-DNA recognition mechanisms.
Résumé
We present a computational algorithm that can be used to analyze the generic mechanisms involved in protein-DNA recognition. Our approach is based on energy calculations for the full set of base sequences that can be threaded onto the DNA within a protein-DNA complex. It is able to reproduce experimental consensus binding sequences for a variety of DNA binding proteins and also correlates well with the order of measured binding free energies. These results suggest that the crystal structure of a protein-DNA complex can be used to identify all potential binding sequences. By analyzing the energy contributions that lead to base sequence selectivity, it is possible to quantify the concept of direct versus indirect recognition and to identify a new concept describing whether the protein-DNA interaction and DNA deformation terms select optimal binding sites by acting in accord or in disaccord.We present a computational algorithm that can be used to analyze the generic mechanisms involved in protein-DNA recognition. Our approach is based on energy calculations for the full set of base sequences that can be threaded onto the DNA within a protein-DNA complex. It is able to reproduce experimental consensus binding sequences for a variety of DNA binding proteins and also correlates well with the order of measured binding free energies. These results suggest that the crystal structure of a protein-DNA complex can be used to identify all potential binding sequences. By analyzing the energy contributions that lead to base sequence selectivity, it is possible to quantify the concept of direct versus indirect recognition and to identify a new concept describing whether the protein-DNA interaction and DNA deformation terms select optimal binding sites by acting in accord or in disaccord.