Understanding Protein Flexibility through Dimensionality Reduction
To cite this article: Miguel L. Teodoro, George N. Phillips Jr, and Lydia E. Kavraki. Journal of Computational Biology.
June 2003,
10(3-4): 617-634.
doi:10.1089/10665270360688228.
Department of Biochemistry and Cell Biology and Department of Computer Science, Rice University, 6100 Main Street, MS 140, Houston, TX 77005
George N. Phillips Jr
Department of Biochemistry and Department of Computer Science, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI 53706-1544
Lydia E. Kavraki
Department of Computer Science and Department of Bioengineering, Rice University, MS 132, P.O. Box 1892, Houston, TX 77251-1892
ABSTRACT
This work shows how to decrease the complexity of modeling flexibility in proteins by reducing the number of dimensions necessary to model important macromolecular motions such as the induced-fit process. Induced fit occurs during the binding of a protein to other proteins, nucleic acids, or small molecules (ligands) and is a critical part of protein function. It is now widely accepted that conformational changes of proteins can affect their ability to bind other molecules and that any progress in modeling protein motion and flexibility will contribute to the understanding of key biological functions. However, modeling protein flexibility has proven a very difficult task. Experimental laboratory methods, such as x-ray crystallography, produce rather limited information, while computational methods such as molecular dynamics are too slow for routine use with large systems. In this work, we show how to use the principal component analysis method, a dimensionality reduction technique, to transform the original high-dimensional representation of protein motion into a lower dimensional representation that captures the dominant modes of motions of proteins. For a medium-sized protein, this corresponds to reducing a problem with a few thousand degrees of freedom to one with less than fifty. Although there is inevitably some loss in accuracy, we show that we can obtain conformations that have been observed in laboratory experiments, starting from different initial conformations and working in a drastically reduced search space.
This paper was cited by:
Dynameomics: protein dynamics and unfolding across fold space
Amanda L. Jonsson, R. Dustin Schaeffer, Marc W. van der Kamp, Valerie Daggett
BioMolecular Concepts. Dec 2010, Vol. 1, No. 5-6: 335-344
Dynameomics: A Comprehensive Database of Protein Dynamics
Marc W. van der Kamp, R. Dustin Schaeffer, Amanda L. Jonsson, Alexander D. Scouras, Andrew M. Simms, Rudesh D. Toofanny, Noah C. Benson, Peter C. Anderson, Eric D. Merkley, Steven Rysavy, Dennis Bromley, David A.C. Beck, Valerie Daggett
Consensus modes, a robust description of protein collective motions from multiple-minima normal mode analysis—application to the HIV-1 protease
Paulo Ricardo Batista, Charles Herbert Robert, Jean-Didier Maréchal, Meriam Ben Hamida-Rebaï, Pedro Geraldo Pascutti, Paulo Mascarello Bisch, David Perahia
Physical Chemistry Chemical Physics. Jan 2010, Vol. 12, No. 12: 2850