New computational method integrates experimental and genomic data through the use of the latest sequence analysis and 3D modelling technology, say scientists.

Proteins are macromolecules that are key to the thousands of cellular functions that take place in a living organism. They are formed by chains of smaller molecules called amino acids that fold forming a three-dimensional structure. It has recently been discovered that by studying the co-evolution of amino acids we can reconstruct the form or structure of these biological compounds in their natural surroundings. "A protein's amino acids can co-evolve, i.e. change in a coordinated way," says Alfonso Valencia. "By analysing the sequences of a given family of proteins, we can predict physical contacts between amino acids with great precision, in sufficient number to reconstruct the folding of a protein accurately and, therefore, its structure or form."
However, this structure does not remain static; it goes through changes in such a way that, similar to a dance in which each of the dancers adapts to their partner, it interacts with other biological compounds or with drugs. This is known as protein dynamics, the study of which has proven to be very difficult both with experimental observations and using computational tools.
The question addressed by the researchers at the beginning of the study, when Francesco Gervasio headed the Computational Biophysics Group at the CNIO, was more complex: can we use co-evolutionary studies to predict changes in the shape of proteins and, consequently, the language they establish with their environment?
"We developed a model in which the amino acids that have a strong co-evolutionary relationship attracted each other, without further additional data," says Simone Marsili, researcher who has also participated in the project. "First, we simulate the folding process and then we can see how the simulations were able to predict the changes in shape of the proteins at different levels of complexity, including those required for kinases to function [these are key proteins in metabolic and cell signalling processes as well as in cell transport, amongst others]."
In addition, it demonstrates that genomic data can be a source of useful information to supplement the current tools used to study the structure and dynamics of proteins.
Advertisement
Source-Eurekalert