TY - JOUR
T1 - ConSurf
T2 - Using evolutionary data to raise testable hypotheses about protein function
AU - Celniker, Gershon
AU - Nimrod, Guy
AU - Ashkenazy, Haim
AU - Glaser, Fabian
AU - Martz, Eric
AU - Mayrose, Itay
AU - Pupko, Tal
AU - Ben-Tal, Nir
PY - 2013/4
Y1 - 2013/4
N2 - Many mutations disappear from the population because they impair protein function and/or stability. Thus, amino acid positions that are essential for proper function evolve more slowly than others, or in other words, the slow evolutionary rate of a position reflects its importance. ConSurf (http://consurf.tau.ac.il), reviewed in this manuscript, exploits this to reveal key amino acid positions that are important for maintaining the native conformation(s) of the protein and its function, be it binding, catalysis, transport, etc. Given the sequence or 3D structure of the query protein as input, a search for similar sequences is conducted and the sequences are aligned. The multiple sequence alignment is subsequently used to calculate the evolutionary rates of each amino acid site, using Bayesian or maximum-likelihood algorithms. Both algorithms take into account the evolutionary relationships between the sequences, reflected in phylogenetic trees, to alleviate problems due to uneven (biased) sampling in sequence space. This is particularly important when the number of sequences is low. The ConSurf-DB, a new release of which is presented here, provides precalculated ConSurf conservation analysis of nearly all available structures in the Protein DataBank (PDB). The usefulness of ConSurf for the study of individual proteins and mutations, as well as a range of large-scale, genome-wide applications, is reviewed.
AB - Many mutations disappear from the population because they impair protein function and/or stability. Thus, amino acid positions that are essential for proper function evolve more slowly than others, or in other words, the slow evolutionary rate of a position reflects its importance. ConSurf (http://consurf.tau.ac.il), reviewed in this manuscript, exploits this to reveal key amino acid positions that are important for maintaining the native conformation(s) of the protein and its function, be it binding, catalysis, transport, etc. Given the sequence or 3D structure of the query protein as input, a search for similar sequences is conducted and the sequences are aligned. The multiple sequence alignment is subsequently used to calculate the evolutionary rates of each amino acid site, using Bayesian or maximum-likelihood algorithms. Both algorithms take into account the evolutionary relationships between the sequences, reflected in phylogenetic trees, to alleviate problems due to uneven (biased) sampling in sequence space. This is particularly important when the number of sequences is low. The ConSurf-DB, a new release of which is presented here, provides precalculated ConSurf conservation analysis of nearly all available structures in the Protein DataBank (PDB). The usefulness of ConSurf for the study of individual proteins and mutations, as well as a range of large-scale, genome-wide applications, is reviewed.
KW - bioinformatics
KW - databases
KW - genomics
KW - protein models
KW - protein structures
UR - http://www.scopus.com/inward/record.url?scp=84876210821&partnerID=8YFLogxK
U2 - 10.1002/ijch.201200096
DO - 10.1002/ijch.201200096
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.systematicreview???
AN - SCOPUS:84876210821
SN - 0021-2148
VL - 53
SP - 199
EP - 206
JO - Israel Journal of Chemistry
JF - Israel Journal of Chemistry
IS - 3-4
ER -