Amino acid composition and the evolutionary rates of protein-coding genes

Dan Graur*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

88 Scopus citations

Abstract

Based on the rates of amino acid substitution for 60 mammalian genes of 50 codons or more, it is shown that the rate of amino acid substitution of a protein is correlated with its amino acid composition. In particular, the content of glycine residues is negatively correlated with the rate of amino acid substitution, and this content alone explains about 38% of the total variation in amino acid substitution rates among different protein families. The propensity of a polypeptide to evolve fast or slowly may be predicted from an index or indices of protein mutability directly derivable from the amino acid composition. The propensity of an amino acid to remain conserved during evolutionary times depends not so much on its being features prominently in active sites, but on its stability index, defined as the mean chemical distance [R. Grantham (1974) Science 185:862-864] between the amino acid and its mutational derivatives produced by single-nucleotide substitutions. Functional constraints related to active and binding sites of proteins play only a minor role in determining the overall rate of amino acid substitution. The importance of amino acid composition in determining rates of substitution is illustrated with examples involving cytochrome c, cytochrome b5, ras-related genes, the calmodulin protein family, and fibrinopeptides.

Original languageEnglish
Pages (from-to)53-62
Number of pages10
JournalJournal of Molecular Evolution
Volume22
Issue number1
DOIs
StatePublished - Aug 1985
Externally publishedYes

Funding

FundersFunder number
National Institute of General Medical SciencesR01GM020293

    Keywords

    • Amino acid composition
    • Functional constraints
    • Glycine
    • Rate of amino acid substitution

    Fingerprint

    Dive into the research topics of 'Amino acid composition and the evolutionary rates of protein-coding genes'. Together they form a unique fingerprint.

    Cite this