TY - JOUR
T1 - Unbiased estimation of symmetrical directional mutation pressure from protein-coding DNA
AU - Jermiin, Lars S.
AU - Foster, Peter G.
AU - Graur, Dan
AU - Lowe, Roger M.
AU - Crozier, Ross H.
PY - 1996
Y1 - 1996
N2 - The most generally applicable procedure for obtaining estimates of the symmetrical, or strand-nonspecific, directional mutation pressure (μ(D)) on protein-coding DNA sequences is to determine the G+C content at synonymous codon sites (P(syn)), and to divide P(syn) by twice the arithmetic mean of the G+C content at synonymous codon sites of a large number of randomly generated, synonymously coding DNA sequences (P̄(syn)). Unfortunately, the original procedure yields biased estimates of P(syn) and μ(D) and is computationally expensive. We here present a fast procedure for estimating unbiased μ(D) values. The procedure employs direct calculation of P(circumflex)(syn) (≃P̄(syn)) and two normalization procedures, one for P(syn) ≥ P(circumflex)(syn) and another for P(syn) ≥ P(circumflex)(syn). The normalization removes a bias sometimes caused by codons specifying arginine, asparagine, isoleucine, and leucine. Consequently, comparison of protein-coding genes that are translated using different genetic codes is facilitated.
AB - The most generally applicable procedure for obtaining estimates of the symmetrical, or strand-nonspecific, directional mutation pressure (μ(D)) on protein-coding DNA sequences is to determine the G+C content at synonymous codon sites (P(syn)), and to divide P(syn) by twice the arithmetic mean of the G+C content at synonymous codon sites of a large number of randomly generated, synonymously coding DNA sequences (P̄(syn)). Unfortunately, the original procedure yields biased estimates of P(syn) and μ(D) and is computationally expensive. We here present a fast procedure for estimating unbiased μ(D) values. The procedure employs direct calculation of P(circumflex)(syn) (≃P̄(syn)) and two normalization procedures, one for P(syn) ≥ P(circumflex)(syn) and another for P(syn) ≥ P(circumflex)(syn). The normalization removes a bias sometimes caused by codons specifying arginine, asparagine, isoleucine, and leucine. Consequently, comparison of protein-coding genes that are translated using different genetic codes is facilitated.
KW - A+T pressure
KW - Bias correction
KW - G+C pressure
KW - Nonsynonymous codon sites
KW - Symmetrical directional mutation pressure
KW - Synonymous codon sites
UR - http://www.scopus.com/inward/record.url?scp=0029992180&partnerID=8YFLogxK
U2 - 10.1007/BF02498643
DO - 10.1007/BF02498643
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:0029992180
SN - 0022-2844
VL - 42
SP - 476
EP - 480
JO - Journal of Molecular Evolution
JF - Journal of Molecular Evolution
IS - 4
ER -