TY - JOUR
T1 - Codon-based indices for modeling gene expression and transcript evolution
AU - Bahiri-Elitzur, Shir
AU - Tuller, Tamir
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2021/1
Y1 - 2021/1
N2 - Codon usage bias (CUB) refers to the phenomena that synonymous codons are used in different frequencies in most genes and organisms. The general assumption is that codon biases reflect a balance between mutational biases and natural selection. Today we understand that the codon content is related and can affect all gene expression steps. Starting from the 1980s, codon-based indices have been used for answering different questions in all biomedical fields, including systems biology, agriculture, medicine, and biotechnology. In general, codon usage bias indices weigh each codon or a small set of codons to estimate the fitting of a certain coding sequence to a certain phenomenon (e.g., bias in codons, adaptation to the tRNA pool, frequencies of certain codons, transcription elongation speed, etc.) and are usually easy to implement. Today there are dozens of such indices; thus, this paper aims to review and compare the different codon usage bias indices, their applications, and advantages. In addition, we perform analysis that demonstrates that most indices tend to correlate even though they aim to capture different aspects. Due to the centrality of codon usage bias on different gene expression steps, it is important to keep developing new indices that can capture additional aspects that are not modeled with the current indices.
AB - Codon usage bias (CUB) refers to the phenomena that synonymous codons are used in different frequencies in most genes and organisms. The general assumption is that codon biases reflect a balance between mutational biases and natural selection. Today we understand that the codon content is related and can affect all gene expression steps. Starting from the 1980s, codon-based indices have been used for answering different questions in all biomedical fields, including systems biology, agriculture, medicine, and biotechnology. In general, codon usage bias indices weigh each codon or a small set of codons to estimate the fitting of a certain coding sequence to a certain phenomenon (e.g., bias in codons, adaptation to the tRNA pool, frequencies of certain codons, transcription elongation speed, etc.) and are usually easy to implement. Today there are dozens of such indices; thus, this paper aims to review and compare the different codon usage bias indices, their applications, and advantages. In addition, we perform analysis that demonstrates that most indices tend to correlate even though they aim to capture different aspects. Due to the centrality of codon usage bias on different gene expression steps, it is important to keep developing new indices that can capture additional aspects that are not modeled with the current indices.
KW - Codon usage bias
KW - Gene expression
KW - Transcript evolution
UR - http://www.scopus.com/inward/record.url?scp=85105489527&partnerID=8YFLogxK
U2 - 10.1016/j.csbj.2021.04.042
DO - 10.1016/j.csbj.2021.04.042
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.systematicreview???
C2 - 34025951
AN - SCOPUS:85105489527
SN - 2001-0370
VL - 19
SP - 2646
EP - 2663
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -