'Genome order index' should not be used for defining compositional constraints in nucleotide sequences

  • Eran Elhaik*
  • , Dan Graur
  • , Krešimir Josić
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

A "genome order index," defined as S = a2 + c2 + t2 + g2, where a, c, t, and g are the nucleotide frequencies of A, C, T, and G, respectively, was used to suggest that there exist genome-specific constraints on nucleotide composition. We show that the "evidence" for constraint, S < 1/3, is in fact a mathematical property that is always true regardless of data. Moreover, we show that S is strictly equivalent to and derivable from the Shannon H-function and has no advantage over it.

Original languageEnglish
Pages (from-to)147
Number of pages1
JournalComputational Biology and Chemistry
Volume32
Issue number2
DOIs
StatePublished - Apr 2008
Externally publishedYes

Funding

FundersFunder number
National Science FoundationDMS-0071735, DBI-0543342

    Keywords

    • Genome order index
    • Genomic G + C content
    • Isochores
    • Nucleotide composition
    • Shannon H-function

    Fingerprint

    Dive into the research topics of ''Genome order index' should not be used for defining compositional constraints in nucleotide sequences'. Together they form a unique fingerprint.

    Cite this