TY - JOUR
T1 - Regions of unusual statistical properties as tools in the search for horizontally transferred genes in Escherichia coli
AU - Putonti, Catherine
AU - Chumakov, Sergei
AU - Chavez, Arturo
AU - Luo, Yi
AU - Graur, Dan
AU - Fox, George E.
AU - Fofanov, Yuriy
PY - 2006
Y1 - 2006
N2 - The observed diversity of statistical characteristics along genomic sequences is the result of the influences of a variety of ongoing processes including horizontal gene transfer, gene loss, genome rearrangements, and evolution. The rate at which various processes affect the genome typically varies between different genomic regions. Thus, variations in statistical properties seen in different regions of a genome are often associated with its evolution and functional organization. Analysis of such properties is therefore relevant to many ongoing biomedical research efforts. Similarity Plot or S-plot is a Windows-based application for large-scale comparisons and 2D visualization of similarities between genomic sequences. This application combines two approaches wildly used in genomics: window analysis of statistical characteristics along genomes and dot-plot visual representation. S-plot is effective in detecting highly similar regions between two genomes. Within a single genome, S-plot has the ability to identify highly dissimilar regions displaying unusual compositional properties. The application was used to perform a comparative analysis of 50+ microbial genomes as well as many eukaryote genomes including human, rat, mouse, and drosophila. We illustrate the uses of S-Plot in a comparison involving Escherichia coli K12 and E. coli O157:H7.
AB - The observed diversity of statistical characteristics along genomic sequences is the result of the influences of a variety of ongoing processes including horizontal gene transfer, gene loss, genome rearrangements, and evolution. The rate at which various processes affect the genome typically varies between different genomic regions. Thus, variations in statistical properties seen in different regions of a genome are often associated with its evolution and functional organization. Analysis of such properties is therefore relevant to many ongoing biomedical research efforts. Similarity Plot or S-plot is a Windows-based application for large-scale comparisons and 2D visualization of similarities between genomic sequences. This application combines two approaches wildly used in genomics: window analysis of statistical characteristics along genomes and dot-plot visual representation. S-plot is effective in detecting highly similar regions between two genomes. Within a single genome, S-plot has the ability to identify highly dissimilar regions displaying unusual compositional properties. The application was used to perform a comparative analysis of 50+ microbial genomes as well as many eukaryote genomes including human, rat, mouse, and drosophila. We illustrate the uses of S-Plot in a comparison involving Escherichia coli K12 and E. coli O157:H7.
KW - Escherichia coli K12
KW - Escherichia coli O157:H7
KW - Horizontal gene transfer
KW - Sequence composition
UR - http://www.scopus.com/inward/record.url?scp=33846505986&partnerID=8YFLogxK
U2 - 10.1063/1.2356423
DO - 10.1063/1.2356423
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.conferencearticle???
AN - SCOPUS:33846505986
SN - 0094-243X
VL - 854
SP - 126
EP - 128
JO - AIP Conference Proceedings
JF - AIP Conference Proceedings
T2 - 9h Mexican Symposium on Medical Physics
Y2 - 18 March 2006 through 23 March 2006
ER -