Inference of splicing regulatory activities by sequence neighborhood analysis

Michael B. Stadler*, Noam Shomron, Gene W. Yeo, Aniket Schneider, Xinshu Xiao, Christopher B. Burge

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

66 Scopus citations

Abstract

Sequence-specific recognition of nucleic-acid motifs is critical to many cellular processes. We have developed a new and general method called Neighborhood Inference (NI) that predicts sequences with activity in regulating a biochemical process based on the local density of known sites in sequence space. Applied to the problem of RNA splicing regulation, NI was used to predict hundreds of new exonic splicing enhancer (ESE) and silencer (ESS) hexanucleotides from known human ESEs and ESSs. These predictions were supported by cross-validation analysis, by analysis of published splicing regulatory activity data, by sequence-conservation analysis, and by measurement of the splicing regulatory activity of 24 novel predicted ESEs, ESSs, and neutral sequences using an in vivo splicing reporter assay. These results demonstrate the ability of NI to accurately predict splicing regulatory activity and show that the scope of exonic splicing regulatory elements is substantially larger than previously anticipated. Analysis of orthologous exons in four mammals showed that the NI score of ESEs, a measure of function, is much more highly conserved above background than ESE primary sequence. This observation indicates a high degree of selection for ESE activity in mammalian exons, with surprisingly frequent interchangeability between ESE sequences.

Original languageEnglish
Pages (from-to)1849-1860
Number of pages12
JournalPLoS Genetics
Volume2
Issue number11
DOIs
StatePublished - Nov 2006
Externally publishedYes

Fingerprint

Dive into the research topics of 'Inference of splicing regulatory activities by sequence neighborhood analysis'. Together they form a unique fingerprint.

Cite this