TY - JOUR
T1 - Prediction of Interacting Single-Stranded RNA Bases by Protein-Binding Patterns
AU - Shulman-Peleg, Alexandra
AU - Shatsky, Maxim
AU - Nussinov, Ruth
AU - Wolfson, Haim J.
N1 - Funding Information:
We thank O. Dror for the contribution of software for this project. The research of A.S.-P. was supported by the Clore PhD Fellowship. The research of H.J.W. has been supported in part by the Israel Science Foundation (grant 281/05), the NIAID, National Institutes of Health (NIH) (grant 1UC1AI067231), the Binational US–Israel Science Foundation and by the Hermann Minkowski-Minerva Center for Geometry at TAU. This publication has been funded in whole or in part with federal funds from the National Cancer Institute, NIH, under contract NO1-CO-12400. This research was supported in part by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government.
PY - 2008/5/29
Y1 - 2008/5/29
N2 - Prediction of protein-RNA interactions at the atomic level of detail is crucial for our ability to understand and interfere with processes such as gene expression and regulation. Here, we investigate protein binding pockets that accommodate extruded nucleotides not involved in RNA base pairing. We observed that most of the protein-interacting nucleotides are part of a consecutive fragment of at least two nucleotides whose rings have significant interactions with the protein. Many of these share the same protein binding cavity and more than 30% of such pairs are π-stacked. Since these local geometries cannot be inferred from the nucleotide identities, we present a novel framework for their prediction from the properties of protein binding sites. First, we present a classification of known RNA nucleotide and dinucleotide protein binding sites and identify the common types of shared 3-D physicochemical binding patterns. These are recognized by a new classification methodology that is based on spatial multiple alignment. The shared patterns reveal novel similarities between dinucleotide binding sites of proteins with different overall sequences, folds and functions. Given a protein structure, we use these patterns for the prediction of its RNA dinucleotide binding sites. Based on the binding modes of these nucleotides, we further predict an RNA fragment that interacts with those protein binding sites. With these knowledge-based predictions, we construct an RNA fragment that can have a previously unknown sequence and structure. In addition, we provide a drug design application in which the database of all known small-molecule binding sites is searched for regions similar to nucleotide and dinucleotide binding patterns, suggesting new fragments and scaffolds that can target them.
AB - Prediction of protein-RNA interactions at the atomic level of detail is crucial for our ability to understand and interfere with processes such as gene expression and regulation. Here, we investigate protein binding pockets that accommodate extruded nucleotides not involved in RNA base pairing. We observed that most of the protein-interacting nucleotides are part of a consecutive fragment of at least two nucleotides whose rings have significant interactions with the protein. Many of these share the same protein binding cavity and more than 30% of such pairs are π-stacked. Since these local geometries cannot be inferred from the nucleotide identities, we present a novel framework for their prediction from the properties of protein binding sites. First, we present a classification of known RNA nucleotide and dinucleotide protein binding sites and identify the common types of shared 3-D physicochemical binding patterns. These are recognized by a new classification methodology that is based on spatial multiple alignment. The shared patterns reveal novel similarities between dinucleotide binding sites of proteins with different overall sequences, folds and functions. Given a protein structure, we use these patterns for the prediction of its RNA dinucleotide binding sites. Based on the binding modes of these nucleotides, we further predict an RNA fragment that interacts with those protein binding sites. With these knowledge-based predictions, we construct an RNA fragment that can have a previously unknown sequence and structure. In addition, we provide a drug design application in which the database of all known small-molecule binding sites is searched for regions similar to nucleotide and dinucleotide binding patterns, suggesting new fragments and scaffolds that can target them.
KW - RNA aptamer drug design
KW - multiple binding site alignment
KW - nucleotide and dinucleotide binding sites
KW - physicochemical binding patterns
KW - protein-RNA interactions
UR - http://www.scopus.com/inward/record.url?scp=43049091848&partnerID=8YFLogxK
U2 - 10.1016/j.jmb.2008.03.043
DO - 10.1016/j.jmb.2008.03.043
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:43049091848
SN - 0022-2836
VL - 379
SP - 299
EP - 316
JO - Journal of Molecular Biology
JF - Journal of Molecular Biology
IS - 2
ER -