TY - JOUR
T1 - ScanNet
T2 - an interpretable geometric deep learning model for structure-based protein binding site prediction
AU - Tubiana, Jérôme
AU - Schneidman-Duhovny, Dina
AU - Wolfson, Haim J.
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Nature America, Inc.
PY - 2022/6
Y1 - 2022/6
N2 - Predicting the functional sites of a protein from its structure, such as the binding sites of small molecules, other proteins or antibodies, sheds light on its function in vivo. Currently, two classes of methods prevail: machine learning models built on top of handcrafted features and comparative modeling. They are, respectively, limited by the expressivity of the handcrafted features and the availability of similar proteins. Here, we introduce ScanNet, an end-to-end, interpretable geometric deep learning model that learns features directly from 3D structures. ScanNet builds representations of atoms and amino acids based on the spatio-chemical arrangement of their neighbors. We train ScanNet for detecting protein–protein and protein–antibody binding sites, demonstrate its accuracy—including for unseen protein folds—and interpret the filters learned. Finally, we predict epitopes of the SARS-CoV-2 spike protein, validating known antigenic regions and predicting previously uncharacterized ones. Overall, ScanNet is a versatile, powerful and interpretable model suitable for functional site prediction tasks. A webserver for ScanNet is available from http://bioinfo3d.cs.tau.ac.il/ScanNet/.
AB - Predicting the functional sites of a protein from its structure, such as the binding sites of small molecules, other proteins or antibodies, sheds light on its function in vivo. Currently, two classes of methods prevail: machine learning models built on top of handcrafted features and comparative modeling. They are, respectively, limited by the expressivity of the handcrafted features and the availability of similar proteins. Here, we introduce ScanNet, an end-to-end, interpretable geometric deep learning model that learns features directly from 3D structures. ScanNet builds representations of atoms and amino acids based on the spatio-chemical arrangement of their neighbors. We train ScanNet for detecting protein–protein and protein–antibody binding sites, demonstrate its accuracy—including for unseen protein folds—and interpret the filters learned. Finally, we predict epitopes of the SARS-CoV-2 spike protein, validating known antigenic regions and predicting previously uncharacterized ones. Overall, ScanNet is a versatile, powerful and interpretable model suitable for functional site prediction tasks. A webserver for ScanNet is available from http://bioinfo3d.cs.tau.ac.il/ScanNet/.
UR - http://www.scopus.com/inward/record.url?scp=85131063497&partnerID=8YFLogxK
U2 - 10.1038/s41592-022-01490-7
DO - 10.1038/s41592-022-01490-7
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 35637310
AN - SCOPUS:85131063497
SN - 1548-7091
VL - 19
SP - 730
EP - 739
JO - Nature Methods
JF - Nature Methods
IS - 6
ER -