TY - JOUR
T1 - BERTwalk for integrating gene networks to predict gene-to pathway-level properties
AU - Nasser, Rami
AU - Sharan, Roded
N1 - Publisher Copyright:
© 2023 The Author(s). Published by Oxford University Press.
PY - 2023
Y1 - 2023
N2 - Motivation: Graph representation learning is a fundamental problem in the field of data science with applications to integrative analysis of biological networks. Previous work in this domain was mostly limited to shallow representation techniques. A recent deep representation technique, BIONIC, has achieved state-of-The-Art results in a variety of tasks but used arbitrarily defined components. Results: Here, we present BERTwalk, an unsupervised learning scheme that combines the BERT masked language model with a network propagation regularization for graph representation learning. The transformation from networks to texts allows our method to naturally integrate different networks and provide features that inform not only nodes or edges but also pathway-level properties. We show that our BERTwalk model outperforms BIONIC, as well as four other recent methods, on two comprehensive benchmarks in yeast and human. We further show that our model can be utilized to infer functional pathways and their effects. Contact: [email protected]
AB - Motivation: Graph representation learning is a fundamental problem in the field of data science with applications to integrative analysis of biological networks. Previous work in this domain was mostly limited to shallow representation techniques. A recent deep representation technique, BIONIC, has achieved state-of-The-Art results in a variety of tasks but used arbitrarily defined components. Results: Here, we present BERTwalk, an unsupervised learning scheme that combines the BERT masked language model with a network propagation regularization for graph representation learning. The transformation from networks to texts allows our method to naturally integrate different networks and provide features that inform not only nodes or edges but also pathway-level properties. We show that our BERTwalk model outperforms BIONIC, as well as four other recent methods, on two comprehensive benchmarks in yeast and human. We further show that our model can be utilized to infer functional pathways and their effects. Contact: [email protected]
UR - http://www.scopus.com/inward/record.url?scp=85166151370&partnerID=8YFLogxK
U2 - 10.1093/bioadv/vbad086
DO - 10.1093/bioadv/vbad086
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 37448813
AN - SCOPUS:85166151370
SN - 2635-0041
VL - 3
JO - Bioinformatics Advances
JF - Bioinformatics Advances
IS - 1
M1 - vbad086
ER -