TY - JOUR
T1 - Prediction of gestational diabetes based on nationwide electronic health records
AU - Artzi, Nitzan Shalom
AU - Shilo, Smadar
AU - Hadar, Eran
AU - Rossman, Hagai
AU - Barbash-Hazan, Shiri
AU - Ben-Haroush, Avi
AU - Balicer, Ran D.
AU - Feldman, Becca
AU - Wiznitzer, Arnon
AU - Segal, Eran
N1 - Publisher Copyright:
© 2020, The Author(s), under exclusive licence to Springer Nature America, Inc.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Gestational diabetes mellitus (GDM) poses increased risk of short- and long-term complications for mother and offspring1–4. GDM is typically diagnosed at 24–28 weeks of gestation, but earlier detection is desirable as this may prevent or considerably reduce the risk of adverse pregnancy outcomes5,6. Here we used a machine-learning approach to predict GDM on retrospective data of 588,622 pregnancies in Israel for which comprehensive electronic health records were available. Our models predict GDM with high accuracy even at pregnancy initiation (area under the receiver operating curve (auROC) = 0.85), substantially outperforming a baseline risk score (auROC = 0.68). We validated our results on both a future validation set and a geographical validation set from the most populated city in Israel, Jerusalem, thereby emulating real-world performance. Interrogating our model, we uncovered previously unreported risk factors, including results of previous pregnancy glucose challenge tests. Finally, we devised a simpler model based on just nine questions that a patient could answer, with only a modest reduction in accuracy (auROC = 0.80). Overall, our models may allow early-stage intervention in high-risk women, as well as a cost-effective screening approach that could avoid the need for glucose tolerance tests by identifying low-risk women. Future prospective studies and studies on additional populations are needed to assess the real-world clinical utility of the model.
AB - Gestational diabetes mellitus (GDM) poses increased risk of short- and long-term complications for mother and offspring1–4. GDM is typically diagnosed at 24–28 weeks of gestation, but earlier detection is desirable as this may prevent or considerably reduce the risk of adverse pregnancy outcomes5,6. Here we used a machine-learning approach to predict GDM on retrospective data of 588,622 pregnancies in Israel for which comprehensive electronic health records were available. Our models predict GDM with high accuracy even at pregnancy initiation (area under the receiver operating curve (auROC) = 0.85), substantially outperforming a baseline risk score (auROC = 0.68). We validated our results on both a future validation set and a geographical validation set from the most populated city in Israel, Jerusalem, thereby emulating real-world performance. Interrogating our model, we uncovered previously unreported risk factors, including results of previous pregnancy glucose challenge tests. Finally, we devised a simpler model based on just nine questions that a patient could answer, with only a modest reduction in accuracy (auROC = 0.80). Overall, our models may allow early-stage intervention in high-risk women, as well as a cost-effective screening approach that could avoid the need for glucose tolerance tests by identifying low-risk women. Future prospective studies and studies on additional populations are needed to assess the real-world clinical utility of the model.
UR - http://www.scopus.com/inward/record.url?scp=85077786219&partnerID=8YFLogxK
U2 - 10.1038/s41591-019-0724-8
DO - 10.1038/s41591-019-0724-8
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.letter???
C2 - 31932807
AN - SCOPUS:85077786219
SN - 1078-8956
VL - 26
SP - 71
EP - 76
JO - Nature Medicine
JF - Nature Medicine
IS - 1
ER -