TY - JOUR
T1 - Deep learning for embryo evaluation using time-lapse
T2 - a systematic review of diagnostic test accuracy
AU - Berman, Aya
AU - Anteby, Roi
AU - Efros, Orly
AU - Klang, Eyal
AU - Soffer, Shelly
N1 - Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2023/11
Y1 - 2023/11
N2 - Objective: This study aimed to investigate the accuracy of convolutional neural network models in the assessment of embryos using time-lapse monitoring. Data Sources: A systematic search was conducted in PubMed and Web of Science databases from January 2016 to December 2022. The search strategy was carried out by using key words and MeSH (Medical Subject Headings) terms. Study Eligibility Criteria: Studies were included if they reported the accuracy of convolutional neural network models for embryo evaluation using time-lapse monitoring. The review was registered with PROSPERO (International Prospective Register of Systematic Reviews; identification number CRD42021275916). Methods: Two reviewer authors independently screened results using the Covidence systematic review software. The full-text articles were reviewed when studies met the inclusion criteria or in any uncertainty. Nonconsensus was resolved by a third reviewer. Risk of bias and applicability were evaluated using the QUADAS-2 tool and the modified Joanna Briggs Institute or JBI checklist. Results: Following a systematic search of the literature, 22 studies were identified as eligible for inclusion. All studies were retrospective. A total of 522,516 images of 222,998 embryos were analyzed. Three main outcomes were evaluated: successful in vitro fertilization, blastocyst stage classification, and blastocyst quality. Most studies reported >80% accuracy, and embryologists were outperformed in some. Ten studies had a high risk of bias, mostly because of patient bias. Conclusion: The application of artificial intelligence in time-lapse monitoring has the potential to provide more efficient, accurate, and objective embryo evaluation. Models that examined blastocyst stage classification showed the best predictions. Models that predicted live birth had a low risk of bias, used the largest databases, and had external validation, which heightens their relevance to clinical application. Our systematic review is limited by the high heterogeneity among the included studies. Researchers should share databases and standardize reporting.
AB - Objective: This study aimed to investigate the accuracy of convolutional neural network models in the assessment of embryos using time-lapse monitoring. Data Sources: A systematic search was conducted in PubMed and Web of Science databases from January 2016 to December 2022. The search strategy was carried out by using key words and MeSH (Medical Subject Headings) terms. Study Eligibility Criteria: Studies were included if they reported the accuracy of convolutional neural network models for embryo evaluation using time-lapse monitoring. The review was registered with PROSPERO (International Prospective Register of Systematic Reviews; identification number CRD42021275916). Methods: Two reviewer authors independently screened results using the Covidence systematic review software. The full-text articles were reviewed when studies met the inclusion criteria or in any uncertainty. Nonconsensus was resolved by a third reviewer. Risk of bias and applicability were evaluated using the QUADAS-2 tool and the modified Joanna Briggs Institute or JBI checklist. Results: Following a systematic search of the literature, 22 studies were identified as eligible for inclusion. All studies were retrospective. A total of 522,516 images of 222,998 embryos were analyzed. Three main outcomes were evaluated: successful in vitro fertilization, blastocyst stage classification, and blastocyst quality. Most studies reported >80% accuracy, and embryologists were outperformed in some. Ten studies had a high risk of bias, mostly because of patient bias. Conclusion: The application of artificial intelligence in time-lapse monitoring has the potential to provide more efficient, accurate, and objective embryo evaluation. Models that examined blastocyst stage classification showed the best predictions. Models that predicted live birth had a low risk of bias, used the largest databases, and had external validation, which heightens their relevance to clinical application. Our systematic review is limited by the high heterogeneity among the included studies. Researchers should share databases and standardize reporting.
KW - artificial intelligence
KW - convolutional neural network
KW - deep learning
KW - embryo assessment
KW - time-lapse monitor
UR - http://www.scopus.com/inward/record.url?scp=85163368741&partnerID=8YFLogxK
U2 - 10.1016/j.ajog.2023.04.027
DO - 10.1016/j.ajog.2023.04.027
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.systematicreview???
C2 - 37116822
AN - SCOPUS:85163368741
SN - 0002-9378
VL - 229
SP - 490
EP - 501
JO - American Journal of Obstetrics and Gynecology
JF - American Journal of Obstetrics and Gynecology
IS - 5
ER -