TY - JOUR
T1 - Artificial Intelligence
T2 - Large Language Models in Pediatrics. What Do We Know So Far?
AU - Mandelbaum, Maayan
AU - Levy-Erez, Daniella
AU - Soffer, Shelly
AU - Klang, Eyal
AU - Levy-Mendelovich, Sarina
N1 - Publisher Copyright:
© 2025 Israel Medical Association. All rights reserved.
PY - 2025/3
Y1 - 2025/3
N2 - Artificial Intelligence (Al), particularly large language models (LLMs) like OpenAI's ChatGPT, has shown potential in various medical fields, including pediatrics. We evaluated the utility and integration of LLMs in pediatric medicine. We conducted a search in PubMed using specific keywords related to LLMs and pediatric care. Studies were included if they assessed LLMs in pediatric settings, were published in English, peer-reviewed, and reported measurable outcomes. Sixteen studies spanning pediatric sub-specialties such as ophthalmology, cardiology, otology, and emergency medicine were analyzed. The findings indicate that LLMs provide valuable diagnostic support and information management. However, their performance varied, with limitations in complex clinical scenarios and decision-making. Despite excelling in tasks requiring data summarization and basic information delivery, the effectiveness of the models in nuanced clinical decision-making was restricted. LLMs, including ChatGPT, show promise in enhancing pediatric medical care but exhibit inconsistent performance in complex clinical situations. This finding underscores the importance of continuous human oversight. Future integration of LLMs into clinical practice should be approached with caution to ensure they supplement, rather than supplant, expert medical judgment.
AB - Artificial Intelligence (Al), particularly large language models (LLMs) like OpenAI's ChatGPT, has shown potential in various medical fields, including pediatrics. We evaluated the utility and integration of LLMs in pediatric medicine. We conducted a search in PubMed using specific keywords related to LLMs and pediatric care. Studies were included if they assessed LLMs in pediatric settings, were published in English, peer-reviewed, and reported measurable outcomes. Sixteen studies spanning pediatric sub-specialties such as ophthalmology, cardiology, otology, and emergency medicine were analyzed. The findings indicate that LLMs provide valuable diagnostic support and information management. However, their performance varied, with limitations in complex clinical scenarios and decision-making. Despite excelling in tasks requiring data summarization and basic information delivery, the effectiveness of the models in nuanced clinical decision-making was restricted. LLMs, including ChatGPT, show promise in enhancing pediatric medical care but exhibit inconsistent performance in complex clinical situations. This finding underscores the importance of continuous human oversight. Future integration of LLMs into clinical practice should be approached with caution to ensure they supplement, rather than supplant, expert medical judgment.
KW - ChatGPT
KW - artificial intelligence (Al)
KW - large language models (LLMs)
KW - pediatrics
UR - http://www.scopus.com/inward/record.url?scp=105001423322&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.systematicreview???
AN - SCOPUS:105001423322
SN - 1565-1088
VL - 27
SP - 183
EP - 188
JO - Israel Medical Association Journal
JF - Israel Medical Association Journal
ER -