TY - JOUR
T1 - Deep learning in voice analysis for diagnosing vocal cord pathologies
T2 - a systematic review
AU - Tessler, Idit
AU - Primov-Fever, Adi
AU - Soffer, Shelly
AU - Anteby, Roi
AU - Gecel, Nir A.
AU - Livneh, Nir
AU - Alon, Eran E.
AU - Zimlichman, Eyal
AU - Klang, Eyal
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2024/2
Y1 - 2024/2
N2 - Objectives: With smartphones and wearable devices becoming ubiquitous, they offer an opportunity for large-scale voice sampling. This systematic review explores the application of deep learning models for the automated analysis of voice samples to detect vocal cord pathologies. Methods: We conducted a systematic literature review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) reporting guidelines. We searched MEDLINE and Embase databases for original publications on deep learning applications for diagnosing vocal cord pathologies between 2002 and 2022. Risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2). Results: Out of the 14 studies that met the inclusion criteria, data from a total of 3037 patients were analyzed. All studies were retrospective. Deep learning applications targeted Reinke's edema, nodules, polyps, cysts, unilateral cord paralysis, and vocal fold cancer detection. Most pathologies had detection accuracy above 90%. Thirteen studies (93%) exhibited a high risk of bias and concerns about applicability. Conclusions: Technology holds promise for enhancing the screening and diagnosis of vocal cord pathologies. While current research is limited, the presented studies offer proof of concept for developing larger-scale solutions.
AB - Objectives: With smartphones and wearable devices becoming ubiquitous, they offer an opportunity for large-scale voice sampling. This systematic review explores the application of deep learning models for the automated analysis of voice samples to detect vocal cord pathologies. Methods: We conducted a systematic literature review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) reporting guidelines. We searched MEDLINE and Embase databases for original publications on deep learning applications for diagnosing vocal cord pathologies between 2002 and 2022. Risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2). Results: Out of the 14 studies that met the inclusion criteria, data from a total of 3037 patients were analyzed. All studies were retrospective. Deep learning applications targeted Reinke's edema, nodules, polyps, cysts, unilateral cord paralysis, and vocal fold cancer detection. Most pathologies had detection accuracy above 90%. Thirteen studies (93%) exhibited a high risk of bias and concerns about applicability. Conclusions: Technology holds promise for enhancing the screening and diagnosis of vocal cord pathologies. While current research is limited, the presented studies offer proof of concept for developing larger-scale solutions.
KW - Convolutional neural network
KW - Deep learning
KW - Neural network
KW - Recurrent neural network
KW - Vocal cord pathologies
KW - Voice
UR - http://www.scopus.com/inward/record.url?scp=85179652439&partnerID=8YFLogxK
U2 - 10.1007/s00405-023-08362-6
DO - 10.1007/s00405-023-08362-6
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 38091100
AN - SCOPUS:85179652439
SN - 0937-4477
VL - 281
SP - 863
EP - 871
JO - European Archives of Oto-Rhino-Laryngology
JF - European Archives of Oto-Rhino-Laryngology
IS - 2
ER -