Deep learning in voice analysis for diagnosing vocal cord pathologies: a systematic review

Idit Tessler*, Adi Primov-Fever, Shelly Soffer, Roi Anteby, Nir A. Gecel, Nir Livneh, Eran E. Alon, Eyal Zimlichman, Eyal Klang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Objectives: With smartphones and wearable devices becoming ubiquitous, they offer an opportunity for large-scale voice sampling. This systematic review explores the application of deep learning models for the automated analysis of voice samples to detect vocal cord pathologies. Methods: We conducted a systematic literature review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) reporting guidelines. We searched MEDLINE and Embase databases for original publications on deep learning applications for diagnosing vocal cord pathologies between 2002 and 2022. Risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2). Results: Out of the 14 studies that met the inclusion criteria, data from a total of 3037 patients were analyzed. All studies were retrospective. Deep learning applications targeted Reinke's edema, nodules, polyps, cysts, unilateral cord paralysis, and vocal fold cancer detection. Most pathologies had detection accuracy above 90%. Thirteen studies (93%) exhibited a high risk of bias and concerns about applicability. Conclusions: Technology holds promise for enhancing the screening and diagnosis of vocal cord pathologies. While current research is limited, the presented studies offer proof of concept for developing larger-scale solutions.

Original languageEnglish
Pages (from-to)863-871
Number of pages9
JournalEuropean Archives of Oto-Rhino-Laryngology
Volume281
Issue number2
DOIs
StatePublished - Feb 2024

Keywords

  • Convolutional neural network
  • Deep learning
  • Neural network
  • Recurrent neural network
  • Vocal cord pathologies
  • Voice

Fingerprint

Dive into the research topics of 'Deep learning in voice analysis for diagnosing vocal cord pathologies: a systematic review'. Together they form a unique fingerprint.

Cite this