TY - JOUR
T1 - A multi-layer model for the early detection of COVID-19
AU - Shmueli, Erez
AU - Mansuri, Ronen
AU - Porcilan, Matan
AU - Amir, Tamar
AU - Yosha, Lior
AU - Yechezkel, Matan
AU - Patalon, Tal
AU - Handelman-Gotlib, Sharon
AU - Gazit, Sivan
AU - Yamin, Dan
N1 - Publisher Copyright:
© 2021 The Authors.
PY - 2021/8/4
Y1 - 2021/8/4
N2 - Current COVID-19 screening efforts mainly rely on reported symptoms and the potential exposure to infected individuals. Here, we developed a machine-learning model for COVID-19 detection that uses four layers of information: (i) sociodemographic characteristics of the individual, (ii) spatio-temporal patterns of the disease, (iii) medical condition and general health consumption of the individual and (iv) information reported by the individual during the testing episode. We evaluated our model on 140 682 members of Maccabi Health Services who were tested for COVID-19 at least once between February and October 2020. These individuals underwent, in total, 264 516 COVID-19 PCR tests, out of which 16 512 were positive. Our multi-layer model obtained an area under the curve (AUC) of 81.6% when evaluated over all the individuals in the dataset, and an AUC of 72.8% when only individuals who did not report any symptom were included. Furthermore, considering only information collected before the testing episode - i.e. before the individual had the chance to report on any symptom - our model could reach a considerably high AUC of 79.5%. Our ability to predict early on the outcomes of COVID-19 tests is pivotal for breaking transmission chains, and can be used for a more efficient testing policy.
AB - Current COVID-19 screening efforts mainly rely on reported symptoms and the potential exposure to infected individuals. Here, we developed a machine-learning model for COVID-19 detection that uses four layers of information: (i) sociodemographic characteristics of the individual, (ii) spatio-temporal patterns of the disease, (iii) medical condition and general health consumption of the individual and (iv) information reported by the individual during the testing episode. We evaluated our model on 140 682 members of Maccabi Health Services who were tested for COVID-19 at least once between February and October 2020. These individuals underwent, in total, 264 516 COVID-19 PCR tests, out of which 16 512 were positive. Our multi-layer model obtained an area under the curve (AUC) of 81.6% when evaluated over all the individuals in the dataset, and an AUC of 72.8% when only individuals who did not report any symptom were included. Furthermore, considering only information collected before the testing episode - i.e. before the individual had the chance to report on any symptom - our model could reach a considerably high AUC of 79.5%. Our ability to predict early on the outcomes of COVID-19 tests is pivotal for breaking transmission chains, and can be used for a more efficient testing policy.
KW - COVID-19
KW - early detection
KW - electronic medical records
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85113522510&partnerID=8YFLogxK
U2 - 10.1098/rsif.2021.0284
DO - 10.1098/rsif.2021.0284
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 34343454
AN - SCOPUS:85113522510
SN - 1742-5689
VL - 18
JO - Journal of the Royal Society Interface
JF - Journal of the Royal Society Interface
IS - 181
M1 - 20210284
ER -