TY - JOUR
T1 - Partially versus purely data-driven approaches in SARS-CoV-2 prediction
AU - Shilbayeh, Samar A.
AU - Abonamah, Abdullah
AU - Masri, Ahmad A.
N1 - Funding Information:
Conflicts of Interest: Ahmad Masri received research grants from Pfizer and Akcea (paid to OHSU) and serves on an advisory board with Ionis. None of these have conflicts related to this work, and the other authors have no conflict of interest.
Publisher Copyright:
© 2020 by the authors.
PY - 2020/8
Y1 - 2020/8
N2 - Prediction models of coronavirus disease utilizing machine learning algorithms range from forecasting future suspect cases, predicting mortality rates, to building a pattern for country-specific pandemic end date. To predict the future suspect infection and death cases, we categorized the approaches found in the literature into: first, a purely data-driven approach, whose goal is to build a mathematical model that relates the data variables including outputs with inputs to detect general patterns. The discovered patterns can then be used to predict the future infected cases without any expert input. The second approach is partially data-driven; it uses historical data, but allows expert input such as the SIR epidemic algorithm. This approach assumes that the epidemic will end according to medical reasoning. In this paper, we compare the purely data-driven and partially-data driven approaches by applying them to data from three countries having different past pattern behavior. The countries are the US, Jordan, and Italy. It is found that those two prediction approaches yield significantly different results. Purely data-driven approach depends totally on the past behavior and does not show any decline in the number of the infected cases if the country did not experience any decline in the number of cases. On the other hand, a partially data-driven approach guarantees a timely decline of the infected curve to reach zero. Using the two approaches highlights the importance of human intervention in pandemic prediction to guide the learning process as opposed to the purely data-driven approach that predicts future cases based on the pattern detected in the data.
AB - Prediction models of coronavirus disease utilizing machine learning algorithms range from forecasting future suspect cases, predicting mortality rates, to building a pattern for country-specific pandemic end date. To predict the future suspect infection and death cases, we categorized the approaches found in the literature into: first, a purely data-driven approach, whose goal is to build a mathematical model that relates the data variables including outputs with inputs to detect general patterns. The discovered patterns can then be used to predict the future infected cases without any expert input. The second approach is partially data-driven; it uses historical data, but allows expert input such as the SIR epidemic algorithm. This approach assumes that the epidemic will end according to medical reasoning. In this paper, we compare the purely data-driven and partially-data driven approaches by applying them to data from three countries having different past pattern behavior. The countries are the US, Jordan, and Italy. It is found that those two prediction approaches yield significantly different results. Purely data-driven approach depends totally on the past behavior and does not show any decline in the number of the infected cases if the country did not experience any decline in the number of cases. On the other hand, a partially data-driven approach guarantees a timely decline of the infected curve to reach zero. Using the two approaches highlights the importance of human intervention in pandemic prediction to guide the learning process as opposed to the purely data-driven approach that predicts future cases based on the pattern detected in the data.
KW - Corona virus detection
KW - Exponential regression
KW - Exponential smoothing model
KW - Infected cases prediction
KW - Linear regression
KW - Partially data-driven approach
KW - Purely data-driven approach
KW - SIR model
UR - http://www.scopus.com/inward/record.url?scp=85089944449&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089944449&partnerID=8YFLogxK
U2 - 10.3390/app10165696
DO - 10.3390/app10165696
M3 - Article
AN - SCOPUS:85089944449
SN - 2076-3417
VL - 10
JO - Applied Sciences (Switzerland)
JF - Applied Sciences (Switzerland)
IS - 16
M1 - 5696
ER -