TY - JOUR
T1 - Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale
AU - of the Imaging and Informatics in Retinopathy of Prematurity Consortium
AU - Campbell, J. Peter
AU - Kim, Sang Jin
AU - Brown, James M.
AU - Ostmo, Susan
AU - Chan, R. V.Paul
AU - Kalpathy-Cramer, Jayashree
AU - Chiang, Michael F.
AU - Jin Kim, Sang
AU - Sonmez, Kemal
AU - Schelonka, Robert
AU - Peter Campbell, J.
AU - Paul Chan, R. V.
AU - Jonas, Karyn
AU - Horowitz, Jason
AU - Coki, Osode
AU - Eccles, Cheryl Ann
AU - Sarna, Leora
AU - Orlin, Anton
AU - Berrocal, Audina
AU - Negron, Catherin
AU - Denser, Kimberly
AU - Cumming, Kristi
AU - Osentoski, Tammy
AU - Check, Tammy
AU - Zajechowski, Mary
AU - Lee, Thomas
AU - Nagiel, Aaron
AU - Kruger, Evan
AU - McGovern, Kathryn
AU - Simmons, Charles
AU - Murthy, Raghu
AU - Galvis, Sharon
AU - Jerome Rotter MD, Rotter MD
AU - Chen, Ida
AU - Li, Xiaohui
AU - Taylor, Kent
AU - Roll, Kaye
AU - Erdogmus, Deniz
AU - Ioannidis, Stratis
AU - Martinez-Castellanos, Maria Ana
AU - Salinas-Longoria, Samantha
AU - Romero, Rafael
AU - Arriola, Andrea
AU - Olguin-Manriquez, Francisco
AU - Meraz-Gutierrez, Miroslava
AU - Dulanto-Reinoso, Carlos M.
AU - Montero-Mendoza, Cristina
N1 - Funding Information:
Supported by the National Institutes of Health , Bethesda Maryland (grant nos.: R01EY19474 , K12EY27720 , and P30EY10572 ); the National Science Foundation , Arlington, Virginia (grant no.: SCH-1622679 ); and Research to Prevent Blindness , Inc., New York, New York (unrestricted departmental funding and a Career Development Award [J.P.C.]). None of the funding agencies had any role in the design and conduct of the study; collection, management, analysis, or interpretation of the data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.
Publisher Copyright:
© 2020
PY - 2021/7
Y1 - 2021/7
N2 - Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.
AB - Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.
UR - http://www.scopus.com/inward/record.url?scp=85096572102&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096572102&partnerID=8YFLogxK
U2 - 10.1016/j.ophtha.2020.10.025
DO - 10.1016/j.ophtha.2020.10.025
M3 - Article
C2 - 33121959
AN - SCOPUS:85096572102
SN - 0161-6420
VL - 128
SP - 1070
EP - 1076
JO - Ophthalmology
JF - Ophthalmology
IS - 7
ER -