TY - JOUR
T1 - Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity
AU - for the Imaging and Informatics In Retinopathy Of Prematurity (i-ROP) Research Consortium
AU - Redd, Travis K.
AU - Campbell, John Peter
AU - Brown, James M.
AU - Kim, Sang Jin
AU - Ostmo, Susan
AU - Chan, Robison Vernon Paul
AU - Dy, Jennifer
AU - Erdogmus, Deniz
AU - Ioannidis, Stratis
AU - Kalpathy-Cramer, Jayashree
AU - Chiang, Michael F.
N1 - Funding Information:
Supported by grants R01EY19474, K12 EY027720, P30EY10572, and P30EY001792 from the National Institutes of Health (Bethesda, Maryland, USA) by grants SCH-1622679, SCH-1622542, and SCH-1622536 from the National Science Foundation (Arlington, Virginia, USA), and by unrestricted departmental funding from Research to Prevent Blindness (New York, New York, USA).
Publisher Copyright:
© Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.2019.
PY - 2019/5/1
Y1 - 2019/5/1
N2 - Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.
AB - Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.
KW - child health (paediatrics)
KW - public health
KW - retina
KW - telemedicine
UR - http://www.scopus.com/inward/record.url?scp=85057264647&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057264647&partnerID=8YFLogxK
U2 - 10.1136/bjophthalmol-2018-313156
DO - 10.1136/bjophthalmol-2018-313156
M3 - Article
C2 - 30470715
AN - SCOPUS:85057264647
SN - 0007-1161
VL - 103
SP - 580
EP - 584
JO - British Journal of Ophthalmology
JF - British Journal of Ophthalmology
IS - 5
ER -