Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale

of the Imaging and Informatics in Retinopathy of Prematurity Consortium

doi:10.1016/j.ophtha.2020.10.025

Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale

of the Imaging and Informatics in Retinopathy of Prematurity Consortium

Research output: Contribution to journal › Article › peer-review

43 Scopus citations

Abstract

Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.

Original language	English (US)
Pages (from-to)	1070-1076
Number of pages	7
Journal	Ophthalmology
Volume	128
Issue number	7
DOIs	https://doi.org/10.1016/j.ophtha.2020.10.025
State	Published - Jul 2021

ASJC Scopus subject areas

Ophthalmology

Access to Document

10.1016/j.ophtha.2020.10.025

Cite this

@article{43afc5e2f13a47cdbe3e7faa349e14dc,

title = "Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale",

abstract = "Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.",

author = "{of the Imaging and Informatics in Retinopathy of Prematurity Consortium} and Campbell, {J. Peter} and Kim, {Sang Jin} and Brown, {James M.} and Susan Ostmo and Chan, {R. V.Paul} and Jayashree Kalpathy-Cramer and Chiang, {Michael F.} and {Jin Kim}, Sang and Kemal Sonmez and Robert Schelonka and {Peter Campbell}, J. and {Paul Chan}, {R. V.} and Karyn Jonas and Jason Horowitz and Osode Coki and Eccles, {Cheryl Ann} and Leora Sarna and Anton Orlin and Audina Berrocal and Catherin Negron and Kimberly Denser and Kristi Cumming and Tammy Osentoski and Tammy Check and Mary Zajechowski and Thomas Lee and Aaron Nagiel and Evan Kruger and Kathryn McGovern and Charles Simmons and Raghu Murthy and Sharon Galvis and {Jerome Rotter MD}, {Rotter MD} and Ida Chen and Xiaohui Li and Kent Taylor and Kaye Roll and Deniz Erdogmus and Stratis Ioannidis and Martinez-Castellanos, {Maria Ana} and Samantha Salinas-Longoria and Rafael Romero and Andrea Arriola and Francisco Olguin-Manriquez and Miroslava Meraz-Gutierrez and Dulanto-Reinoso, {Carlos M.} and Cristina Montero-Mendoza",

note = "Publisher Copyright: {\textcopyright} 2020",

year = "2021",

month = jul,

doi = "10.1016/j.ophtha.2020.10.025",

language = "English (US)",

volume = "128",

pages = "1070--1076",

journal = "Ophthalmology",

issn = "0161-6420",

publisher = "Elsevier Inc.",

number = "7",

}

TY - JOUR

T1 - Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale

AU - of the Imaging and Informatics in Retinopathy of Prematurity Consortium

AU - Campbell, J. Peter

AU - Kim, Sang Jin

AU - Brown, James M.

AU - Ostmo, Susan

AU - Chan, R. V.Paul

AU - Kalpathy-Cramer, Jayashree

AU - Chiang, Michael F.

AU - Jin Kim, Sang

AU - Sonmez, Kemal

AU - Schelonka, Robert

AU - Peter Campbell, J.

AU - Paul Chan, R. V.

AU - Jonas, Karyn

AU - Horowitz, Jason

AU - Coki, Osode

AU - Eccles, Cheryl Ann

AU - Sarna, Leora

AU - Orlin, Anton

AU - Berrocal, Audina

AU - Negron, Catherin

AU - Denser, Kimberly

AU - Cumming, Kristi

AU - Osentoski, Tammy

AU - Check, Tammy

AU - Zajechowski, Mary

AU - Lee, Thomas

AU - Nagiel, Aaron

AU - Kruger, Evan

AU - McGovern, Kathryn

AU - Simmons, Charles

AU - Murthy, Raghu

AU - Galvis, Sharon

AU - Jerome Rotter MD, Rotter MD

AU - Chen, Ida

AU - Li, Xiaohui

AU - Taylor, Kent

AU - Roll, Kaye

AU - Erdogmus, Deniz

AU - Ioannidis, Stratis

AU - Martinez-Castellanos, Maria Ana

AU - Salinas-Longoria, Samantha

AU - Romero, Rafael

AU - Arriola, Andrea

AU - Olguin-Manriquez, Francisco

AU - Meraz-Gutierrez, Miroslava

AU - Dulanto-Reinoso, Carlos M.

AU - Montero-Mendoza, Cristina

PY - 2021/7

Y1 - 2021/7

N2 - Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.

AB - Purpose: To evaluate the clinical usefulness of a quantitative deep learning-derived vascular severity score for retinopathy of prematurity (ROP) by assessing its correlation with clinical ROP diagnosis and by measuring clinician agreement in applying a novel scale. Design: Analysis of existing database of posterior pole fundus images and corresponding ophthalmoscopic examinations using 2 methods of assigning a quantitative scale to vascular severity. Participants: Images were from clinical examinations of patients in the Imaging and Informatics in ROP Consortium. Four ophthalmologists and 1 study coordinator evaluated vascular severity on a scale from 1 to 9. Methods: A quantitative vascular severity score (1–9) was applied to each image using a deep learning algorithm. A database of 499 images was developed for assessment of interobserver agreement. Main Outcome Measures: Distribution of deep learning-derived vascular severity scores with the clinical assessment of zone (I, II, or III), stage (0, 1, 2, or 3), and extent (<3 clock hours, 3–6 clock hours, and >6 clock hours) of stage 3 evaluated using multivariate linear regression and weighted κ values and Pearson correlation coefficients for interobserver agreement on a 1-to-9 vascular severity scale. Results: For deep learning analysis, a total of 6344 clinical examinations were analyzed. A higher deep learning-derived vascular severity score was associated with more posterior disease, higher disease stage, and higher extent of stage 3 disease (P < 0.001 for all). For a given ROP stage, the vascular severity score was higher in zone I than zones II or III (P < 0.001). Multivariate regression found zone, stage, and extent all were associated independently with the severity score (P < 0.001 for all). For interobserver agreement, the mean ± standard deviation weighted κ value was 0.67 ± 0.06, and the Pearson correlation coefficient ± standard deviation was 0.88 ± 0.04 on the use of a 1-to-9 vascular severity scale. Conclusions: A vascular severity scale for ROP seems feasible for clinical adoption; corresponds with zone, stage, extent of stage 3, and plus disease; and facilitates the use of objective technology such as deep learning to improve the consistency of ROP diagnosis.

UR - http://www.scopus.com/inward/record.url?scp=85096572102&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85096572102&partnerID=8YFLogxK

U2 - 10.1016/j.ophtha.2020.10.025

DO - 10.1016/j.ophtha.2020.10.025

M3 - Article

C2 - 33121959

AN - SCOPUS:85096572102

SN - 0161-6420

VL - 128

SP - 1070

EP - 1076

JO - Ophthalmology

JF - Ophthalmology

IS - 7

ER -

Evaluation of a Deep Learning–Derived Quantitative Retinopathy of Prematurity Severity Scale

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this