TY - JOUR
T1 - The rSNA pediatric bone age machine learning challenge
AU - Halabi, Safwan S.
AU - Prevedello, Luciano M.
AU - Kalpathy-Cramer, Jayashree
AU - Mamonov, Artem B.
AU - Bilbily, Alexander
AU - Cicero, Mark
AU - Pan, Ian
AU - Pereira, Lucas Araújo
AU - Sousa, Rafael Teixeira
AU - Abdala, Nitamar
AU - Kitamura, Felipe Campos
AU - Thodberg, Hans H.
AU - Chen, Leon
AU - Shih, George
AU - Andriole, Katherine
AU - Kohli, Marc D.
AU - Erickson, Bradley J.
AU - Flanders, Adam E.
N1 - Funding Information:
Supported by the National Institute for Health Research (U24CA180927). We thank Matthew C. Chen, MS, for his assistance curating and organizing the data sets (hand radiographs and annotations) for the RSNA Pediatric Bone Age Machine Learning Challenge. This challenge was hosted on the MedICI platform (built CodaLab) provided by Jayashree Kalpathy-Cramer and Massachusetts General Hospital and funded through a contract with Leidos
Funding Information:
Supported by the National Institute for Health Research (U24CA180927). Conflicts of interest are listed at the end of this article. See also the editorial by Siegel in this issue.
Publisher Copyright:
© RSNA, 2018.
PY - 2019/3/1
Y1 - 2019/3/1
N2 - Purpose: The Radiological Society of North America (RSNA) Pediatric Bone Age Machine Learning Challenge was created to show an application of machine learning (ML) and artificial intelligence (AI) in medical imaging, promote collaboration to catalyze AI model creation, and identify innovators in medical imaging. Materials and Methods: The goal of this challenge was to solicit individuals and teams to create an algorithm or model using ML techniques that would accurately determine skeletal age in a curated data set of pediatric hand radiographs. The primary evaluation measure was the mean absolute distance (MAD) in months, which was calculated as the mean of the absolute values of the difference between the model estimates and those of the reference standard, bone age. Results: A data set consisting of 14 236 hand radiographs (12 611 training set, 1425 validation set, 200 test set) was made available to registered challenge participants. A total of 260 individuals or teams registered on the Challenge website. A total of 105 submissions were uploaded from 48 unique users during the training, validation, and test phases. Almost all methods used deep neural network techniques based on one or more convolutional neural networks (CNNs). The best five results based on MAD were 4.2, 4.4, 4.4, 4.5, and 4.5 months, respectively. Conclusion: The RSNA Pediatric Bone Age Machine Learning Challenge showed how a coordinated approach to solving a medical imaging problem can be successfully conducted. Future ML challenges will catalyze collaboration and development of ML tools and methods that can potentially improve diagnostic accuracy and patient care.
AB - Purpose: The Radiological Society of North America (RSNA) Pediatric Bone Age Machine Learning Challenge was created to show an application of machine learning (ML) and artificial intelligence (AI) in medical imaging, promote collaboration to catalyze AI model creation, and identify innovators in medical imaging. Materials and Methods: The goal of this challenge was to solicit individuals and teams to create an algorithm or model using ML techniques that would accurately determine skeletal age in a curated data set of pediatric hand radiographs. The primary evaluation measure was the mean absolute distance (MAD) in months, which was calculated as the mean of the absolute values of the difference between the model estimates and those of the reference standard, bone age. Results: A data set consisting of 14 236 hand radiographs (12 611 training set, 1425 validation set, 200 test set) was made available to registered challenge participants. A total of 260 individuals or teams registered on the Challenge website. A total of 105 submissions were uploaded from 48 unique users during the training, validation, and test phases. Almost all methods used deep neural network techniques based on one or more convolutional neural networks (CNNs). The best five results based on MAD were 4.2, 4.4, 4.4, 4.5, and 4.5 months, respectively. Conclusion: The RSNA Pediatric Bone Age Machine Learning Challenge showed how a coordinated approach to solving a medical imaging problem can be successfully conducted. Future ML challenges will catalyze collaboration and development of ML tools and methods that can potentially improve diagnostic accuracy and patient care.
UR - http://www.scopus.com/inward/record.url?scp=85060370230&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060370230&partnerID=8YFLogxK
U2 - 10.1148/radiol.2018180736
DO - 10.1148/radiol.2018180736
M3 - Article
C2 - 30480490
AN - SCOPUS:85060370230
SN - 0033-8419
VL - 290
SP - 498
EP - 503
JO - RADIOLOGY
JF - RADIOLOGY
IS - 3
ER -