Automated screening for myelodysplastic syndromes through analysis of complete blood count and cell population data parameters

Philipp W. Raess, Gert Jan M. van de Geijn, Tjin L. Njo, Boudewijn Klop, Dmitry Sukhachev, Gerald Wertheim, Tom Mcaleer, Stephen R. Master, Adam Bagg

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

The diagnosis of myelodysplastic syndromes (MDS) requires a high clinical index of suspicion to prompt bone marrow studies as well as subjective assessment of dysplastic morphology. We sought to determine if data collected by automated hematology analyzers during complete blood count (CBC) analysis might help to identify MDS in a routine clinical setting. We collected CBC parameters (including those for research use only and cell population data) and demographic information in a large (>5,000), unselected sequential cohort of outpatients. The cohort was divided into independent training and test groups to develop and validate a random forest classifier that identifies MDS. The classifier effectively identified MDS and had a receiver operating characteristic area under the curve (AUC) of 0.942. Platelet distribution width and the standard deviation of red blood cell distribution width were the most discriminating variables within the classifier. Additionally, a similar classifier was validated with an additional, independent set of >200 patients from a second institution with an AUC of 0.93. This retrospective study demonstrates the feasibility of identifying MDS in an unselected outpatient population using data routinely collected during CBC analysis with a classifier that has been validated using two independent data sets from different institutions.

Original languageEnglish (US)
Pages (from-to)369-374
Number of pages6
JournalAmerican Journal of Hematology
Volume89
Issue number4
DOIs
StatePublished - Apr 2014
Externally publishedYes

ASJC Scopus subject areas

  • Hematology

Fingerprint

Dive into the research topics of 'Automated screening for myelodysplastic syndromes through analysis of complete blood count and cell population data parameters'. Together they form a unique fingerprint.

Cite this