Identifying distinct subgroups of ICU patients: A machine learning approach

Kelly C. Vranas, Jeffrey K. Jopling, Timothy E. Sweeney, Meghan C. Ramsey, Arnold S. Milstein, Christopher G. Slatore, Gabriel J. Escobar, Vincent X. Liu

Research output: Contribution to journalArticlepeer-review

62 Scopus citations


Objectives: Identifying subgroups of ICU patients with similar clinical needs and trajectories may provide a framework for more efficient ICU care through the design of care platforms tailored around patients' shared needs. However, objective methods for identifying these ICU patient subgroups are lacking. We used a machine learning approach to empirically identify ICU patient subgroups through clustering analysis and evaluate whether these groups might represent appropriate targets for care redesign efforts. Design: We performed clustering analysis using data from patients' hospital stays to retrospectively identify patient subgroups from a large, heterogeneous ICU population. Setting: Kaiser Permanente Northern California, a healthcare delivery system serving 3.9 million members. Patients: ICU patients 18 years old or older with an ICU admission between January 1, 2012, and December 31, 2012, at one of 21 Kaiser Permanente Northern California hospitals. Interventions: None. Measurements and Main Results: We used clustering analysis to identify putative clusters among 5,000 patients randomly selected from 24,884 ICU patients. To assess cluster validity, we evaluated the distribution and frequency of patient characteristics and the need for invasive therapies. We then applied a classifier built from the sample cohort to the remaining 19,884 patients to compare the derivation and validation clusters. Clustering analysis successfully identified six clinically recognizable subgroups that differed significantly in all baseline characteristics and clinical trajectories, despite sharing common diagnoses. In the validation cohort, the proportion of patients assigned to each cluster was similar and demonstrated significant differences across clusters for all variables. Conclusions: A machine learning approach revealed important differences between empirically derived subgroups of ICU patients that are not typically revealed by admitting diagnosis or severity of illness alone. Similar data-driven approaches may provide a framework for future organizational innovations in ICU care tailored around patients' shared needs.

Original languageEnglish (US)
Pages (from-to)1607-1615
Number of pages9
JournalCritical care medicine
Issue number10
StatePublished - Oct 1 2017


  • clustering analysis
  • critical care
  • intensive care units
  • patient care management
  • unsupervised machine learning

ASJC Scopus subject areas

  • Critical Care and Intensive Care Medicine


Dive into the research topics of 'Identifying distinct subgroups of ICU patients: A machine learning approach'. Together they form a unique fingerprint.

Cite this