Machine learning derivation of four computable 24-h pediatric sepsis phenotypes to facilitate enrollment in early personalized anti-inflammatory clinical trials. Author Yidi Qin, Kate Kernan, Zhenjiang Fan, Hyun-Jung Park, Soyeon Kim, Scott Canna, John Kellum, Robert Berg, David Wessel, Murray Pollack, Kathleen Meert, Mark Hall, Christopher Newth, John Lin, Allan Doctor, Tom Shanley, Tim Cornell, Rick Harrison, Athena Zuppa, Russell Banks, Ron Reeder, Richard Holubkov, Daniel Notterman, J Michael Dean, Joseph Carcillo Publication Year 2022 Type Journal Article Abstract BACKGROUND: Thrombotic microangiopathy-induced thrombocytopenia-associated multiple organ failure and hyperinflammatory macrophage activation syndrome are important causes of late pediatric sepsis mortality that are often missed or have delayed diagnosis. The National Institutes of General Medical Science sepsis research working group recommendations call for application of new research approaches in extant clinical data sets to improve efficiency of early trials of new sepsis therapies. Our objective is to apply machine learning approaches to derive computable 24-h sepsis phenotypes to facilitate personalized enrollment in early anti-inflammatory trials targeting these conditions.METHODS: We applied consensus, k-means clustering analysis to our extant PHENOtyping sepsis-induced Multiple organ failure Study (PHENOMS) dataset of 404 children. 24-hour computable phenotypes are derived using 25 available bedside variables including C-reactive protein and ferritin.RESULTS: Four computable phenotypes (PedSep-A, B, C, and D) are derived. Compared to all other phenotypes, PedSep-A patients (n = 135; 2% mortality) were younger and previously healthy, with the lowest C-reactive protein and ferritin levels, the highest lymphocyte and platelet counts, highest heart rate, and lowest creatinine (p < 0.05); PedSep-B patients (n = 102; 12% mortality) were most likely to be intubated and had the lowest Glasgow Coma Scale Score (p < 0.05); PedSep-C patients (n = 110; mortality 10%) had the highest temperature and Glasgow Coma Scale Score, least pulmonary failure, and lowest lymphocyte counts (p < 0.05); and PedSep-D patients (n = 56, 34% mortality) had the highest creatinine and number of organ failures, including renal, hepatic, and hematologic organ failure, with the lowest platelet counts (p < 0.05). PedSep-D had the highest likelihood of developing thrombocytopenia-associated multiple organ failure (Adj OR 47.51 95% CI [18.83-136.83], p < 0.0001) and macrophage activation syndrome (Adj OR 38.63 95% CI [13.26-137.75], p < 0.0001).CONCLUSIONS: Four computable phenotypes are derived, with PedSep-D being optimal for enrollment in early personalized anti-inflammatory trials targeting thrombocytopenia-associated multiple organ failure and macrophage activation syndrome in pediatric sepsis. A computer tool for identification of individual patient membership ( www.pedsepsis.pitt.edu ) is provided. Reproducibility will be assessed at completion of two ongoing pediatric sepsis studies. Keywords Humans, Phenotype, Reproducibility of Results, Anti-Inflammatory Agents, Child, Machine Learning, C-Reactive Protein, Sepsis, Multiple Organ Failure, Thrombocytopenia, Clinical Trials as Topic, Creatinine, Ferritins, Macrophage Activation Syndrome, Organ Dysfunction Scores Journal Crit Care Volume 26 Issue 1 Pages 128 Date Published 2022 May 07 ISSN Number 1466-609X DOI 10.1186/s13054-022-03977-3 Alternate Journal Crit Care PMCID PMC9077858 PMID 35526000 PubMedPubMed CentralGoogle ScholarBibTeXEndNote X3 XML