基于机器学习的ICU重症肺炎患者MDRO感染危险因素分析及其预测模型

Risk factors for multidrug-resistant organism infection in intensive care unit patients with severe pneumonia based on machine learning and its prediction model

  • 摘要:
    目的 探究重症监护室(ICU)重症肺炎患者感染多重耐药菌(MDRO)的危险因素,基于机器学习构建MDRO感染的预测模型,为临床决策提供参考。
    方法 回顾性收集牡丹江医科大学附属红旗医院2021年1月-2023年12月收治的202例ICU重症肺炎患者的临床资料,分析感染病原菌及耐药性。将入住ICU后是否为MDRO感染将其分为MDRO组67例和非MDRO组135例,应用logistic回归分析MDRO感染的危险因素。以发生MDRO感染为最终结局,将数据集按7∶3随机划分为训练集和测试集。构建3种机器学习模型(随机森林、梯度提升、自适应增强)。对每种模型进行五折交叉验证,以曲线下面积(AUC)及阳性预测率作为主要指标评估模型性能。
    结果 202例重症肺炎患者共培养分离病原菌271株,其中革兰阴性菌193株占71.22%(以肺炎克雷伯菌为主),真菌45株占16.60%,革兰阳性菌33株占12.17%。多重耐药菌检出率为19.19%(52/271),肺炎克雷伯菌超广谱β-内酰胺酶(ESBLs)的检出率为30.56%(22/72)。ICU住院时长(OR=1.092)及APACHEⅡ评分(OR=1.123)是ICU重症肺炎患者发生MDRO感染的危险因素。在构建的3种机器学习模型中,梯度提升算法所构建的模型在AUC评价指标中优于其他模型,为0.916,其中随机森林算法、自适应增强算法的AUC值也均>0.9,但其阳性预测率低于梯度提升算法,因此梯度提升算法模型具有更好的预测能力。
    结论 梯度提升模型效能最优,识别出中性粒细胞比率升高、机械通气时长增加、血小板计数异常及抗菌药物使用≥3种等预测因素,为MDRO感染早期预警与个体化防控提供参考。

     

    Abstract:
    OBJECTIVE  To explore the risk factors for infection with multidrug-resistant organisms (MDRO) in patients with severe pneumonia in the intensive care unit (ICU), and to construct a predictive model for MDRO infection based on machine learning, thereby providing a reference for clinical decision-making.
    METHODS  Clinical data from 202 ICU patients with severe pneumonia admitted to Hongqi Hospital Affiliated to Mudanjiang Medical University from Jan. 2021 to Dec. 2023 were retrospectively collected to analyze the infectious pathogens and their drug resistance. Patients were divided into an MDRO group (n=67) and a non-MDRO group (n=135) according to the presence or absence of MDRO infection after ICU admission. Logistic regression analysis was employed to identify risk factors for MDRO infection. With the occurrence of MDRO infection as the final outcome, the dataset was randomly divided into a training set (70%) and a test set (30%). Three machine learning models (Random Forest, Gradient Boosting and AdaBoost) were developed. Each model underwent five-fold cross-validation, and the area under the curve (AUC) and positive predictive rate were adopted as the primary indicators to evaluate model performance.
    RESULTS  A total of 271 strains of pathogenic bacteria were isolated from the 202 patients with severe pneumonia, including 193 strains (71.22%) of gram-negative bacteria (mainly Klebsiella pneumoniae), 45 strains (16.60%) of fungi and 33 strains (12.17%) of gram-positive bacteria. The isolation rate of MDROs was 19.19%(52/271), and the isolation rate of extended-spectrum β-lactamases (ESBLs)-producing in K. pneumoniae was up to 30.56%(22/72). The length of ICU stay (OR=1.092) and the APACHE II score (OR=1.123) were identified as risk factors for MDRO infection in ICU patients with severe pneumonia. Among the three machine learning models, the Gradient Boosting algorithm achieved the highest AUC (0.916). Although the AUC values of the Random Forest and AdaBoost algorithms were also >0.9, their positive predictive rates were lower than that of the Gradient Boosting algorithm. Therefore, the Gradient Boosting algorithm model demonstrated superior predictive capability.
    CONCLUSIONS  The Gradient Boosting model exhibits the best performance, identifying predictive factors such as an elevated neutrophil ratio, prolonged mechanical ventilation duration, abnormal platelet count and the use of ≥3 antimicrobial agents. This provides a reference for early warning and individualized prevention and control of MDRO infection.

     

/

返回文章
返回