Skip to main content

An explainable machine learning-based model to predict intensive care unit admission among patients with community-acquired pneumonia and connective tissue disease

Abstract

Background

There is no individualized prediction model for intensive care unit (ICU) admission on patients with community-acquired pneumonia (CAP) and connective tissue disease (CTD) so far. In this study, we aimed to establish a machine learning-based model for predicting the need for ICU admission among those patients.

Methods

This was a retrospective study on patients admitted into a University Hospital in China between November 2008 and November 2021. Patients were included if they were diagnosed with CAP and CTD during admission and hospitalization. Data related to demographics, CTD types, comorbidities, vital signs and laboratory results during the first 24 h of hospitalization were collected. The baseline variables were screened to identify potential predictors via three methods, including univariate analysis, least absolute shrinkage and selection operator (Lasso) regression and Boruta algorithm. Nine supervised machine learning algorithms were used to build prediction models. We evaluated the performances of differentiation, calibration, and clinical utility of all models to determine the optimal model. The Shapley Additive Explanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) techniques were performed to interpret the optimal model.

Results

The included patients were randomly divided into the training set (1070 patients) and the testing set (459 patients) at a ratio of 70:30. The intersection results of three feature selection approaches yielded 16 predictors. The eXtreme gradient boosting (XGBoost) model achieved the highest area under the receiver operating characteristic curve (AUC) (0.941) and accuracy (0.913) among various models. The calibration curve and decision curve analysis (DCA) both suggested that the XGBoost model outperformed other models. The SHAP summary plots illustrated the top 6 features with the greatest importance, including higher N-terminal pro-B-type natriuretic peptide (NT-proBNP) and C-reactive protein (CRP), lower level of CD4 + T cell, lymphocyte and serum sodium, and positive serum (1,3)-β-D-glucan test (G test).

Conclusion

We successfully developed, evaluated and explained a machine learning-based model for predicting ICU admission in patients with CAP and CTD. The XGBoost model could be clinical referenced after external validation and improvement.

Background

Community-acquired pneumonia (CAP) is an acute lung parenchyma infection caused by bacteria, viruses or fungi acquired outside the hospital. It is one of the most common infectious diseases in clinical practice. Meanwhile, it has been recognized as a major health problem and one of the leading causes of morbidity and mortality in all age groups worldwide [1,2,3]. Connective tissue disease (CTD) represents a heterogenous group of systemic autoimmune diseases that affect multiple organs, including idiopathic inflammatory myopathies (IIM), rheumatoid arthritis (RA), Sjogren’s syndrome (SS), etc. It is characterized by the presence of circulating autoantibodies and the self-directed chronic inflammation leading to collagen deposition, tissue damage and fibrosis, and ultimately target organs failure. The prevalence and disease burden of CTD continue to increase significantly during past years [4]. Patients with CTD have a higher risk of CAP than the general population according to prior reports [5, 6]. Besides, patients with CTD, especially those with high disease activity, are predisposed to suffer from unfavorable outcomes of CAP compared with those without CTD [7, 8]. Reversely, pneumonia is reported to be the leading cause of intensive care unit (ICU) admission in CTD patients, followed by acute exacerbation of CTD [9]. The underlying mechanisms include immunosuppressive medication use, immune system dysfunction, related comorbidities, etc. Thus, much attention needs to be paid to those patients with CAP and CTD.

It is estimated that 23% of patients who are hospitalized with CAP require ICU admission [10]. Clinical deterioration may occur after hospital admission in CAP patients. However, admission to a non-ICU setting with later transfer to ICU may be associated with poor outcomes and increased mortality [11, 12]. Therefore, apart from timely initiations of appropriate antibiotics and respiratory support, predicting the likelihood of ICU admission is another important issue in the management of CAP patients. Traditional risk score systems such as pneumonia severity index (PSI) and CURB-65 (confusion, uremia, increased respiratory rate, hypotension, and age 65 years or older) have been widely used to facilitate choosing appropriate site-of-care and predicting the prognosis for patients with CAP [13, 14]. Meanwhile, the 2007 Infectious Diseases Society of America / American Thoracic Society criteria for defining severe community-acquired pneumonia (IDSA/ATS 2007 criteria) remains the most pragmatic tool to predict ICU admission in CAP [15]. However, it is reported that the predictive values of PSI and CURB-65 in patients with CAP and CTD were limited [16]. Unfortunately, there are few studies on the individualized risk stratification of them. To our knowledge, there is no specific prediction model for ICU admission on patients with CAP and CTD so far.

Machine learning, a branch of artificial intelligence, can handle plenty of high-dimensional data, analyze complex relationships and identify optimal predictors of clinical outcomes. Over the last few years, the prediction models for both medical diagnosis and prognosis assessment of various diseases have significantly benefited from diverse cutting-edge machine learning algorithms [17, 18]. They are more flexible and may have superior predictive powers than traditional linear models which use variables with statistical significance in some specific diseases according to previous reports [19, 20]. Furthermore, previous evidence demonstrated that machine learning algorithms had good performances in recognizing and predicting the need for intensive care in the initial assessment of patients [21]. In this study, we aimed to establish a machine learning-based model using noninvasive and readily available clinical parameters for predicting the need for ICU admission during hospitalization in patients with CAP and CTD.

Methods

Study designs

This was a single-center retrospective observational study on pneumonia patients admitted into West China Hospital of Sichuan University in China between November 2008 and November 2021. The study protocol was approved by the West China Hospital of Sichuan University Biomedical Research Ethics Committee (No.2022 − 733) and was conducted in accordance with the amended Declaration of Helsinki. The requirement for written informed consent from patients was waived due to retrospective design. All personal information of the patients had already been de-identified during the analysis.

Patients and data

Patients were eligible for inclusion if they were diagnosed with CAP and CTD during admission and hospitalization. The CAP was defined as a new pulmonary infiltrate on chest X-ray or computed tomography (CT) and at least one of the following acute lower respiratory infection symptoms: fever, productive cough, purulent expectoration, dyspnea, pleuritic chest pain, focal chest signs on auscultation, or abnormal peripheral white cell counts [22]. CTD included polymyositis/dermatomyositis (PM/DM), rheumatoid arthritis (RA), Sjogren’s syndrome (SS), systemic sclerosis (SSc), systemic lupus erythematosus (SLE), anti-synthetase syndrome (ASS), undifferentiated connective tissue disease (UCTD) and mixed connective tissue disease (MCTD) in present study. The diagnosis of each type of CTD was established based on corresponding criteria from related clinical guidelines or previous studies [23,24,25,26,27,28]. Individuals were excluded from the study if they were: (1) under 18 years old; (2) pregnant; (3) having incomplete clinical records. Besides, only the first admission was considered if the patient had multiple admissions during study period.

The clinical data of demographics, CTD types, comorbidities, vital signs and laboratory results during the first 24 h of hospitalization were collected. The first value was used for analysis if any data was repeated. The clinical data were reviewed and collected by two experienced physicians using a standardized data collection form independently. Any disagreement was solved by a third physician or team discussion until a consensus was reached. The primary outcome was the need for ICU admission during hospitalization.

Feature selection and model construction

The flowchart of this study was shown in Fig. 1. The included patients were randomly divided into two sets (70% in training set and 30% in testing set) by simple random sampling. The training set was used for developing the models and the testing set was utilized for evaluating the performances of models.

Fig. 1
figure 1

Study flow chart. CAP: community-acquired pneumonia; CTD: connective tissue disease; Lasso: least absolute shrinkage and selection operator; LR: logistic regression; CART: classification and regression tree; RF: random forest; SVM: support vector machine; KNN: k-nearest neighbors; DT: decision tree; GBM: gradient boosting machine; XGBoost: eXtreme gradient boosting; NB: naive bayes; AUC: area under the receiver operating characteristic curve; DCA: decision curve analysis; SHAP: Shapley additive explanations; LIME: Local Interpretable Model-Agnostic Explanations

The baseline variables were screened to identify potential predictors in training set via three independent methods, including univariate analysis, least absolute shrinkage and selection operator (Lasso) regression and Boruta algorithm [29]. The univariate analysis is a classic selection method based on P values. The variables with P value < 0.05 were regarded as statistically significant and were extracted. The Lasso regression model identifies the features having non-zero coefficients as potential predictors. It can eliminate multicollinearity and avoid over-fitting of variables. We used Lasso regression combined with 10-fold cross-validation to analyze the baseline high-dimensional data and screen variables. Boruta algorithm is a feature selection method that depends on the variable importance measure. To be specific, it identifies the most important features by comparing the Z-values of candidate features with that of “shadow features”. The Z-value of each real feature is obtained based on a random forest (RF) classifier in each iteration, and the Z-value of each shadow feature is created by random shuffling of the real features. It can iteratively remove features that have been proved to be less relevant than random shadow features. Thus, only those relevant features with Z-values higher than the maximal Z-value of shadow features by multiple internal bootstraps, are retained. Overlapping variables by intersecting univariate analysis, Lasso and Boruta were obtained to establish prediction models.

Nine supervised machine learning algorithms, including logistic regression (LR), classification and regression tree (CART), RF, support vector machine (SVM), k-nearest neighbors (KNN), decision tree (DT), gradient boosting machine (GBM), eXtreme gradient boosting (XGBoost) and naive bayes (NB), were used to construct prediction models. The 5-fold cross-validation was applied to ensure the stability and accuracy of the models.

Model assessment

We evaluated the performances of differentiation, calibration and clinical utility of nine models to identify the optimal model. The receiver operating characteristic (ROC) curves were plotted and the areas under the ROC curve (AUCs) were calculated to quantify their discriminative performances. The significant differences of AUCs among models were tested using Delong’s test. Moreover, accuracy, sensitivity, specificity and Kappa value were used as additional descriptions of the predictive abilities of the models [30]. Then, the precision-recall (PR) curves, which plot the positive prediction value (PPV) against the true positive rate (TPR) across all thresholds, were used to further evaluate the discrimination capabilities of the models. Besides, we also calculated the AUCs of three traditional predictive tools, including IDSA/ATS 2007 criteria, PSI and CURB-65, in predicting ICU admission of CAP patients with CTD. The calibration, which represents the agreement between predicted outcomes and actual outcomes, was performed via a bootstrap method with 1000 resamples and assessed by a calibration plot. The decision curve analysis (DCA) based on net benefits at different threshold probabilities was drawn to evaluate the model’s clinical validity and utility.

The performances of the machine learning-based models may be affected by class imbalance due to the low incidence of positive events (ICU admission) in this study. Thus, we performed complementary analyses using up-sampling, down-sampling and synthetic minority oversampling technique (SMOTE) approaches. Sampling techniques are often used to generate balanced datasets (50/50 majority–minority splits) in the training set by up-sampling (over-sampling) or down-sampling (under-sampling). The up-sampling algorithm improves the sensitivity of the minority by synthesizing the minority samples. The down-sampling algorithm achieves the balance of two classes by deleting the majority samples. The SMOTE is an efficient algorithm for addressing class imbalance and reducing over-fitting of the model, employing k-neighbor synthesis to synthesize new minority samples.

Model interpretation

The Shapley Additive Explanation (SHAP) values of features were evaluated to determine each characteristic’s contribution and significance based on its impact on the final classification outcome. The high SHAP value indicates great impact of a feature on model output. We reported the feature importance for interpreting the optimal model. At last, the Local Interpretable Model-Agnostic Explanations (LIME) technique was performed to further explain the model [31,32,33].

Statistical analysis

The clinical characteristics of patients were expressed as the mean ± standard deviation (SD) for continuous variables with normal distribution, the median (interquartile range [IQR]) for continuous variables with non-normal distribution, and the frequency (percentage) for categorical variables. The labels for categorical variables were coded as “1″ for “Yes″ and “0″ for “No″ during statistical analysis. Independent sample t test or Kruskal–Wallis test was used to analyze the differences between the continuous variables as appropriate. The chi-square test or Fisher exact test was performed to analyze the categorical variables. A two-sided P < 0.05 was considered statistically significant. We removed variables missing over 30% of observations to ensure the accuracy of study. After that, we employed the multiple imputation method dealing with missing values.

In this study, R software version 4.2.1 (R Foundation for Statistical Computing) was used to implement the statistical analysis. The Lasso and Boruta analyses were performed using the R package “glmnet” and “Boruta”. The R package “caret” was used to train the models. By default, hyperparameter tuning for each model was performed automatically by caret using a standard grid search approach. The SHAP and LIME methods were completed using the “shapviz” and “lime” package.

Results

Baseline characteristics

A total of 1626 individuals with CAP and CTD were admitted into our hospital. Among them, 97 individuals were excluded from analysis according to exclusion criteria. The included patients were divided into the training set (1070 patients) and the testing set (459 patients). In the training and testing set, the median age was 56 (IQR: 47, 66) and 57 (IQR:49, 66) years, and 334 (31.2%) and 158 (34.4%) patients were men, respectively. PM/DM was the most common type of CTD (28.9%), and interstitial lung disease (ILD) was observed in 65.1% of all patients. The top 3 comorbidities were hypertension (21.0%), diabetes (12.9%) and congestive heart failure (10.3%). The ICU admission rate and hospital mortality were 292 (27.3%) and 161 (15.0%) in the training set, and 121 (26.4%) and 76 (16.6%) in the testing set. The detailed features were summarized in Table 1, which suggested that the baseline characteristics and clinical outcomes of patients did not differ significantly between two sets roughly.

Table 1 Baseline characteristics of patients

Development of model

As shown in Supplementary Table S1, in the univariate analysis 36 variables were significantly different (P values < 0.05) between patients admitted and not admitted into ICU in training set. Among all baseline characteristics, the Boruta algorithm effectively selected 32 potential predictors according to the Z-values (importances) (Fig. 2A and B). Meanwhile, the optimal lambda value was 0.003 by using the Lasso algorithm and 33 variables were selected as potential predictors (Fig. 2C and D). The variables identified by Boruta and Lasso were listed in detail in Supplementary Table S2.

Fig. 2
figure 2

Features selected by Boruta, Lasso and univariate analysis. A and B. Variables selected by Boruta algorithm. The minimum, average and maximum shadow score are shown in blue. In terms of the score of feature importance, the 32 variables in green are regarded as important variables, while yellow are neutral and red are rejected. C. The Lasso regression coefficient profiles of all baseline characteristics. D. The optimal lambda selection in the Lasso regression with 10-fold cross-validation. Misclassification errors of different variables against log(lambda) are revealed. The two vertical dashed lines represent the optimal value under the minimum criterion and 1-SE criterion, respectively. The “lambda” is the tuning parameter. A total of 33 predictors with non-zero coefficients are identified. E. The Venn diagram of features selected by Boruta, Lasso and univariate analysis. The intersection results of three methods yield 16 clinical characteristics. SE, standard error; Lasso, least absolute shrinkage and selection operator

The intersection results of three independent methods were considered to be the optimal features. Thus, a total of 16 clinical characteristics, including N-terminal pro-B-type natriuretic peptide (NT-proBNP), CD4+T cell, lymphocyte, C-reactive protein (CRP), positive serum (1,3)-β-D-glucan test (G test), serum sodium, ratio of arterial oxygen partial pressure (mmHg) to fractional inspired oxygen (PF ratio), neutrophil, heart rate (HR), chronic obstructive pulmonary disease (COPD), serum glucose, pH, high density lipoprotein cholesterol (HDL-C), albumin, platelet and confusion, were served as predictors to establish machine learning-based prediction models (Fig. 2E).

Evaluation of model

All models had accuracy values and AUCs of 0.80 and above in the testing set (Table 2). The ROC curves were shown in Fig. 3A. The XGBoost model achieved the highest AUC (0.941) and accuracy (0.913), suggesting favorable and robust discrimination. LR model was usually used as a traditional baseline model. Thus, the AUCs of other models were compared to that of the XGBoost model and LR model using Delong’s test. The Delong test P value (vs. XGBoost model) were all under 0.05 except for RF model. Meanwhile, the Brier score, Kappa value, sensitivity, specificity, positive predict value and negative predict value of XGBoost model were all superior or similar to that of other models. Furthermore, the XGBoost model also achieved the highest area under the PR curve (0.897) among nine models (Supplementary Figure S1).

Table 2 Performance of nine machine learning-based models for predicting ICU admission in the testing set
Fig. 3
figure 3

Machine learning-based models used to predict ICU admission in patients with CAP and CTD. (A) ROC curves for the machine learning-based models used to predict ICU admission. (B) ROC curves for the traditional risk scores used to predict ICU admission. (C) Calibration curves of the machine learning-based models. (D) DCA of the machine learning-based models. ROC: receiver operating characteristic; DCA: Decision curve analysis; IDSA/ATS 2007 criteria: 2007 Infectious Diseases Society of America / American Thoracic Society criteria for defining severe community-acquired pneumonia; PSI: pneumonia severity index; CURB-65: confusion, uremia, increased respiratory rate, hypotension, and age 65 years or older

Among the traditional predictive tools, the IDSA/ATS 2007 criteria had higher discriminatory power for ICU admission (AUC: 0.817) compared with PSI (AUC: 0.697) and CURB-65 (AUC: 0.607) (Fig. 3B). However, they did not perform as well as the XGBoost model. The calibration curve and DCA were shown in Fig. 3C and D, both suggesting that the XGBoost model performed best among nine models. Thus, the XGBoost model was considered to be the optimal model. The AUCs of models with up-sampling, down-sampling and SMOTE were found to be similar to the original AUCs in the testing set, as listed in detail in Supplementary Table S3.

Model interpretation

The SHAP values could provide more insights into how the XGBoost model predicted outcomes. The feature importance was summarized by the SHAP summary plot in Fig. 4A. Figure 4B depicted the standard bar chart of the mean absolute SHAP value for each predictor in descending order. The force plots provided personalized feature attributions using two representative examples and illustrated how the SHAP could be used to explain individual model predictions, as shown in Fig. 4C (a patient actually not admitted into ICU) and D (a patient actually admitted into ICU). It started at the base value, that is, the average of all predictions. And then, each input predictor at different level could increase or decrease the predicted probability of outcome. The lengths of arrows reflected the SHAP values for these features. Finally, the predicted output value of model was obtained for a particular patient.

Fig. 4
figure 4

SHAP plots. (A) SHAP summary plot shows feature importance for each predictor of the XGBoost model in descending order. The upper predictors are more important to the model’s predictive outcome. A dot is created for each feature attribution value for the XGBoost model of each patient. The further away a dot is from the baseline SHAP value of zero, the stronger it effects the model output. Dots are colored according to the values of features. Yellow represents higher feature values and red represents lower feature values. (B) Bar chart of the mean absolute SHAP value for each predictor of the XGBoost model in descending order. C and D. The force plots provide personalized feature attributions using two representative examples. C: a patient actually not admitted into ICU; D: a patient actually admitted into ICU. SHAP: Shapley additive explanations; ICU: intensive care unit;

We also quantitatively visualized the relationships between main risk factors and outcomes. The SHAP dependence plots illustrated the top 6 features with the greatest importance (Fig. 5A-F). It demonstrated that higher NT-proBNP and CRP values, lower levels of CD4 + T cells, lymphocyte and serum sodium, and positive G test contributed to an elevated risk of ICU admission. Meanwhile, the cutoff value for each variable could also be determined to discriminate between high-risk (SHAP value > 0) and low-risk (SHAP value < 0) of ICU admission.

Fig. 5
figure 5

The SHAP dependence plots for the top 6 features with the greatest importance. A. N-terminal pro-B-type natriuretic peptide (NT-proBNP). B. CD4 + T cell. C. lymphocyte. D. C-reactive protein (CRP). E. positive serum (1,3)-β-D-glucan test (G test). The “1” represents “Yes″ and the “0″ represents “No″. F. serum sodium. The SHAP dependence plots show how a single feature affects the output of the XGBoost model. SHAP values for specific features exceed zero, representing an increased risk of ICU admission. SHAP: Shapley additive explanations; ICU: intensive care unit;

We selected two random samples from the testing set and used the LIME algorithm to further explain the individual ICU admission forecast. Supplementary Figure S2A depicts a case of patient admitted into ICU. The expected probability of ICU admission was 88% according to the XGBoost model. Supplementary Figure S2B described a case of patient not admitted into ICU. The expected probability of no ICU admission was 60%.

Discussion

A novel clinically available tool that provides an early assessment and rapid prediction of ICU admission is warranted considering that risk stratification of patients with CAP and CTD remains challenging caused by heterogeneous disease progression. A reliable and accurate prediction model would help clinicians identify specific patients who require special attentions and allocate resources appropriately, which is crucial to timely and effective intervention for prognosis improvement. Machine learning has gained popularity and is increasingly utilized in various domains of biomedicine. To our knowledge, this study is the first to develop a useful machine learning-based model for predicting the risk of ICU admission in patients with CAP and CTD. The promising performance of model was verified by the testing set. The use of advanced machine learning-based models is often limited in clinical practice because of the lack of clear interpretation of their decision-making process. Thus, we used SHAP and LIME to explain what features of the patient are responsible for the given prediction, avoiding the obstacle of “black-box” nature of machine learning algorithms. We found that the NT-proBNP, CD4 + T cell, lymphocyte, CRP, positive G test and serum sodium were the top 6 features of the XGBoost model in terms of their abilities to predict ICU admission.

In our study cohort, the median PSI was only 80 (IQR: 63,103) points, which was slightly lower than that of previous similar studies of CAP patients with a median PSI of about 100 points [34,35,36,37,38]. This may be caused by the differences of included population. The median age, the proportion of male and the number of comorbidities of our cohort were all lower than that of previous cohorts. However, the rate of ICU admission in our study was equal to that in previous reports. Thus, the adverse impact of CTD on the clinical outcomes of CAP should not be ignored. We believed that the CAP patients with CTD might represent a specific subgroup deserving of additional investigations. However, there is still a lack of personalized accurate assessment to guide optimal clinical decisions for them. Li et al. have conducted a similar study including 368 pneumonia patients with CTD treated with glucocorticoids or immunosuppressants [39]. They constructed a prognostic nomogram based on five variables (fever, cyanosis, blood urea nitrogen, ganciclovir treatment and anti-pseudomonas treatment) for predicting the 90-day mortality. The C index of training cohort and validation cohort was 0.808 and 0.762, respectively. Compared to their research, our study had a larger sample size and more predictors. Furthermore, the current study period was more than 10 years with the primary outcome of ICU admission.

The predicting values of these identified predictors are deemed worthy of clinicians’ attention. They can be used to clinically assist physicians to identify high-risk patients at an early stage. The main predictors can be supported by previous studies and theories. NT-proBNP is secreted by the heart in response to excessive stretching of cardiomyocytes. Thus, it is widely used as diagnostic biomarkers for ventricular insufficiency, heart failure and cardiac dysfunction. Previous evidence showed that NT-proBNP was positively correlated with the severity of CAP and remained an independent mortality predictor (hazard ratio [HR]: 1.004, 95% confidence interval [95%CI]: 1.00-1.01) in multivariable analysis [40, 41]. Meanwhile, in patients with CTD, elevated NT-proBNP is considered to be associated with pulmonary arterial hypertension or even heart involvement which may lead to a significant poor prognosis [42,43,44]. Although treatment with glucocorticoids or immunosuppressive drugs were not recorded, we collected and analyzed the count of lymphocyte subsets instead. It is an objective indicator of the immunosuppressive status of patients. In our study, the CD4 + T cell and lymphocyte were both identified as predictors. This result further confirmed that immunosuppression is indeed a critical indicator of disease severity. Meanwhile, it demonstrated that in various subsets of lymphocyte, low CD4 + T cell was mainly related to ICU admission in patients with CAP and CTD. Wang et al. also demonstrated CD4 + T cells (HR: 0.986, 95%CI: 0.978–0.994), rather than CD8 + T cell, was an independent risk factor for severe CAP in elderly and frailty patients [45]. CRP, a widely utilized inflammatory biomarker, is known to be associated with the severity and mortality of CAP. Meanwhile, it is also a marker for disease activity and risk of death in various types of CTD [46,47,48]. The G test is a serum pan-fungal marker used to detect the majority of pathogenic fungi, including Aspergillus spp., Candida spp., etc. It is highly accurate for diagnosing invasive fungal infections [49].

However, caution is needed in clinical use of some unexpected results of our study. In disagreement with prior studies, the age and sex, two traditional risk factors, were not identified as predictors in our study. We suspected that, as mentioned above, the median age and the proportion of male in present study were lower compared with that of prior researches, which might be partly responsible for this result. However, these two factors should also be considered by the clinicians, especially when treating those old male patients. Another unexpected result is that, the proportion of ILD was comparable between patients admitted into ICU and those not admitted into ICU. ILD was generally considered as a negative prognostic factor in CAP [50]. However, the studies from Li et al. and Liang et al. also showed that ILD was not independently associated with mortality in pneumonia patients with CTD [5, 39]. More prospective researches are needed to clarify the impact of ILD on CAP patients.

Gearhart et al. have found that assigning differential weights to variables could generate a novel risk score with higher accuracy than original IDSA/ATS 2007 criteria for predicting ICU admission in CAP patients [51]. Consistently, in present study the XGBoost model yielded considerably improved predictions compared with traditional tools. As a highly efficient gradient tree boosting technique, XGBoost is utilized in a variety of medical researches. It can optimally handle diverse variable types and imbalanced datasets, including high-order interactions, non-linearities, discontinuities, etc. Besides, it is resistant to outliers in the predictors and the potential multicollinearity among them. XGBoost uses advanced regularization (L1& L2) to prevent overfitting, optimize prediction model, and increase model’s generalization ability [52,53,54,55]. Xu et al. recruited 2302 adults with CAP and found that XGBoost model based on common clinical features had the best performance with AUC of 0.801 in the prediction of ICU admission among various machine learning algorithms [56]. Besides, among patients with COVID-19, XGBoost model was also confirmed to be an excellent prediction model for predicting ICU admission [57, 58].

Our study had some limitations. First, it was a single-center retrospective study, and the selection bias were difficult to completely avoid. Second, this study was only validated using an internal testing set due to the lack of available external validation cohort. The generalizability and robustness of model may be compromised. Third, some data were incomplete because of the retrospective design, including the ILD patterns, such as nonspecific interstitial pneumonia (NSIP), organizing pneumonia (OP) or usual interstitial pneumonia (UIP), the pulmonary function test results, the CTD disease activities, the titers of auto-antibodies, etc. Therefore, they were not included into the analysis. Last, the model was established by baseline characteristics, and the therapies and changes of features after admission were not considered. However, our early investigation may provide a basis for future studies.

Conclusions

In conclusion, we successfully developed, evaluated and explained a machine learning-based model for predicting ICU admission in patients with CAP and CTD. The XGBoost model showed the optimal performance among nine algorithms. The model could be clinical referenced after external validation and improvement.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ICU:

Intensive care unit

CAP:

Community-acquired pneumonia

CTD:

Connective tissue disease

Lasso:

Least absolute shrinkage and selection operator

SHAP:

Shapley Additive Explanations

LIME:

Local Interpretable Model-Agnostic Explanations

ILD:

Interstitial lung disease

PSI:

Pneumonia severity index

LR:

Logistic regression

CART:

Classification and regression tree

RF:

Random forest

SVM:

Support vector machine

KNN:

K-nearest neighbors

DT:

Decision tree

GBM:

Gradient boosting machine

XGBoost:

EXtreme gradient boosting

NB:

Naive bayes

AUC:

Area under the receiver operating characteristic curve

DCA:

Decision curve analysis

SMOTE:

Synthetic minority oversampling technique

IQR:

Interquartile range

NT-proBNP:

N-terminal pro-B-type natriuretic peptide

CRP:

C-reactive protein

G test:

Serum (1,3)-β-D-glucan test

PF ratio:

ratio of arterial oxygen partial pressure (mmHg) to fractional inspired oxygen

HR:

Heart rate

COPD:

Chronic obstructive pulmonary disease

HDL-C:

High density lipoprotein cholesterol

PM/DM:

Polymyositis / Dermatomyositis

RA:

Rheumatoid arthritis

SS:

Sjogren’s syndrome

SSc:

Systemic sclerosis

SLE:

Systemic lupus erythematosus

ASS:

Anti-synthetase syndrome

UCTD:

Undifferentiated connective tissue disease

MCTD:

Mixed connective tissue disease

References

  1. Aliberti S, Dela Cruz CS, Amati F, Sotgiu G, Restrepo MI. Community-acquired pneumonia. Lancet. 2021;398(10303):906–19. https://doi.org/10.1016/S0140-6736(21)00630-9.

    Article  PubMed  Google Scholar 

  2. Rothberg MB. Community-Acquired Pneumonia. Ann Intern Med. 2022;175(4):ITC49–64. https://doi.org/10.7326/AITC202204190.

    Article  PubMed  Google Scholar 

  3. Torres A, Cilloniz C, Niederman MS, et al. Pneumonia. Nat Rev Dis Primers. 2021;7(1):25. https://doi.org/10.1038/s41572-021-00259-0. Published 2021 Apr 8.

    Article  PubMed  Google Scholar 

  4. Conrad N, Misra S, Verbakel JY, et al. Incidence, prevalence, and co-occurrence of autoimmune disorders over time and by age, sex, and socioeconomic status: a population-based cohort study of 22 million individuals in the UK. Lancet. 2023;401(10391):1878–90. https://doi.org/10.1016/S0140-6736(23)00457-9.

    Article  PubMed  Google Scholar 

  5. Liang J, Sun C, Xu L, Xu G, Cao H, Lin J. Community-acquired pneumonia and hospital-acquired pneumonia in adult patients with idiopathic inflammatory myopathy: outcome and antibiotic therapy. Rheumatol Ther. 2021;8(1):255–72. https://doi.org/10.1007/s40744-020-00268-7.

    Article  PubMed  Google Scholar 

  6. Vinogradova Y, Hippisley-Cox J, Coupland C. Identification of new risk factors for pneumonia: population-based case-control study. Br J Gen Pract. 2009;59(567):e329–38. https://doi.org/10.3399/bjgp09X472629.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Coyne P, Hamilton J, Heycock C, Saravanan V, Coulson E, Kelly CA. Acute lower respiratory tract infections in patients with rheumatoid arthritis. J Rheumatol. 2007;34(9):1832–6.

    CAS  PubMed  Google Scholar 

  8. Holland-Fischer M, Thomsen RW, Tarp U, Nørgaard M. Prognosis of pneumonia in patients with rheumatoid arthritis: the role of medication and disease activity prior to admission a population-based cohort study. RMD Open. 2020;6(1):e001102. https://doi.org/10.1136/rmdopen-2019-001102.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Peng JM, Du B, Wang Q, et al. Dermatomyositis and Polymyositis in the Intensive Care Unit: a single-Center Retrospective Cohort Study of 102 patients. PLoS ONE. 2016;11(4):e0154441. https://doi.org/10.1371/journal.pone.0154441. Published 2016 Apr 26.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Cavallazzi R, Furmanek S, Arnold FW, et al. The Burden of Community-Acquired Pneumonia requiring admission to ICU in the United States. Chest. 2020;158(3):1008–16. https://doi.org/10.1016/j.chest.2020.03.051.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Espinoza R, Silva JRLE, Bergmann A, et al. Factors associated with mortality in severe community-acquired pneumonia: a multicenter cohort study. J Crit Care. 2019;50:82–6. https://doi.org/10.1016/j.jcrc.2018.11.024.

    Article  PubMed  Google Scholar 

  12. Restrepo MI, Mortensen EM, Rello J, Brody J, Anzueto A. Late admission to the ICU in patients with community-acquired pneumonia is associated with higher mortality. Chest. 2010;137(3):552–7. https://doi.org/10.1378/chest.09-1547.

    Article  PubMed  Google Scholar 

  13. Tokioka F, Okamoto H, Yamazaki A, Itou A, Ishida T. The prognostic performance of qSOFA for community-acquired pneumonia. J Intensive Care. 2018;6:46. https://doi.org/10.1186/s40560-018-0307-7. Published 2018 Aug 8.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Zaki HA, Hamdi Alkahlout B, Shaban E, et al. The battle of the Pneumonia predictors: a Comprehensive Meta-Analysis comparing the Pneumonia Severity Index (PSI) and the CURB-65 score in Predicting Mortality and the need for ICU support. Cureus. 2023;15(7):e42672. https://doi.org/10.7759/cureus.42672. Published 2023 Jul 29.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Torres A, Chalmers JD, Dela Cruz CS, et al. Challenges in severe community-acquired pneumonia: a point-of-view review. Intensive Care Med. 2019;45(2):159–71. https://doi.org/10.1007/s00134-019-05519-y.

    Article  PubMed  PubMed Central  Google Scholar 

  16. García-Guevara G, Ríos-Corzo R, Díaz-Mora A, et al. Pneumonia in patients with systemic lupus erythematosus: Epidemiology, microbiology and outcomes. Lupus. 2018;27(12):1953–9. https://doi.org/10.1177/0961203318799207.

    Article  PubMed  Google Scholar 

  17. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE Rev Biomed Eng. 2021;14:116–26. https://doi.org/10.1109/RBME.2020.3007816.

    Article  PubMed  Google Scholar 

  18. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2022;23(1):40–55. https://doi.org/10.1038/s41580-021-00407-0.

    Article  CAS  PubMed  Google Scholar 

  19. Khene ZE, Bigot P, Doumerc N, et al. Application of machine learning models to Predict Recurrence after Surgical Resection of Nonmetastatic Renal Cell Carcinoma. Eur Urol Oncol. 2023;6(3):323–30. https://doi.org/10.1016/j.euo.2022.07.007.

    Article  PubMed  Google Scholar 

  20. Yu YD, Lee KS, Man Kim J, et al. Artificial intelligence for predicting survival following deceased donor liver transplantation: retrospective multi-center study. Int J Surg. 2022;105:106838. https://doi.org/10.1016/j.ijsu.2022.106838.

    Article  PubMed  Google Scholar 

  21. Chen Y, Chen H, Sun Q, et al. Machine learning model identification and prediction of patients’ need for ICU admission: a systematic review. Am J Emerg Med. 2023;73:166–70. https://doi.org/10.1016/j.ajem.2023.08.043.

    Article  PubMed  Google Scholar 

  22. Metlay JP, Waterer GW, Long AC, et al. Diagnosis and treatment of adults with community-acquired Pneumonia. An Official Clinical Practice Guideline of the American Thoracic Society and Infectious Diseases Society of America. Am J Respir Crit Care Med. 2019;200(7):e45–67. https://doi.org/10.1164/rccm.201908-1581ST.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Bohan A, Peter JB. Polymyositis and dermatomyositis (first of two parts). N Engl J Med. 1975;292(7):344–7. https://doi.org/10.1056/NEJM197502132920706.

    Article  CAS  PubMed  Google Scholar 

  24. Aletaha D, Neogi T, Silman AJ, et al. 2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League against Rheumatism collaborative initiative. Arthritis Rheum. 2010;62(9):2569–81. https://doi.org/10.1002/art.27584.

    Article  PubMed  Google Scholar 

  25. Vitali C, Bombardieri S, Moutsopoulos HM, et al. Assessment of the European classification criteria for Sjögren’s syndrome in a series of clinically defined cases: results of a prospective multicentre study. The European Study Group on Diagnostic Criteria for Sjögren’s syndrome. Ann Rheum Dis. 1996;55(2):116–21. https://doi.org/10.1136/ard.55.2.116.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Preliminary criteria for the classification of systemic sclerosis (scleroderma). Subcommittee for scleroderma criteria of the American Rheumatism Association Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum. 1980;23(5):581–90. https://doi.org/10.1002/art.1780230510.

    Article  Google Scholar 

  27. Wolf L, Sheahan M, McCormick J, Michel B, Moskowitz RW. Classification criteria for systemic lupus erythematosus. Frequency in normal patients. JAMA. 1976;236(13):1497–9.

    Article  CAS  PubMed  Google Scholar 

  28. Sharp GC, Irvin WS, Tan EM, Gould RG, Holman HR. Mixed connective tissue disease–an apparently distinct rheumatic disease syndrome associated with a specific antibody to an extractable nuclear antigen (ENA). Am J Med. 1972;52(2):148–59. https://doi.org/10.1016/0002-9343(72)90064-2.

    Article  CAS  PubMed  Google Scholar 

  29. Sanchez-Pinto LN, Venable LR, Fahrenbach J, Churpek MM. Comparison of variable selection methods for clinical predictive modeling. Int J Med Inf. 2018;116:10–7. https://doi.org/10.1016/j.ijmedinf.2018.05.006.

    Article  Google Scholar 

  30. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb). 2012;22(3):276–82.

    Article  PubMed  Google Scholar 

  31. Gaur Loveleen B, Mohan BS, Shikhar J, Nz M, Shorfuzzaman. and Mehedi Masud. 2023. Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer’s Disease. ACM Trans. Multimedia Comput. Commun. Appl. 20, 2, Article 41 (February 2024), 16 pages. https://doi.org/10.1145/3527174.

  32. Bhandari M, Shahi TB, Siku B, Neupane A. Explanatory classification of CXR images into COVID-19, Pneumonia and Tuberculosis using deep learning and XAI. Comput Biol Med. 2022;150:106156. https://doi.org/10.1016/j.compbiomed.2022.106156.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Patel AN, Murugan R, Srivastava G, et al. An explainable transfer learning framework for multi-classification of lung diseases in chest X-rays. Alexandria Eng J. 2024;328–43. https://doi.org/10.1016/j.aej.2024.04.072.

  34. Chen S, Jiang J, Su M, et al. A nomogram based on the expression level of angiopoietin-like 4 to predict the severity of community-acquired pneumonia. BMC Infect Dis. 2023;23(1):677. https://doi.org/10.1186/s12879-023-08648-4. Published 2023 Oct 11.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Jones BE, Ying J, Nevers M, et al. Computerized mortality prediction for community-acquired pneumonia at 117 Veterans Affairs Medical Centers. Ann Am Thorac Soc. 2021;18(7):1175–84. https://doi.org/10.1513/AnnalsATS.202011-1372OC.

    Article  PubMed  Google Scholar 

  36. Chen L, Xue J, Zhao L, et al. Lysophosphatidylcholine acyltransferase level predicts the severity and prognosis of patients with community-acquired pneumonia: a prospective multicenter study. Front Immunol. 2024;14:1295353. https://doi.org/10.3389/fimmu.2023.1295353. Published 2024 Jan 8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Ma CM, Wang N, Su QW, et al. Age, pulse, Urea, and Albumin score: a Tool for Predicting the short-term and long-term outcomes of community-acquired pneumonia patients with diabetes. Front Endocrinol (Lausanne). 2022;13:882977. https://doi.org/10.3389/fendo.2022.882977. Published 2022 Jun 1.

    Article  PubMed  Google Scholar 

  38. Lee JS, Ko SH, Lee J, Jeong KY. The relationship between body mass index and N-terminal pro-B-type natriuretic peptide in community-acquired pneumonia. Eur J Clin Nutr. 2021;75(7):1088–98. https://doi.org/10.1038/s41430-020-00817-x.

    Article  CAS  PubMed  Google Scholar 

  39. Li D, Ding L, Luo J, Li QG. Prediction of mortality in pneumonia patients with connective tissue disease treated with glucocorticoids or/and immunosuppressants by machine learning. Front Immunol. 2023;14:1192369. https://doi.org/10.3389/fimmu.2023.1192369. Published 2023 May 25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Li J, Ye H, Zhao L. B-type natriuretic peptide in predicting the severity of community-acquired pneumonia. World J Emerg Med. 2015;6(2):131–6. https://doi.org/10.5847/wjem.j.1920-8642.2015.02.008.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Nowak A, Breidthardt T, Christ-Crain M, et al. Direct comparison of three natriuretic peptides for prediction of short- and long-term mortality in patients with community-acquired pneumonia. Chest. 2012;141(4):974–82. https://doi.org/10.1378/chest.11-0824.

    Article  CAS  PubMed  Google Scholar 

  42. Huang J, An Q, Shi H, Li C, Zhang W, Wang L. Retrospective cohort study of pulmonary arterial hypertension associated with connective tissue disease effect on patients’ prognosis. Clin Rheumatol. 2023;42(11):3131–42. https://doi.org/10.1007/s10067-023-06667-9.

    Article  PubMed  Google Scholar 

  43. Chikhoune L, Brousseau T, Morell-Dubois S, et al. Association between Routine Laboratory Parameters and the severity and progression of systemic sclerosis. J Clin Med. 2022;11(17):5087. https://doi.org/10.3390/jcm11175087. Published 2022 Aug 30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Ross L, Moxey J, Nikpour M. Are troponin and B-type natriuretic peptides useful biomarkers for the diagnosis of systemic sclerosis heart involvement? A systematic literature review. Semin Arthritis Rheum. 2021;51(1):299–309. https://doi.org/10.1016/j.semarthrit.2020.10.016.

    Article  CAS  PubMed  Google Scholar 

  45. Wang J, Pei L, Zhao T, et al. CD4+ T cells related to disease severity in elderly and frailty community-acquired pneumonia patients: a retrospective cohort study. Immun Inflamm Dis. 2023;11(9):e1009. https://doi.org/10.1002/iid3.1009.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Xie H, Zhang D, Wang Y, et al. Risk factors for mortality in patients with anti-MDA5 antibody-positive dermatomyositis: a meta-analysis and systematic review. Semin Arthritis Rheum. 2023;62:152231. https://doi.org/10.1016/j.semarthrit.2023.152231.

    Article  CAS  PubMed  Google Scholar 

  47. Aringer M. Inflammatory markers in systemic lupus erythematosus. J Autoimmun. 2020;110:102374. https://doi.org/10.1016/j.jaut.2019.102374.

    Article  CAS  PubMed  Google Scholar 

  48. Kay J, Upchurch KS. ACR/EULAR 2010 rheumatoid arthritis classification criteria. Rheumatology (Oxford). 2012;51(Suppl 6):vi5–9. https://doi.org/10.1093/rheumatology/kes279.

    Article  PubMed  Google Scholar 

  49. He S, Hang JP, Zhang L, Wang F, Zhang DC, Gong FH. A systematic review and meta-analysis of diagnostic accuracy of serum 1,3-β-D-glucan for invasive fungal infection: focus on cutoff levels. J Microbiol Immunol Infect. 2015;48(4):351–61. https://doi.org/10.1016/j.jmii.2014.06.009.

    Article  CAS  PubMed  Google Scholar 

  50. Dusemund F, Chronis J, Baty F, Albrich WC, Brutsche MH. The outcome of community-acquired pneumonia in patients with chronic lung disease: a case-control study. Swiss Med Wkly. 2014;144:w14013. Published 2014 Sep 3. https://doi.org/10.4414/smw.2014.14013.

  51. Gearhart AM, Furmanek S, English C, Ramirez J, Cavallazzi R. Predicting the need for ICU admission in community-acquired pneumonia. Respir Med. 2019;155:61–5. https://doi.org/10.1016/j.rmed.2019.07.007.

    Article  PubMed  Google Scholar 

  52. Li X, Wu R, Zhao W et al. Machine learning algorithm to predict mortality in critically ill patients with sepsis-associated acute kidney injury. Sci Rep. 2023;13(1):5223. Published 2023 Mar 30. https://doi.org/10.1038/s41598-023-32160-z.

  53. Fan Z, Jiang J, Xiao C et al. Construction and validation of prognostic models in critically Ill patients with sepsis-associated acute kidney injury: interpretable machine learning approach. J Transl Med. 2023;21(1):406. Published 2023 Jun 22. https://doi.org/10.1186/s12967-023-04205-4.

  54. Fahmy AS, Csecs I, Arafati A, et al. An Explainable Machine Learning Approach reveals prognostic significance of right ventricular dysfunction in nonischemic cardiomyopathy. JACC Cardiovasc Imaging. 2022;15(5):766–79. https://doi.org/10.1016/j.jcmg.2021.11.029.

    Article  PubMed  Google Scholar 

  55. Islam MM, Rahman MJ, Rabby MS, et al. Predicting the risk of diabetic retinopathy using explainable machine learning algorithms. Diabetes Metab Syndr. 2023;17(12):102919. https://doi.org/10.1016/j.dsx.2023.102919.

    Article  CAS  PubMed  Google Scholar 

  56. Xu Z, Guo K, Chu W, Lou J, Chen C. Performance of Machine Learning algorithms for Predicting adverse outcomes in Community-Acquired Pneumonia. Front Bioeng Biotechnol. 2022;10:903426. https://doi.org/10.3389/fbioe.2022.903426. Published 2022 Jun 29.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Heldt FS, Vizcaychipi MP, Peacock S, et al. Early risk assessment for COVID-19 patients from emergency department data using machine learning. Sci Rep. 2021;11(1):4200. https://doi.org/10.1038/s41598-021-83784-y. Published 2021 Feb 18.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Deif MA, Solyman AAA, Alsharif MH, Uthansakul P. Automated Triage System for Intensive Care Admissions during the COVID-19 pandemic using hybrid XGBoost-AHP Approach. Sens (Basel). 2021;21(19):6379. https://doi.org/10.3390/s21196379. Published 2021 Sep 24.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

None.

Funding

This work was supported by the Science and Technology Department of Sichuan Province (2023NSFSC1459), West China Hospital Postdoctoral Science Foundation (2023HXBH043), China Postdoctoral Science Foundation (2022M722258) and National Natural Science Foundation of China (82200102).

Author information

Authors and Affiliations

Authors

Contributions

Dong Huang and Linjing Gong gave the study concept and design; all authors acquired, analyzed, and interpreted the data, and critically revised the manuscript for important intellectual content; Dong Huang drafted the manuscript; Dong Huang carried out the statistical analysis; Zongan Liang supervised the study; all authors read and approved the final manuscript.

Corresponding author

Correspondence to Zongan Liang.

Ethics declarations

Ethics approval and consent to participate

Ethics approval was obtained from the West China Hospital of Sichuan University Biomedical Research Ethics Committee (No.2022 − 733), with waiver of written informed consent due to retrospective design. All personal information of the patients was de-identified.

Consent for publication

Consent for publication was provided by all authors.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, D., Gong, L., Wei, C. et al. An explainable machine learning-based model to predict intensive care unit admission among patients with community-acquired pneumonia and connective tissue disease. Respir Res 25, 246 (2024). https://doi.org/10.1186/s12931-024-02874-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12931-024-02874-3

Keywords