Skip to main content

Development and performance assessment of novel machine learning models to predict pneumonia after liver transplantation



Pneumonia is the most frequently encountered postoperative pulmonary complications (PPC) after orthotopic liver transplantation (OLT), which cause high morbidity and mortality rates. We aimed to develop a model to predict postoperative pneumonia in OLT patients using machine learning (ML) methods.


Data of 786 adult patients underwent OLT at the Third Affiliated Hospital of Sun Yat-sen University from January 2015 to September 2019 was retrospectively extracted from electronic medical records and randomly subdivided into a training set and a testing set. With the training set, six ML models including logistic regression (LR), support vector machine (SVM), random forest (RF), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost) and gradient boosting machine (GBM) were developed. These models were assessed by the area under curve (AUC) of receiver operating characteristic on the testing set. The related risk factors and outcomes of pneumonia were also probed based on the chosen model.


591 OLT patients were eventually included and 253 (42.81%) were diagnosed with postoperative pneumonia, which was associated with increased postoperative hospitalization and mortality (P < 0.05). Among the six ML models, XGBoost model performed best. The AUC of XGBoost model on the testing set was 0.734 (sensitivity: 52.6%; specificity: 77.5%). Pneumonia was notably associated with 14 items features: INR, HCT, PLT, ALB, ALT, FIB, WBC, PT, serum Na+, TBIL, anesthesia time, preoperative length of stay, total fluid transfusion and operation time.


Our study firstly demonstrated that the XGBoost model with 14 common variables might predict postoperative pneumonia in OLT patients.


Postoperative pulmonary complications (PPC) adversely affect the clinical course of orthotopic liver transplantation (OLT) and play an important role in poor survival [1]. Postoperative pneumonia is the most common type of PPC, contributing to morbidity, length of hospital stay, and mortality [2]. Identification of patients at high risk of developing postoperative pneumonia is the key to early implementing interventions to prevent its onset or antibiotics to treat bacterial infection [3]. On the contrary, unnecessary and excessive antibiotic use in patients at low risk for postoperative pneumonia can lead to antibiotic resistance and side effects. For instance, recent studies have shown that extensive use of antibiotics for anti-bacteria prophylaxis, multi-drug resistant bacteria in post-transplant patients have been induced [4, 5]. Therefore, it is essential to establish a reliable model for prediction of postoperative pneumonia to tailor preventive interventions and treatments for patients at high-risk of postoperative pneumonia and avoid unnecessary use of antibiotics in low-risk patients.

In recent years, several scoring systems for prediction of postoperative pneumonia have been reported to improve risk-stratification [6], such as the Prestroke Independence, Sex, Age, National Institutes of Health Stroke Scales (ISAN) in acute ischemic stroke patients [7], a pneumonia risk index for patients undergoing major noncardiac surgery [8], and a systemic inflammation score for patients after radical resection of gastric cancer [9, 10]. However, these predictive models are not applicable to liver transplant recipients, mainly due to the preoperative pulmonary condition of patients with end-stage liver disease and the immunosuppressive status of allograft recipients [10]. Currently, an effective risk classification for postoperative pneumonia has not yet been available for liver transplant recipients.

Compared with the traditional scoring systems, machine learning (ML) models have shown better performance in predicting various diseases or clinical conditions [11,12,13]. ML models are usually constructed based on high volume data recorded in the electronic patient record (EPR) systems and its deep learning ability allows ML models to capture complex, nonlinear relationships, even previously unknown correlations in big data, digging deeper into clinical data [14], and shows promising potential in clinical scenes where large amount of data were collected and integrated every day. Recently, Li and colleagues [15] have developed a model using ML methods to predict stroke-associated pneumonia in Chinese patients with acute ischemic stroke. In addition, ML was used to predict severe pneumonia during posttransplant hospitalization in recipients of a kidney transplant [16]. ML was also applied in developing models for liver disease and transplantation to predict post-transplant survival and complications, including acute kidney injury (AKI) and diabetes [17]. To date, there has been no ML model for prediction of postoperative pneumonia in recipients of liver transplant [18].

In this study, we aimed to develop predictive models using ML methods, and to evaluate their performance in predicting postoperative pneumonia in OLT patients. The findings obtained through conducting this study was expected to provide a novel ML algorithm for prediction of postoperative pneumonia in patients after liver transplantation.

Materials and methods

Human subjects and study design

In this retrospective study, data of 894 patients who underwent either living donor liver transplantation (LDLT) or deceased donor liver transplantation (DDLT) in the Third Affiliated Hospital of Sun Yat-sen University-Lingnan Hospital (Guangzhou, Guangdong, China) spanning from January 2015 to September 2019 were retrieved from the EPR systems. All the patients were registered as recipients of organ transplantation in the China Organ Transplant Response Systems ( During the retrospective enrollment, the patients aged < 18 years, presented with preoperative pneumonia or lack of sufficient post-operative data were excluded from this study.

In the EPR systems of our hospital, a database platform was established by extracting medical records from hospital information system (HIS), laboratory information system (LIS), picture archiving and communication system (PACS), and Docare Anesthesia System (2005–2020 Medicalsystem Co., Ltd. Suzhou, China). This database platform enabled access to comprehensive data collected during hospital admission, inpatient stay, and post-hospital follow-up visit, including demographic characteristics, daily documentation, laboratory tests, imaging results, anesthesia records, and other clinical characteristics. This study was reported in accordance with the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines.

Primary outcome

The primary outcome was the incidence of postoperative pneumonia during the postoperative period before hospital discharge. Postoperative pneumonia was defined on the basis of European Perioperative Clinical Outcome (EPCO) definitions, in which at least one of the following definitive chest X-ray or CT findings was fulfilled: infiltrate, consolidation, cavitation; and at least one of the following signs and symptoms of infection (Temperature > 38 °C or < 36 °C with no other causes, white blood cell (WBC) count > 10 × 109/L or < 4 × 109/L)[6].

Data selection

The data elements related to the following categories were chosen from database platform: (1) Demographics: age, gender, height and weight; (2) Preoperative comorbidities: hypertension, coronary heart disease, myocardial infarction, diabetes mellitus, history of alcohol abuse, smoking, and past surgery; (3) Etiology: primary liver diseases contributing to the decision of LT with main focus on hepatitis B, hepatitis C, dual infection of any combination of the known hepatitis virus A to E, hepatic malignancy (including hepatocellular carcinoma and cholangiocarcinoma), alcohol-related liver disease (ALD), drug-induced liver injury (DILI), and autoimmune liver disease; (4) Perioperative laboratory values: lab results concerning liver function, kidney function, electrolytes, and count of blood cells. The results of the latest tests prior to surgery were collected. Lab MELD score prior to surgery was calculated; (5) Preoperative complications: complications and metrics reminding the severity of the patients were collected, which mainly consist of complications related to cirrhosis and portal hypertension, the documentation of treatment escalation including length of stay in ICU, use of continuous blood purification (CBP) and mechanical ventilation; (6) Intraoperative incidents: incidents indicating hemodynamic instability, such as cardia arrest, arrhythmia, lactic acidosis, acidosis, hypernatremia, hypokalemia, and hypotension; (7) Intraoperative medication: including intraoperative use of vasoconstrictors (either used as bolus or continuously) and blood coagulant, which reflected the extent of hemodynamic instability and hemorrhagic tendency. The data collected were the accumulative sum by the end of the surgery; (8) Intraoperative fluid and transfusion: the total of intraoperative fluid infusion and output, as well as the total of blood product transfused were respectively extracted. Red blood cell transfusion, plasma transfusion, total blood product transfusion and total fluid transfusion were all classified into two categories based on specific criterions; (9) Post-operative medications with mainly traced the post-operative medications within 7 days after surgery. These medications consist of colloid, vasoconstrictors, as well as immunosuppressant, antifungal agents and antibiotics; (10) Microorganism observation: test on microorganism during preoperative period and post-operative period.

Variable selection

With 591 records and 148 features, overfitting could occur during training and undermine model performance. Therefore, we first implemented univariate test to filter out features that were statistically insignificant. Finally, 33 features were statistically significant (P < 0.05) and proceeded to be used in a recursive feature elimination (RFE) method embedded with random forest [19]. Initially, RFE method trained on all features and then it recursively removed least important features, the subset of features which had the highest sensitivity score was selected.

Development of machine learning models

To predict postoperative pneumonia, the following six different machine learning models were developed and evaluated for their performance: logistic regression (LR) [20], support vector machine (SVM) [21], random forest (RF) [22], MLP (multilayer perceptron) [23], extreme gradient boosting (XGBoost) [24], and gradient boosting machine (GBM) [25].

XGBoost model was constructed using the xgboost package ( The remaining five models were established via Scikit-learn package ( Considering that machine learning models had multiple tuning parameters which were essential for model performance, fivefold cross-validation grid search method was used for selection of the best parameters and AUCs on testing set were measured (Additional file 1: Table S1). The complete data set of 591 adult was then randomly separated into 70% train and 30% test for validation. Bootstrap method was then used to sample 1000 different test sets in order to get 95% confidence interval (CI) of the best tuned models’ evaluation metrics. Model performance was evaluated by area under receiver-operating curve (AUC), accuracy, sensitivity, and specificity.

Statistical analysis

Python (Anaconda Distribution, version 3.7) package Numpy (version 1.16.5) and Pandas (version 0.25.1) were employed for data cleaning. Python (Anaconda Distribution version 3.7) Scipy package (version 1.3.1) were used to analyze the data. The continuous variables were presented with the mean along with standard deviation (SD), or median along with interquartile range. Independent sample t-test was used for normally distributed data, while Mann–Whitney U test was used for non-normal distribution data in univariate analyses. Categorical variables were expressed with quantities and percentages, and tested by Chi-square test or Fisher’s exact test. Kaplan–Meier methods were applied to estimate the long-term survival rates. Besides, the comparisons between groups were performed by Gehan–Breslow–Wilcoxon test and Log-rank test.

No variables had missing percentage higher than 1%. We employed mean imputation, which imputed missing value with the mean of each feature, to fill in missing values. Before we proceeded to machine learning models, continuous variables were normalized based on the mean and SD of the training set. Categorical variables were encoded into binary variable, 1 represents having an incident, 0 represents not having an incident. Gender was also encoded, 1 represents male, 0 represents female. The whole dataset was split into 70% of training set and 30% of testing set. The data in the training set was used for development of predictive models, while the testing set was used to validate models’ performance.


Characteristics of the study subjects and preoperative factors associated with postoperative pneumonia

A total of 894 patients who underwent orthotopic liver transplantation in our hospital, spanning the period from January 2015 to September 2019, were assessed for eligibility. After 65 pediatric patients, 226 patients with preoperative pneumonia, and 12 patients lack of sufficient postoperative data, were excluded, 591 patients were finally enrolled and used for development and performance evaluation of machine learning models to predict postoperative pneumonia. The flow diagram of the enrollment was presented in Fig. 1. Notably, pneumonia occurred in 253 patients, accounting for as high as 42.81% of the study subjects following liver transplantation, while 338 (57.19%) patients did not have postoperative pneumonia.

Fig. 1

Flow chart of patient enrollment in this study. LR logistic regression, SVM support vector machine, RF random forest, GBM gradient boosting machine, MLP multilayer perceptron, XGB extreme gradient boosting

The demographic characteristics, laboratory tests results, and clinical features of the enrolled patients with or without postoperative pneumonia were summarized in Table 1. The demographic characteristics and preoperative comorbidities did not differ significantly between the patients with or without occurrence of postoperative pneumonia (P > 0.05). Notably, hepatic malignancy, hematocrit (HCT), alanine transaminase (ALT), total bilirubin (TBIL), albumin (ALB), coagulation function, MELD score, and hospital stay were found to have significant differences between patients with or without postoperative pneumonia (P < 0.05). In particular, the patients without postoperative pneumonia had significantly better preoperative hepatic function, as reflected by preoperative liver function tests in comparison with those patients who developed pneumonia after surgery (P < 0.05).

Table 1 Preoperative characteristics of the study patients

Analysis of intraoperative and postoperative factors related to postoperative pneumonia

The intraoperative factors, including those in the following three categories: intraoperative incidents, fluid management and transfusion, and medications, were compared between the study patients with or without postoperative pneumonia. As shown in Table 2, hypernatronemia, longer operation time and anesthesia time, more red blood cell (RBC) transfusion and blood product transfusion, larger volume of infusion and more blood loss were found to be significantly associated with postoperative pneumonia (P < 0.05). Notably, the proportions of patients with RBC transfusion > 18U, blood product transfusion > 5000 mL, total volume of infusion > 10 L, and blood loss > 2 L were significantly higher in the pneumonia group than the non-pneumonia group (Table 2). In addition, higher doses of recombinant activated factor VII (0.343 ± 1.031 vs. 0.134 ± 0.615, P = 0.008) and prothrombin complex concentrate (602.367 ± 410.826 vs. 506.719 ± 359.224, P = 0.01) were administrated in the patients without pneumonia than those with pneumonia.

Table 2 Comparison of intraoperative factors between patients with or without postoperative pneumonia

In terms of postoperative medications (Table 3), the doses of telipressin and dopamine in patients without pneumonia were significantly higher than those with pneumonia (0.148 ± 0.414 vs. 0.079 ± 0.314 mg/day, P = 0.012; 47.544 ± 72.198 vs. 35.473 ± 63.069 mg/day, P = 0.013; respectively). There were no significant differences between the two groups in terms of norepinephrine, dopamine, epinephrine and tacrolimus (P > 0.05).

Table 3 Comparison of postoperative features of the study patients

Feature selection using univariate and recursive feature elimination methods

As partially relevant or less important features may negative affect performance of machine learning models, we performed feature selection and ranked levels of feature importance. Feature selection was performed using univariate and recursive feature elimination (RFE) methods, after which dimensionality was reduced from 148 to 14 features. These 14 features were listed as follows: preoperative international normalized ratio (INR), HCT, platelets (PLT), ALB, ALT, fibrinogen (FIB), WBC, prothrombin time (PT), serum sodium (Na+), TBIL, anesthesia time, preoperative hospital stay, total fluid transfusion, and operation time. Further, feature importance plot was created to rank the levels of importance using fine tuned eXtreme Gradient Boosting (XGBoost) model. As a result, preoperative length of hospital stay, PT, and WBC were ranked first, second, and third, respectively (Fig. 2).

Fig. 2

Feature importance ranking of the selected 14 features illustrated by random forest. PT prothrombin time, WBC white blood cells, FIB fibrinogen, INR international normalized ratio, TBIL total bilirubin, SCR serum creatinine, ALB albumin, HCT hematocrit, ALT glutamic pyruvic transaminase

Performance assessment of the machine learning models for prediction of postoperative pneumonia

Six machine learning models, including LR, SVM, RF, MLP XGBoost, and GBM, were constructed, and their performance for prediction of postoperative pneumonia was assessed. Additional file 1: Table S1 and Fig. 3 showed the best hyperparameter combination for each model and their AUCs in predicting postoperative pneumonia. XGBoost had the highest AUC value (0.793) with the lowest AUC value (0.674) for SVM. The AUC values of LR, SVM, and MLP were relatively lower than other three machine learning models. Ensemble machine learning models such as XGBoost, RF, and GBM showed significantly higher AUC values compared with LR, SVM, and MLP.

Fig. 3

ROC curves for prediction of postoperative pneumonia on one of the test data set. Greater AUC shows higher discriminative ability of the model. AUC area under the receiver operating characteristic curve, SVM support vector machine, GBM gradient boosting machine

In addition to AUCs, accuracy, sensitivity, and specificity were used for evaluation of performance of the six machine learning models. As shown in Table 4, on the 1000 bootstraped test data sets, XGBoost model showed both best AUC (median, 0.794; 95% CI 0.735–0.84) and highest specificity (median, 0.815; 95% CI (0.75–0.872) among the machine learning models. The random forest model showed both best accuracy (median, 0.736; 95% CI 0.674–0.787) and highest sensitivity (median, 0.632; 95% CI (0.538–0.72) among the machine learning models (Table 4).

Table 4 Performance of the six ML models in the testing set

Effect of postoperative pneumonia on patient outcomes

Compare with the non-pneumonia group, the pneumonia group had longer postoperative hospital stay [22 (17, 31) vs. 23 (17, 33) days, P = 0.046] (Table 4) and lower 6-month, (91.0% vs. 96.2%; P = 0.01), 12-month (88.6% vs. 93.4%; P = 0.0045), 2-year (85.3% vs. 91.5%; P = 0.021), and 3-year (84.9% vs. 90.9%; P = 0.03) survival rates and overall survival rates (P = 0.0446; Table 5, Fig. 4) than patients without occurring postoperative pneumonia.

Table 5 Comparison of survival rate of the study patients
Fig. 4

Survival rates of patients with or without postoperative pneumonia. 591 cases with a survival data that last over a 5-year interval were analyzed. The difference of both curves were examined by Log-rank test (Chi square 4.034, df 1, P = 0.0446) and Gehan–Breslow Wilcoxon test (Chi square 4.288, df 1, P = 0.0384)


Early detection of postoperative pneumonia is critical for timely interventions to prevent the onset of the complication. Until now, the predication of postoperative pneumonia has been challenging, and there is need for reliable and accurate predictive model for patients after liver transplantation. This study, based upon large volume of data and ML methods, has the following major novel findings: (1) The incidence of postoperative pneumonia was high in patients after OLT, and the occurrence was significantly associated with prolonged hospital stay and increased mortality after liver transplantation; (2) A total of 14 factors were identified to be significantly correlated with postoperative pneumonia after OLT, including INR, HCT, PLT, ALB, ALT, FIB, WBC, PT, serum Na+, TBIL, anesthesia time, preoperative length of hospital stay, total fluid transfusion, and operation time; (3) The XGBoost model exhibited the best overall performance in predicting postoperative pneumonia among the developed ML models, with the value of AUC of 0.794, sensitivity of 52.6%, and specificity of 77.5%; (4) Multiple lines of evidence support that the XGBoost model holds promise for future clinical application in predicting postoperative pneumonia in patients after liver transplantation.

XGBoost model is recognized as an efficient and scalable tree boosting system [26], and it has performed well in the ML competitions, especially the simplicity in use and the accuracy in prediction [27, 28]. In the present study, we developed a total of six ML models, of these, XGBoost model had the best overall performance, with a specificity of 77.5% and a sensitivity of 52.6% in predicting postoperative pneumonia in OLT patients. In the study, the AUC values of LR, SVM, and MLP were relatively lower than other three ensemble machine learning models including XGboost, RF and GBM, whose accuracy and robustness might be attributed to their nature of integrating multiple base classifiers or learners. However, RF is a bagging ensemble, and it needs to train a large amount of decision trees and aggregate them. As a result, it usually takes much more time to trade numerous random computations for high accuracy, compared with GBM and XGboost, which both belong to boosting ensemble method. Moreover, compared to GBM, XGboost leverages second order derivative and implements sampling method in each iteration to alleviate overfitting and speed up computation.

Considering the high prevalence of multi-drug resistant bacteria in post-transplant patients induced by the excessive use of antibiotics [4], high specificity is especially necessary in clinical practice to avoid an unnecessary and overuse of antibiotics in low-risk patients. By contrast, all patients received peri-operative antibiotic therapy for 72 h, and this has posed considerable challenge in predicting pneumonia at an early stage [29]. Therefore, the novel XGBoost model as established in this study may assist clinicians in making optimal interventions and treatments, and eventually improve care for affected patients.

It has been reported that a number of risk factors, including age of recipient, liver dysfunction score, indication for OLT, perioperative transfusions especially the blood and fresh frozen plasma units, restrictive preoperative pulmonary testing pattern and INR measured prior OLT, are significantly associated with post-liver transplant pneumonia [3, 30, 31]. However, these factors are limited for its underutilization of within-category information, causing a loss of information [32]. For instance, patients above or below the optimal cut-point value had been equally considered in the risk-factor prediction, yet the risk of post-transplant pneumonia may vary considerably. As the risk-factor prediction is developed with neither combining all factors together nor weighting difference between different factors, it is not widely used in clinical practice. In addition, the traditional scores were given on the basis of the assumption that all misclassification errors have equal costs. In fact, this assumption is indefensible if apply in real-world applications [33]. In this study, we applied RFE feature selection method on 33 features which were statistically significant, of which 14 best features with the highest sensitivity score, including preoperative laboratory results of INR, HCT, PLT, ALB, ALT, FIB, WBC, PT, serum Na+, TBIL, anesthesia time; preoperative length of hospital stay, total fluid transfusion, and operation time. We found that most of the factors have been reported to be associated with pneumonia and PPCs except for PLT and serum Na+ [18, 30, 31, 34, 35]. As the risk factors reported in different literatures are quite different and this may be attributed to different population and definition of pneumonia and PPCs, we think it just reflects the advantage of ML models to capture previously unknown correlations in big data. Although the underlying mechanism remained unclear, the high clinical relevance of these factors laid a solid foundation for the consequent ML process and made the conclusion more practical and clinically valuable [36]. Moreover, we found the 14 features in ML model were all routinely recorded and widely used, and no factors need special instrument or equipment to obtain, indicating that our models are feasible and can be widely used in hospitals.

To date, ML models have shown outstanding performance in prediction of diseases and clinical conditions, for which these models can be helpful in decision-making about the use of interventions and medications [33]. For example, ML models can generate an individualized probability for each patient. Additionally, implementation of sophisticated computer algorithms at the bedside has become a reality since the popularity of EPR systems and wide availability of structured patient data. In our study, the EPR systems included HIS, LIS, PACS, and Docare Anesthesia System, which allowed us to integrate medical data generated during admission, covering demographic data, daily documentation, laboratory and imaging results, anesthesia records and thorough record of medication, and treatment. In addition, we separated the patients 1000 times (70% train and 30% test) into 1000 different pairs of train and test sets and this could minimize accidental error and enhance the accuracy of the current ML models. This result showed that in predicting post-transplant pneumonia, we should not apply only one of the ML model.

In the study, we found that patients with hepatic malignancy, better hepatic function before surgery, and longer hospital stay before surgery were significantly associated with lower risk of developing postoperative pneumonia. We postulated that this could be attributed to the better preoperative treatment and preparation, suggested that interventions should be implemented to improve the patients’ overall preoperative conditions. In consistence with previous reports [37, 38], we identified that a number of intraoperative factors, such as the longer operation and anesthesia time, excessive blood product transfusion, and fluid transfusion, were significantly related to postoperative pneumonia in patients following liver transplantation. By contrast, we found that there was an association between the use of telipressin and dopamine and decreased incidence of postoperative pneumonia in patients after liver transplantation. These findings are clinically important for the intraoperative anesthetic management and help improving the clinical outcomes.

The study may have several limitations. Firstly, the ML models are developed on the basis of a single-center cohort study, and future multi-center study will be needed for external validation. Secondly, this study is performed retrospectively, for which collection and entry bias, as well as possible residual confounding may occur. Thirdly, we were unable to incorporate the metrics of liver donors as training variables in our study, due to the lack of donor information in the EPR systems of our hospital.


Our study has successfully established six novel ML models to predict postoperative pneumonia among OLT patients. Of these, the XGboost model has demonstrated overall best performance, and therefore holds promise for future clinical application to predict post-transplant pneumonia in OLT patients. To the best of our knowledge, this is the first ML-based study to provide a novel ML algorithm for prediction of postoperative pneumonia in patients after liver transplantation.

Availability of data and materials

All data generated or analysed during this study are included in this published article [and its additional information files].



Postoperative pulmonary complications


Orthotopic liver transplantation


Machine learning


Logistic regression


Support vector machine


Random forest


Adaptive boosting


Extreme gradient boosting


Gradient boosting machine


National Institutes of Health Stroke Scales


Electronic patient record


Acute kidney injury


Living donor liver transplantation


Deceased donor liver transplantation


Hospital information system


Laboratory information system


Picture archiving and communication system


Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis


European Perioperative Clinical Outcome


White blood cell


Donation of circulatory death


Donation of brain death


Continuous blood purification


Arterial blood pressure


Cardiac output


Stroke volume variation


Estimated blood loss




Intensive care unit


Alcohol-related liver disease


Drug-induced liver injury


Recursive feature elimination


Multilayer perceptron


Gradient boosting machine


Confidence interval


Area under receiver-operating curve


Standard deviation




Alanine transaminase


Total bilirubin




Red blood cell


International normalized ratio






Prothrombin time


  1. 1.

    Magalhães CBA, Nogueira IC, Marinho LS, Daher EF, Garcia JHP, Viana CFG, de Bruin PFC, Pereira EDB. Exercise capacity impairment can predict postoperative pulmonary complications after liver transplantation. Respiration. 2017;94(3):272–8.

    Article  Google Scholar 

  2. 2.

    Li X, Wei X, Chen C, Zhang Z, Liu D, Hei Z, Yao W. N-Acetylcysteine inhalation improves pulmonary function in patients received liver transplantation. Biosci Rep. 2018;38(5).

  3. 3.

    Levesque E, Hoti E, Azoulay D, Honore I, Guignard B, Vibert E, Ichai P, Antoun F, Saliba F, Samuel D. Pulmonary complications after elective liver transplantation-incidence, risk factors, and outcome. Transplantation. 2012;94(5):532–8.

    Article  Google Scholar 

  4. 4.

    Freire MP, Villela Soares Oshiro IC, Bonazzi PR, Pierrotti LC, de Oliveira LM, Machado AS, Van Der Heijdenn IM, Rossi F, Costa SF, Carneiro D’Albuquerque LA, Abdala E. Surveillance culture for multidrug-resistant gram-negative bacteria: performance in liver transplant recipients. Am J Infect Control. 2017;45(3):e40–4.

    Article  Google Scholar 

  5. 5.

    Miguel Montanes R, Elkrief L, Hajage D, Houssel P, Fantin B, Francoz C, Dreyfuss D, Ricard JD, Durand F. An outbreak of Pneumocytis jirovecii pneumonia among liver transplant recipients. Transpl Infect Dis. 2018;20(5):e12956.

    Article  Google Scholar 

  6. 6.

    Miskovic A, Lumb AB. Postoperative pulmonary complications. Br J Anaesth. 2017;118(3):317–34.

    CAS  Article  Google Scholar 

  7. 7.

    Smith CJ, Bray BD, Hoffman A, Meisel A, Heuschmann PU, Wolfe CD, Tyrrell PJ, Rudd AG. Can a novel clinical risk score improve pneumonia prediction in acute stroke care? A UK multicenter cohort study. J Am Heart Assoc. 2015;4(1):e001307.

    Article  Google Scholar 

  8. 8.

    Arozullah AM, Khuri SF, Henderson WG, Daley J. Development and validation of a multifactorial risk index for predicting postoperative pneumonia after major noncardiac surgery. Ann Intern Med. 2001;135(10):847–57.

    CAS  Article  Google Scholar 

  9. 9.

    Shoka M, Kanda M, Ito S, Mochizuki Y, Teramoto H, Ishigure K, Murai T, Asada T, Ishiyama A, Matsushita H, Tanaka C, Kobayashi D, Fujiwara M, Murotani K, Kodera Y. Systemic inflammation score as a predictor of pneumonia after radical resection of gastric cancer: analysis of a multi-institutional dataset. Dig Surg. 2020;37(5):401–10.

    Article  Google Scholar 

  10. 10.

    Kostakis ID, Sotiropoulos GC, Kouraklis G. Pneumocystis jirovecii pneumonia in liver transplant recipients: a systematic review. Transplant Proc. 2014;46(9):3206–8.

    CAS  Article  Google Scholar 

  11. 11.

    Nishi H, Oishi N, Ishii A, Ono I, Ogura T, Sunohara T, Chihara H, Fukumitsu R, Okawa M, Yamana N, Imamura H, Sadamasa N, Hatano T, Nakahara I, Sakai N, Miyamoto S. Predicting clinical outcomes of large vessel occlusion before mechanical thrombectomy using machine learning. Stroke. 2019;50(9):2379–88.

    Article  Google Scholar 

  12. 12.

    Macesic N, Bear Don’t Walk OI, Peer I, Tatonetti NP, Peleg AY, Uhlemann AC. Predicting phenotypic polymyxin resistance in Klebsiella pneumoniae through machine learning analysis of genomic data. mSystems. 2020;5(3):e00656-19.

    Article  Google Scholar 

  13. 13.

    Quesada JA, Lopez-Pineda A, Gil-Guillen VF, Durazo-Arvizu R, Orozco-Beltran D, Lopez-Domenech A, Carratala-Munuera C. Machine learning to predict cardiovascular risk. Int J Clin Pract. 2019;73(10):e13389.

    Article  Google Scholar 

  14. 14.

    Artzi NS, Shilo S, Hadar E, Rossman H, Barbash-Hazan S, Ben-Haroush A, Balicer RD, Feldman B, Wiznitzer A, Segal E. Prediction of gestational diabetes based on nationwide electronic health records. Nat Med. 2020;26(1):71–6.

    CAS  Article  Google Scholar 

  15. 15.

    Li X, Wu M, Sun C, Zhao Z, Wang F, Zheng X, Ge W, Zhou J, Zou J. Using machine learning to predict stroke-associated pneumonia in Chinese acute ischaemic stroke patients. Eur J Neurol. 2020;27(8):1656–63.

    CAS  Article  Google Scholar 

  16. 16.

    Luo Y, Tang Z, Hu X, Lu S, Miao B, Hong S, Bai H, Sun C, Qiu J, Liang H, Na N. Machine learning for the prediction of severe pneumonia during posttransplant hospitalization in recipients of a deceased-donor kidney transplant. Ann Transl Med. 2020;8(4):82.

    CAS  Article  Google Scholar 

  17. 17.

    Spann A, Yasodhara A, Kang J, Watt K, Wang B, Goldenberg A, Bhat M. Applying machine learning in liver disease and transplantation: a comprehensive review. Hepatology. 2020;71(3):1093–105.

    Article  Google Scholar 

  18. 18.

    Li X, Chen C, Wei X, Zhu Q, Yao W, Yuan D, Luo G, Cai J, Hei Z. Retrospective comparative study on postoperative pulmonary complications after orthotopic liver transplantation using the Melbourne Group Scale (MGS-2) diagnostic criteria. Ann Transplant. 2018;23:377–86.

    CAS  Article  Google Scholar 

  19. 19.

    Darst BF, Malecki KC, Engelman CD. Using recursive feature elimination in random forest to account for correlated variables in high dimensional data. BMC Genet. 2018;19(Suppl 1):65.

    CAS  Article  Google Scholar 

  20. 20.

    Yang HC, Islam MM, Nguyen PAA, Wang CH, Poly TN, Huang CW, Li YJ. Development of a web-based system for exploring cancer risk with long-term use of drugs: logistic regression approach. JMIR Public Health Surveill. 2021;7(2):e21401.

    Article  Google Scholar 

  21. 21.

    Wang RZ, Sun CH, Schroeder PH, Ameko MK, Moore CC, Barnes LE. Predictive models of sepsis in adult ICU patients. In: IEEE international conference on healthcare informatics (ICHI), New York, NY, USA. 2018:390–1

  22. 22.

    Zhang K, Liu X, Jiang J, Li W, Wang S, Liu L, Zhou X, Wang L. Prediction of postoperative complications of pediatric cataract patients using data mining. J Transl Med. 2019;17(1):2.

    Article  Google Scholar 

  23. 23.

    Nasser IM, Abu-Naser SS. Lung cancer detection using artificial neural network. Int J Eng Inf Syst. 2019;3(3):17–23.

    Google Scholar 

  24. 24.

    Zeng X, An J, Lin R, Dong C, Zheng A, Li J, Duan H, Shu Q, Li H. Prediction of complications after paediatric cardiac surgery. Eur J Cardiothorac Surg. 2020;57(2):350–8.

    PubMed  Google Scholar 

  25. 25.

    Zhu SL, Dong J, Zhang C, Huang YB, Pan W. Application of machine learning in the diagnosis of gastric cancer based on noninvasive characteristics. PLoS ONE. 2020;15(12):e0244869.

    CAS  Article  Google Scholar 

  26. 26.

    Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29(5):1189–232.

    Article  Google Scholar 

  27. 27.

    Sheridan RP, Wang M, Liaw A, Ma J, Gifford E. Correction to extreme gradient boosting as a method for quantitative structure-activity relationships. J Chem Inf Model. 2020;60(3):1910.

    CAS  Article  Google Scholar 

  28. 28.

    Adam-Bourdarios C, Cowan G, Germain-Renaud C, Guyon I, Kégl B and Rousseau D. The Higgs machine learning challenge. J Phys Conf Ser. 2015;664

  29. 29.

    Li F, Wang C, Liu X, Peng Y, Jin S. A composite model of wound segmentation based on traditional methods and deep neural networks. Comput Intell Neurosci. 2018.

    Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    El-Badrawy MK, Ali RE, Yassen A, AbouElela MA, Elmorsey RA. Early-onset pneumonia after liver transplant: microbial causes, risk factors, and outcomes, Mansoura University, Egypt, Experience. Exp Clin Transplant. 2017;15(5):547–53.

    PubMed  Google Scholar 

  31. 31.

    Koo HJ, Lee HN, Choi SH, Sung H, Oh SY, Shin SY, Kim HJ, Do KH. Human metapneumovirus infection: pneumonia risk factors in patients with solid organ transplantation and computed tomography findings. Transplantation. 2018;102(4):699–706.

    Article  Google Scholar 

  32. 32.

    Chen CY, Lin WC, Yang HY. Diagnosis of ventilator-associated pneumonia using electronic nose sensor array signals: solutions to improve the application of machine learning in respiratory research. Respir Res. 2020;21(1):45.

    Article  Google Scholar 

  33. 33.

    Deo RC. Machine learning in medicine. Circulation. 2015;132(20):1920–30.

    Article  Google Scholar 

  34. 34.

    Zhu M, Zhu Y, Guo F, Zhang J, Liu W, Hou W. Clinical and laboratory characteristics of 215 cases of coronavirus disease 2019 with different prognosis. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2020;32(12):1428–33.

    PubMed  Google Scholar 

  35. 35.

    Li D, Liu C, Liu J, Hu J, Yang Y, Zhou Y. Analysis of risk factors for 24 patients with COVID-19 developing from moderate to severe condition. Front Cell Infect Microbiol. 2020;10:548–82.

    Google Scholar 

  36. 36.

    Hu Z, Melton GB, Moeller ND, Arsoniadis EG, Wang Y, Kwaan MR, Jensen EH, Simon GJ. Accelerating chart review using automated methods on electronic health record data for postoperative complications. AMIA Annu Symp Proc. 2016;2016:1822–31.

    PubMed  Google Scholar 

  37. 37.

    Nijbroek SG, Schultz MJ, Hemmes SNT. Prediction of postoperative pulmonary complications. Curr Opin Anaesthesiol. 2019;32(3):443–51.

    Article  Google Scholar 

  38. 38.

    Siniscalchi A, Aurini L, Benini B, Gamberini L, Nava S, Viale P, Faenza S. Ventilator associated pneumonia following liver transplantation: etiology, risk factors and outcome. World J Transplant. 2016;6(2):389–95.

    Article  Google Scholar 

Download references


The authors thank all of the patients who kindly participated in this study.


This study was supported by the National Natural Science Foundation of China (Grant No. 81974296) and Postdoctoral Science Foundation of China (Grant No. 2019M663260 and 2020T130148ZX).

Author information




SZ, ZH and YY participated in research design. CC, DY, LC and YZ participated in the writing of the paper. CC, DY, SG and ZM analyzed and explained the data; LC, YZ, SG, BW and CC collected and helped with analyzing the data of the study. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yang Yang or Ziqing Hei or Shaoli Zhou.

Ethics declarations

Ethics approval and consent to participate

Ethical approval for this study (No.[2019]02-609-01) was provided by the Ethnic Committee of the Third Affiliated Hospital of Sun Yat-sen University-Lingnan Hospital, Guangzhou, China (Chairperson Prof. Xie Dongying) on 18 May 2019. The requirement for informed consent and clinical trial registration were waived by the committee, mainly due to the retrospective nature of this study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

: Table S1. Hyperparameter combinations for each machine learning models and their corresponding test set AUC.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Chen, C., Yang, D., Gao, S. et al. Development and performance assessment of novel machine learning models to predict pneumonia after liver transplantation. Respir Res 22, 94 (2021).

Download citation


  • Liver transplantation
  • Postoperative pneumonia
  • Machine learning
  • Postoperative pulmonary complications
  • Disease prediction
  • Risk factors
  • Early intervention
  • Deep learning
  • ML algorithm
  • Extreme gradient boosting