Abstract
Abstract
Background:
The (Immunodeficiency, Blood pressure [<90 mm Hg], Multilobular intiltrates [chest x-ray], Platelets [<100×109/L], hospitalization [<10 days] before the onset of ventilator-associated pneumonia [VAP]) IBMP-10 is a new scoring system proposed as an easy-to-use alternative to the Acute Physiology and Chronic Health Evaluation II (APACHE II) score for predicting mortality in patients with ventilator-associated pneumonia (VAP). The objective of this study was to determine the validity of the IBMP-10 score compared with APACHE II in predicting mortality for an independent population consisting predominantly of surgical and neurotrauma patients.
Methods:
The IBMP-10 and APACHE II scores on the day of VAP diagnosis were calculated, and areas under the receiver-operating characteristic curves (AUROCs) were compared to determine the tests' abilities to predict 14- and 28-day mortality.
Results:
A total of 168 patients meeting the radiologic and clinical criteria for VAP for a single hospitalization between 2004 and 2007 were included; 80% of these were from the surgical or neurotrauma intensive care unit. Overall mortality rates were 15% and 23% at 14 and 28 days, respectively. The AUROC for the IMBP-10 score for predicting 14-day mortality was 0.609 (p=0.084) compared with 0.648 (p=0.017) for the APACHE II score. Both IBMP-10 and APACHE II AUROCs for predicting 14-day mortality were lower than observed in the original score validation (0.808 and 0.743, respectively). The AUROCs for predicting 28-day mortality were 0.602 (p=0.056) and 0.705 (p<0.001) for IBMP10 and APACHE II, respectively.
Conclusions:
The IBMP-10 score was less reliable than the APACHE II score in predicting 14-day mortality in this independent population of VAP patients. This finding highlights the need for additional validation of new disease severity scoring systems in a study population independent of the population used to derive score criteria, as well as in more specific populations of critically ill patients.
The (Immunodeficiency, Blood pressure [<90 mm Hg], Multilobular intiltrates [chest x-ray], Platelets [<100×109/L], hospitalization [<10 days] before the onset of ventilator-associated pneumonia [VAP]) IBMP-10 score is a simple five-point system developed recently for predicting mortality in VAP patients. One point is assigned for each of the following: immunosuppression (I), systolic blood pressure <90 mm Hg or mean arterial pressure ≤70 mm Hg (B), multilobar infiltration on chest radiograph obtained the day of VAP diagnosis (M), platelet count <100,000/mm3 (P), and >10 days of hospitalization prior to VAP diagnosis (10) [8]. Immunosuppression is defined as steroid use >10 mg of prednisolone or an equivalent for more than five days, use of an immunomodulator, active malignant disease at the time of VAP diagnosis or cancer diagnosed within one year preceding the current VAP episode, acquired immunodeficiency syndrome, or chemotherapy or radiotherapy within the 30 days prior to the diagnosis of VAP [8].
The IBMP-10 scoring system has been validated only in the population of patients used to derive the five criteria that best predicted 14-day mortality in developing the scoring system. Additionally, demographics regarding the type of ICU where the patient developed VAP were not reported. Therefore, the ability of this system to predict 14-day mortality in a population independent of those patients used initially or within special populations of critically ill patients has not been determined. The objective of this study was to validate the utility of this new scoring system in an independent population of critically ill patients with VAP, consisting primarily of surgical and neurotrauma patients.
Patients and Methods
Study Design
We conducted a secondary analysis of data gathered from a previous evaluation of patients with VAP at a single institution, Hartford Hospital in Hartford, CT. The original study was a prospective, observational evaluation of patients with VAP who were managed using an antibiotic clinical pathway compared with patients in a historical control group [9]. The study was approved by the Institutional Review Committee at Hartford Hospital, and the need for informed consent was waived. Baseline characteristics, all variables needed to calculate IBMP-10 and APACHE II scores on the day of VAP diagnosis, and patient outcomes at 14 days, 28 days, and hospital discharge were available from an electronic database and review of patient medical records.
The IBMP-10 scores were calculated according to the criteria outlined in the original study by Mirsaeidi et al., and APACHE II scores were calculated as previously described [6,8]. Scores were calculated using the worst value within 24 h before or after VAP diagnosis. The ability of the IBMP-10 score to predict mortality was compared with that of the APACHE II score. For comparison with IBMP-10 scores, APACHE II scores were divided into categorical ranges, as was done by Mirsaeidi et al.: 0=score <15 points; 1=score 15–19 points, 2=score 20–24 points; 3=score 25–29 points; 4=score ≥30 points [8]. In addition, we compared our results with those of the original IMBP-10 study, which included patients from the Improving Medicine through Pathway Assessment of Critical Therapy of Hospital-Acquired Pneumonia (IMPACT-HAP) database [8].
Patient Population
A total of 168 patients from three ICUs (medical, surgical, and neurotrauma) at Hartford Hospital, an 867-bed tertiary-care center, were included. All patients were hospitalized adults >17 years of age who acquired VAP between July 2004 and September 2007 [9]. Ventilator-associated pneumonia was diagnosed using clinical criteria similar to those employed in the original score development and validation [8]. Specifically, VAP was defined as a new or progressing infiltrate on chest radiograph in a patient who had been ventilated mechanically for more than 48 h or weaned from mechanical ventilation within the previous 48 h, and who met at least two of the following clinical criteria: Body temperature >38°C or <36°C without another recognized cause, white blood cell count >10,000 cells/mm3 or <5,000 cells/mm3, or a macroscopically purulent tracheal aspirate [1,9].
Statistical Analysis
Categorical variables were compared using either the chi-square test or the Fisher exact test where appropriate, and continuous variables were compared using either the Student t-test or the Mann-Whitney rank-sum test. All statistical tests were completed using SigmaStat Statistical Software Version 2 (SPSS Inc., Chicago, IL). Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for 14-day, 28-day, all-cause in-hospital mortality were determined for both the APACHE II and the IMBP-10 scores; receiver operating characteristic (ROC) curves were constructed; and the area under the ROC curves was calculated and analyzed using MedCalc Version 11 (MedCalc Software, Mariakerke, Belgium). A p value <0.05 was considered statistically significant.
Results
A total of 168 patients were included; the sample size was similar to that of the initial score validation set (n=178) [7]. The mean age (± standard deviation) was 56.7±20.6 years, and 109 patients (65%) were male. There were 26 nonsurvivors at 14 days (15%), and 39 non-survivors at 28 days (23%). A 14-day mortality rate of 15% was observed during the initial score validation [8]. No significant differences in patient demographics or co-morbidities were observed between survivors and non-survivors at 14 or 28 days, with the exception that the non-survivors had a significantly higher APACHE II score on the day of VAP diagnosis (Table 1). Immunosuppression was uncommon in our patient population (n=34; 20%) and was slightly less frequent than in the population used in the initial IBMP-10 validation (32%). Eighty percent of the patients were from surgical and neurotrauma ICUs (Table 1). A large percentage of the current study population also had cerebrovascular disease (n=106; 63%). At least one causative organism was isolated from 144 patients (88%); a total of 182 baseline organisms were identified in these patients (Table 2).
APACHE II=Acute Physiology and Chronic Health Evaluation II; Late-onset VAP=pneumonia developing greater than four days after initiation of mechanical ventilation; MICU=medical intensive care unit; NTICU=neurotrauma intensive care unit; SICU=surgical intensive care unit; SD=standard deviation.
Mortality rates by IBMP-10 score are depicted in Figure 1. Overall, the mortality rates for the current study population were similar to those from the IMPACT-HAP database when the IBMP-10 and APACHE-II scores were ≤2 points but lower in the current study when severity scores were >2 points.

Mortality rates at 14 days for Acute Physiology and Chronic Health Evaluation II (APACHE II) and IBMP-10 (see text for definition) scores in 168 patients with ventilator-associated pneumonia. Categories for APACHE II Scores: 0=<15; 1=15–19, 2=20–24; 3=25–29; 4=≥30. Abbreviations: HH=Hartford Hospital; IMPACT-HAP=Improving Medicine through Pathway Assessment of Critical Therapy of Hospital-Acquired Pneumonia.
Patient numbers, sensitivity, specificity, PPV, and NPV for both scores and at various cut-off points are listed in Table 3. The NPV was relatively high, whereas the PPV was low for both scoring systems. The areas under the ROC curves (AUROCs) for predicting 14-day mortality with the IBMP-10 score (0.609; 95% confidence interval [CI] 0.5301–0.683; p=0.0844) as well as the APACHE II score (0.648; 95% CI, 0.571–0.720; p=0.0174) were lower than those observed in the initial score validation, 0.808 and 0.743, respectively (Table 4) [8].
APACHE=Acute Physiology and Chronic Health Evaluation II; NPV=negative predictive value; PPV=positive predictive value.
CI=confidence interval; HH=Hartford Hospital; IMPACT-HAP=Improving Medicine through Pathway Assessment of Critical Therapy of Hospital-Acquired Pneumonia; VAP=ventilator-associated pneumonia.
ROC curves also were constructed and analyzed for 28-day mortality as well as several other subpopulations (Table 4). The results were similar when controlling for traumatic brain injury, which could carry an additional mortality risk. Because utilization of the antibiotic clinical pathway in our original study was shown to have an impact on the infection-related mortality rate, we also analyzed ROC curves for both scoring systems in patients prior to and after VAP clinical pathway implementation. The AUROCs for 14-day mortality were not statistically significant for either score in patients prior to pathway implementation, whereas the AUROC for the APACHE II score was statistically significant after pathway implementation. However, when looking at discrimination for predicting 28-day and all-cause mortality, the AUROCs for the IBMP-10 score were not only statistically significant, but also higher than those for the APACHE II score in the pre-pathway group only.
Lastly, rates of occurrence for each of the five components of the IBMP-10 score were compared for survivors and nonsurvivors at 14 and 28 days. Although all five score criteria were significantly more prevalent in non-survivors in the initial score validation, thrombocytopenia was the only criterion that was significantly more prevalent in non-survivors at 14 days in the current study (Table 5). Multi-lobar pneumonia was also observed at the same rate in non-survivors than survivors at 14 days (the difference was not statistically significant. The prevalences of immunosuppression and hypotension were similar in non-survivors and survivors at 14 days. Hospitalization for >10 days before the onset of VAP was not more frequent in survivors than in non-survivors (the difference was not statistically significant).
MAP=mean arterial blood pressure; SBP=systolic blood pressure; SD=standard deviation; VAP=ventilator-associated pneumonia; IMBP-10=see text for definition.
Discussion
The IBMP-10 score was recently proposed as an easier-to-use alternative to the APACHE II score for predicting mortality in patients with VAP, but before this study had been validated only in the study population used to derive the score criteria. The IBMP-10 score was not able to predict accurately 14-day mortality in our independent series of critically ill patients. Conversely, the APACHE II was a valid predictor of both 14- and 28-day mortality when calculated on the day of VAP diagnosis, similar to what has been demonstrated in previous studies [7].
In the current study, we attempted to reproduce the validation of the IBMP-10 as originally described by Mirsaeidi et al. [8] using an independent population of patients with VAP. Hence, we used the same clinical diagnostic criteria for VAP employed in the original study. We recognize that using clinical criteria alone is not the most accurate for diagnosing VAP, as alternative diagnoses could be responsible for patients' clinical findings. Despite using the same methodologies and a population of similar size (n=168 vs. 178) with the same 14-day mortality rate (15%), the IBMP-10 score could not be validated. Differences in patient characteristics between our population and the population used in the initial validation could account for the discordant results. Our population consisted predominately of patients from surgical and neurotrauma ICUs. However, the original validation study did not differentiate patients by ICU, nor were patient demographics described in detail, making it difficult to make a comparison. Another factor that could have contributed to our conflicting results is variations in the number of patients in each score category. However, without knowing the distributions in the initial validation populations, the role this played is difficult to assess.
It also is important to recognize the potential bias introduced when the IBMP-10 score was validated initially, as this procedure was conducted in the same population used to derive the five score criteria found to be predictive of mortality. In contrast, there was no difference in occurrence rates for four of these five criteria between survivors and nonsurvivors at 14 days in the current study. This observation stresses the need for the current validation, conducted in a population that did not contribute to the construction of the scoring system. Although validation in a large, multi-center population certainly is useful, single-center observations such as this one also are important in validating new scoring systems, as this is how the score likely will be used.
In our single-center validation, the IBMP-10 was not as reliable as APACHE II in predicting mortality in a population consisting largely of surgical and neurotrauma patients. Perhaps the score would have performed better in a population of medical ICU patients, where the mortality rate attributable to VAP could be lower. However, the current study did not have a sufficient population of medical ICU patients to allow this comparison.
The AUROCs from the current analysis of the IBMP-10 score's ability to predict 14- and 28-day mortality associated with VAP (0.609; p=0.084 and 0.602; p=0.056, respectively) were lower than those observed for other prognostic indices in previous studies [6,9]. In a study evaluating the prognostic capability of APACHE II, the Sequential Organ Failure Assessment (SOFA), and the Clinical Pulmonary Infection Score (CPIS) for VAP in 63 patients, the APACHE II and SOFA scores had excellent discriminatory ability, with AUROCs of 0.81 (p=0.001) and 0.71 (p=0.005), respectively, compared with 0.63 (p=0.069) for the CPIS [7]. The PIRO (predisposition, insult, response, organ dysfunction) score is another new prognostic system for patients with VAP [10]. In the initial validation of this score, PIRO was a better discriminator of ICU mortality than the APACHE II score in 441 patients, with AUROCs of 0.81 and 0.53, respectively. The inclusion of additional variables associated with ICU mortality in this scoring system compared with the IBMP-10 score may make this a better discriminator of disease severity. This scoring system also requires further validation prior to use in clinical practice.
In conclusion, the prognostic capability of the IBMP-10 score for determining the 14-day mortality risk in patients with VAP could not be validated in the current study population, comprised of predominantly surgical and neurotrauma patients. In this independent population, the IBMP-10 score was less reliable than the APACHE II score in predicting 14-day mortality, although its performance improved for predicting 28-day mortality. This finding highlights the need for additional validation of new prognostic scoring systems in a study population independent of the population used to derive the score criteria, in addition to validations in more specific populations of critically ill patients (i.e., medical or surgical) prior to implementation in clinical practice.
Footnotes
Acknowledgments and Disclosure
We thank Ilene Staff, Ph.D., for her assistance with statistical analyses. This study was undertaken with funds from the Center for Anti-Infective Research and Development, Hartford Hospital.
The authors have no financial conflicts to disclose.
