Abstract
Background Chest computed tomography (CT) is commonly used to diagnose pneumonia in Japan, but its usability in terms of prognostic predictability is not obvious. We modified CURB-65 (confusion, urea >7 mmol·L−1, respiratory rate ≥30 breaths·min−1, blood pressure <90 mmHg (systolic) ≤60 mmHg (diastolic), age ≥65 years) and A-DROP scores with CT information and evaluated their ability to predict mortality in community-acquired pneumonia patients.
Methods This study was conducted using a prospective registry of the Adult Pneumonia Study Group – Japan. Of the 791 registry patients, 265 hospitalised patients with chest CT were evaluated. Chest CT-modified CURB-65 scores were developed with the first 30 study patients. The 30-day mortality predictability of CT-modified, chest radiography-modified and original CURB-65 scores were validated.
Results In score development, infiltrates over four lobes and pleural effusion on CT added extra points to CURB-65 scores. The area under the curve for CT-modified CURB-65 scores was significantly higher than that of chest radiography-modified or original CURB-65 scores (both p<0.001). The optimal cut-off CT-modified CURB-65 score was ≥4 (positive-predictive value 80.8%; negative-predictive value 78.6%, for 30-day mortality). For sensitivity analyses, chest CT-modified A-DROP scores also demonstrated better prognostic value than did chest radiography-modified and original A-DROP scores. Poor physical status, chronic heart failure and multiple infiltration hampered chest radiography evaluation.
Conclusion Chest CT modification of CURB-65 or A-DROP scores improved the prognostic predictability relative to the unmodified scores. In particular, in patients with poor physical status or chronic heart failure, CT findings have a significant advantage. Therefore, CT can be used to enhance prognosis prediction.
Abstract
Chest CT modification of CURB-65 and A-DROP improves prognosis prediction in community-acquired pneumonia. Patients with low physical status or chronic heart failure may have mismatch of chest CT and radiography findings. https://bit.ly/30GbNZS
Community-acquired pneumonia (CAP) is one of the four major causes of death worldwide [1]. Pneumonia-related mortality in older individuals (≥70 years) increased by 33.6% in the decade leading up to 2017 [1], and so prognostic prediction is crucial for determining the site of care of patients with pneumonia and for avoiding unnecessary hospital admission [2]. The CURB-65 (confusion, urea >7 mmol·L−1, respiratory rate ≥30 breaths·min−1, blood pressure <90 mmHg (systolic) ≤60 mmHg (diastolic), age ≥65 years), which assigns scores for confusion, uraemia, respiratory rate, blood pressure and age ≥65 years [3], is the most widely known scoring system and, given its simplicity, European guidelines recommend its use after a patient is diagnosed with CAP in a hospital care setting [3, 4].
Another useful prognostic scoring system, the A-DROP, proposed by the Japanese Respiratory Society, uses scores that are similar to those of the CURB-65, but with a different cut-off age: ≥70 years for men and ≥75 years for women [5]. Some studies in Japan and the US have reported a slightly higher area under the receiver operating characteristic curve (AUROC) for the A-DROP scoring system as compared to the CURB-65 [6, 7].
A score of ≥2 on the CURB-65 generally indicates the need for admission and has high sensitivity (92.8%) for 30-day mortality, but its specificity (49.2%) is unsatisfactory [8]. Thus, in practice, clinicians always require supplemental information to improve the specificity for prognosis and to determine whether CAP patients should be hospitalised or can be treated as outpatients. Indeed, the updated US guidelines suggest further investigation of the CURB-65 scores and new prediction rules for prognosis [9].
Chest computed tomography (CT) has been reported as a useful diagnostic tool for detecting pneumonia when chest radiography could not detect abnormal infiltration in the lung field [10]. Additionally, chest CT is helpful in detecting complications in pneumonia or the specific aetiology of pneumonia [11]. Careful differentiation of pneumonia from non-infectious diseases, including heart failure, interstitial lung disease and chronic obstructive pulmonary disease, is also difficult with chest radiograph alone [12].
In terms of prognostic assessment, some prospective studies have reported that chest radiograph findings are useful for predicting the prognosis of CAP [13–15]. In fact, these findings have been included in some of the major prognostic scoring systems, including the Pneumonia Severity Index (PSI) [2] and SMART-COP [10]. In contrast, there are currently no data available to verify the prognostic predictability of chest CT in pneumonia cases [2]. The quantity of infiltrates and ancillary findings on chest CT is easily considered for the prognosis of pneumonia; however, this was merely an extrapolation of evidence from chest radiography [13–15]. Therefore, the objective of this study was to evaluate the ability of CURB-65 scores modified with CT information to predict mortality in CAP patients.
Methods
Study design and study participants
This study was a secondary analysis of the Adult Pneumonia Study Group – Japan registry, a multicentre prospective surveillance programme conducted from 2011 to 2013 at four hospitals located in the major islands of Japan [16, 17].
The current study targeted patients registered in the cohort at Kameda Medical Center, Chiba, Japan, a 925-bed tertiary care centre that is the largest of the four participating facilities, from February 2012 to January 2013. Patients were enrolled who met the following criteria: 1) age ≥15 years; 2) symptoms compatible with pneumonia (e.g. fever, cough, sputum, pleuritic chest pain and dyspnoea); 3) presence of new pulmonary infiltrates on chest radiograph or chest CT that were consistent with pneumonia [16]; 4) chest CT performed at the emergency or outpatient department within 48 h of hospital visit; and 5) hospitalisation. Exclusion criteria were: 1) hospitalisation for reasons other than pneumonia; 2) patients with advanced bronchiectasis or interstitial pneumonia on chest CT classified as severe because the observer could not assess the images; and 3) patients with missing values for calculation of CURB-65 scores.
The Kameda Medical Center institutional review board approved the study (ID 14-042) and allowed the opt-out method instead of requiring informed patient consent.
Chest radiography and chest CT acquisition
Chest radiography was performed using a standardised protocol with specified position (standing or sitting) and we evaluated the frontal view only, because the lateral view is rarely taken in elderly patients who had difficulty walking. A multidetector chest CT scan was performed using a standard-dose protocol with 2- to 5-mm collimation at the same mm intervals. Contrast material was not used in all cases. We acquired prior chest radiograph and CT results if possible, to evaluate whether the lung infiltrates or pleural effusion were new. Because our facility was the only hospital for tertiary care in our city, study participants without prior chest imaging were very rare.
Index test and reference standard
Patients' chest radiograph and CT images were evaluated and scored independently by two senior chest physicians (K.N. and S.N.) who were blinded to clinical information. Consensus was reached by discussion in the case of disagreement. The observers assessed multilobar chest radiograph involvement, consolidation affecting more than two-thirds of the unilateral lung field or affecting bilateral lung fields, and the presence of pleural effusion, based on previous studies [13–15]. The presence of each of these contributed an additional point to the CURB-65 scores (i.e. the chest radiography-modified CURB-65 scores).
We defined the study reference standard as 30-day mortality from the day of admission.
Score development and validation
For chest CT analysis, the number of lobes involved and the presence of pleural effusion were assessed in a way similar to a previous study using chest radiograph imaging [13–15]. One observer (K.N.) reviewed the chest CT and the outcome of the first 30 consecutive patients who registered as study participants to establish the scoring system. When determining features that could be used to modify the CURB-65, we considered those that would be clinically amenable. Each of these features was assigned one whole additional point, rather than using statistical weighting, to facilitate subsequent analysis [8]. The scores were then used for validation in the remaining study participants.
Statistical analysis
The characteristics of both the study and the overall registry participants from Kameda Medical Center were described to shed light on potential selection bias. Categorical variables, expressed as the total number and percent, were compared using chi-squared tests. Continuous variables, expressed as the median and interquartile range, were compared using the Mann–Whitney U-test. Kappa coefficients were used to estimate inter-rater agreement. AUROC was calculated to compare the ability of chest CT-modified CURB-65 scores to discriminate CAP with that of chest radiography-modified and original scores. DeLong's test was used to compare ROC curves. The confidence intervals for the different AUROCs were calculated by bootstrapping (100 000 iterations). To control the family-wise error rate of multiple comparisons to 5%, we used a Bonferroni-corrected threshold of p<0.017. As a sensitivity analysis, the A-DROP score was also tested using the same scoring method as that for the CURB-65. All analyses were performed using R ver. 3.5.2 (R Foundation for Statistical Computing, Vienna, Austria) [18] with the add-on package EZR ver. 1.36 (Saitama Medical Center, Jichi Medical University, Saitama, Japan) [19].
Results
Study participants
In total, 791 participants were enrolled in the Adult Pneumonia Study Group – Japan registry at Kameda Medical Center, from February 2012 to January 2013; of these, 265 patients were included in the current study. The first 30 of the 265 patients were selected for score development and the remaining 235 patients were used for validation (figure 1). The baseline characteristics of the 265 eligible patients and overall registry population are summarised in table 1. Old age, male sex, low Eastern Cooperative Oncology Group (ECOG) performance status, presence of certain comorbidities and CURB-65 scores were statistically more frequent in the eligible patients than in the total registry population.
Patient selection flow chart. CT: computed tomography; CURB-65: confusion, urea >7 mmol·L−1, respiratory rate ≥30 breaths·min−1, blood pressure <90 mmHg (systolic) ≤60 mmHg (diastolic), age ≥65 years
Baseline patient characteristics
Chest CT score development and validation
In score development using the first 30 patients, the expert panel (K.N. and M.A.) decided to add a single point to the original CURB-65 or A-DROP score for identification of infiltration affecting ≥4 lobes and the presence of pleural effusion (unilateral or bilateral) on chest CT, mainly because of correlation with the 30-day mortality and because infiltration affecting ≥4 lobes automatically corresponded to chest radiograph involvement affecting the bilateral lungs.
Subsequently, the 30-day mortality prediction was tested by chest CT- and chest radiography-modified scores in 235 participants (table 2). For both infiltration and pleural effusion, a CT-modified score of 1 significantly correlated with a higher mortality rate, as compared with a score of 0; however, for both systems, a chest radiography-modified score of 1 did not correlate with the mortality rate, as compared with a score of 0.
Assessment of 30-day mortality risk using chest computed tomography (CT) and chest radiograph scores
The inter-rater agreement for chest CT assessment was good for both the presence of unilateral effusion (κ=0.88) and involvement of ≥4 lobes (κ=0.71). For chest radiography observation, the agreement was moderate for multilobar involvement (κ=0.46) and fair for the presence of unilateral pleural effusion (κ=0.35).
Comparison of predictive accuracy
The AUROC for all scores for 30-day mortality are displayed in figure 2. There were significant differences between the AUROCs for chest CT-modified CURB-65 or A-DROP scores and each of the other scores (table 3); there were no significant differences between that of the chest radiography-modified CURB-65 or A-DROP scores and that of the original CURB-65 or A-DROP scores, respectively. The AUROC of PSI also showed no statistically significant difference as compared to that of the original CURB-65 scores (figure S1).
Area under the receiver operating characteristic curve for prediction of 30-day mortality using the original, chest radiography-modified, and chest computed tomography-modified a) A-DROP and b) CURB-65 (confusion, urea >7 mmol·L−1, respiratory rate ≥30 breaths·min−1, blood pressure <90 mmHg (systolic) ≤60 mmHg (diastolic), age ≥65 years scores.
AUROC for 30-day mortality for the various prediction scores
The sensitivity, specificity and positive- and negative-predictive values for 30-day mortality for all scoring models are shown in table S1. A CURB-65 score of 2, which is the recognised cut-off value for hospitalisation in general [2], has 82.8% sensitivity with specificity of only 38.8%; in contrast, a chest CT-modified A-DROP score of 4 (sensitivity=79.3%) showed the highest specificity (89.8%). Regarding predictive value, chest CT-modified CURB-65 and A-DROP scores of 0–3 showed remarkable negative-predictive value (>94%) and scores of 4–7 showed good positive-predictive value (>80%). In contrast, the original scores showed similar negative-predictive value for scores of 0–2, but even scores of 4–5 showed a negative-predictive value of <75%, which was unsatisfactory.
Discrepancy between chest images and patient characteristics
There were discrepant findings in the presence of pleural effusion between chest CT and chest radiographs: 19.4% versus 2.4% in survivors, and 82.8% versus 6.9% in non-survivors. We compared the characteristics of the patients with and without such radiological discrepancy in pleural effusion (table S2). Low ECOG performance status, heart failure as a chronic underlying disease, and chest CT-demonstrated infiltration affecting ≥4 lobes were significantly more frequent in patients with this radiological discrepancy.
Discussion
Prognostic prediction in cases with CAP is crucial for deciding site of care and avoiding unnecessary hospitalisation in the current rapidly aging society. As the prognostic value of chest CT findings has previously been unappreciated and sometimes the evidence was extrapolated from that of chest radiographs, we investigated the prognostic predictability of chest CT information. Chest CT-modified CURB-65 scores had a significantly higher AUROC than did the original and chest radiography-modified CURB-65 scores for predicting 30-day mortality. Moreover, the results of a sensitivity analysis using chest CT-modified A-DROP scores showed the same superiority, which confirmed the robustness of the predictive ability of chest CT modification.
In a review of the literature, one prospective study showed that chest CT contributed more to the diagnostic accuracy of CAP than did chest radiograph imaging in an emergency department setting [20]. No previous studies had evaluated the incremental predictability of chest CT. Our results may therefore provide physicians with new insights into an effective strategy for managing CAP.
In previous studies, chest radiography was evaluated by radiologists [13–15]. In contrast, chest image evaluation by clinicians in this study was akin to that in a practical clinical setting, such as an emergency room. Although we have not excluded the prognostic predictability of chest radiographs per se, chest radiography-modified CURB-65 and A-DROP scores did not show superiority, in terms of the AUROC, for predicting 30-day mortality. The reproducibility of chest radiograph observations was also inadequate. Chest radiographs tended to under-detect pleural effusion, particularly in patients with a low ECOG performance status, history of chronic heart failure, and chest CT-determined infiltration affecting ≥4 lobes. In contrast, chest CT observations were very reproducible. In particular, the presence of pleural effusion showed almost perfect inter-rater agreement. Assessment of multilobar involvement showed some inter-rater disagreement in terms of the recognition of old shadows, subtle infiltrates, ground-glass opacities and granular shadows, but in terms of involvement of ≥4 lobes, there was substantial inter-rater agreement. These findings might be advantageous for physicians who must evaluate them in a tertiary care hospital.
As for the predictive value of chest CT-modified CURB-65 and A-DROP scores, the high sensitivity (both 89.7%) and negative-predictive value (97.8% and 97.9%, respectively) of these systems for scores ≥3 might also be useful for deciding site of care. In contrast, a previous study reported the sensitivity and specificity of a CURB-65 score ≥2 as 92.8% and 49.2%, respectively [8]. Another study suggested that the underestimation of the CURB-65 in older adults may be due to the inappropriate weight given to age [21]. The current study investigated one of the most elderly societies in the world [22]. In Japan, the incidence of CAP among people aged ≥65 years was 42.3 per 1000 per year; meanwhile, the CAP incidence was 28.4 per 1000 per year in the USA [23], 14 per 1000 per year in Spain [24], and 8 per 1000 per year in the UK [25]. Herein, the findings that the AUROC for the A-DROP were numerically higher than those for the CURB-65, and that the PSI did not show higher AUROC than the original CURB-65 scores, might be explained by the age characteristics of our sample. Although the updated US guidelines in 2019 recommend using PSI based on several prospective studies from 2000 to 2005 [9], the PSI might not be a good fit for rapidly aging societies like Japan.
As facility-limited usage, cost and radiation exposure are often considered reasons to avoid chest CT in a clinical setting, the true value of chest CT might be underestimated due to a lack of research-based evidence. The current study not only provided additional academic insights to improve our knowledge of chest CT, but also demonstrated a practical application. In the sub-analysis comparing the characteristics of patients with or without radiological discrepancy, poor physical status, a history of chronic heart failure and multilobar infiltration were identified as factors that increase the difficulty of chest radiograph evaluation. Another reason for the discrepancy, which could not be addressed in the current study, was the presence of infiltration in the lower regions of the lung. This might complicate the detection of pleural effusion on the chest radiographs, whereas there would be no such difficulty with CT. As the efficiency of chest CT usage has improved, cost-effectiveness can also be improved by using chest CT for difficult patients with chronic thoracic disease or for patients who cannot maintain their posture for chest radiographs or cannot be assessed accurately due to comorbid cardiac disease [22–25]. Although we did not assess the prognostic value of other comorbidities in the lung, especially emphysema and fibrosis, those lung conditions might be reflected in the respiratory score in CURB-65, A-DROP and PSI. Further studies are needed to improve the prognostic ability of CT alone.
This study had several limitations. First, it used a registry of prospectively enrolled CAP patients, but analysed a limited number of hospitalised patients, who were relatively older and sicker, due to lack of manpower to evaluate the CT image data of all 792 participants in order to include outpatients. Therefore, the results of this study might not be generalisable to all patients. Nevertheless, the current sub-analysis showed that specific patients with a low ECOG performance status or history of chronic heart failure might particularly benefit from undergoing chest CT for severity assessment and site of care decisions. Second, an external validation for chest CT-modified scoring was not tested. Thus, a population-based prospective study is required to verify our findings; however, this requires a large population and high costs, and the results may still not be applicable to real-world practice because chest CT for the detection of pneumonia is not routinely recommended for facility-limited equipment, higher cost and radiation exposure. Therefore, multiple studies and meta-analyses are needed to make recommendations regarding the use of chest CT in the practice of CAP.
Conclusions
In conclusion, chest CT-modified scores were shown to improve the prediction of 30-day mortality in patients with CAP. Further large-cohort, multinational research studies are needed to confirm our results.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Supplementary material 00079-2020.supplement
Acknowledgements
We would like to thank Editage (www.editage.jp) for English language editing. We are also grateful to the staff of the Adult Pneumonia Study Group – Japan (APSG-J) study, in particular, the staff of the Depts of Pulmonary Medicine, Infectious Diseases, Laboratory Medicine and General Internal Medicine, and the Emergency and Trauma Center at Kameda Medical Center, Chiba, Japan.
Footnotes
This article has supplementary material available from openres.ersjournals.com
Author Contributions: M. Nemoto, K. Nakashima, Y. Matsue, K. Yoshida, Y. Matsui, and M. Aoshima contributed to the study's conception and design. K. Nakashima and M. Aoshima provided administrative support. All authors were involved in provision of study materials or patients. K. Nakashima and M. Nemoto were involved in collection and assembly of data. All authors were involved in data analysis and interpretation. All authors were involved in manuscript writing. All authors read and approved the final version of the manuscript.
Conflict of interest: M. Nemoto has nothing to disclose.
Conflict of interest: K. Nakashima has nothing to disclose.
Conflict of interest: S. Noma has nothing to disclose.
Conflict of interest: Y. Matsue has nothing to disclose.
Conflict of interest: K. Yoshida has nothing to disclose.
Conflict of interest: H. Matsui has nothing to disclose.
Conflict of interest: A. Shiraishi has nothing to disclose.
Conflict of interest: T. Ishifuji has nothing to disclose.
Conflict of interest: K. Morimoto reports grants and personal fees from Pfizer, and personal fees from MSD Pharma, outside the submitted work.
Conflict of interest: K. Ariyoshi has nothing to disclose.
Conflict of interest: M. Aoshima has nothing to disclose.
- Received February 19, 2020.
- Accepted July 19, 2020.
- Copyright ©ERS 2020
This article is open access and distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0.