Abstract
Introduction In recent years, ventilatory efficiency (minute ventilation (V′E)/carbon dioxide production (V′CO2) slope) and partial pressure of end-tidal carbon dioxide (PETCO2) have emerged as independent predictors of postoperative pulmonary complications (PPC). Single parameters may give only partial information regarding periprocedural hazards. Accordingly, our aim was to create prediction models with improved ability to stratify PPC risk in patients scheduled for elective lung resection surgery.
Methods This post hoc analysis was comprised of consecutive lung resection candidates from two prior prospective trials. All individuals completed pulmonary function tests and cardiopulmonary exercise testing (CPET). Logistic regression analyses were used for identification of risk factors for PPC that were entered into the final risk prediction models. Two risk models were developed; the first used rest PETCO2 (for patients with no available CPET data), the second used V′E/ V′CO2 slope (for patients with available CPET data). Receiver operating characteristic analysis with the De-Long test and area under the curve (AUC) were used for comparison of models.
Results The dataset from 423 patients was randomly split into the derivation (n=310) and validation (n=113) cohorts. Two final models were developed, both including sex, thoracotomy, “atypical” resection and forced expiratory volume in 1 s/forced vital capacity ratio as risk factors. In addition, the first model also included rest PETCO2, while the second model used V′E/V′CO2 slope from CPET. AUCs of risk scores were 0.795 (95% CI: 0.739–0.851) and 0.793 (95% CI: 0.737–0.849); both p<0.001. No differences in AUCs were found between the derivation and validation cohorts.
Conclusions We created two multicomponental models for PPC risk prediction, both having excellent predictive properties.
Shareable abstract
Two models with improved ability to predict postoperative pulmonary complications in patients scheduled for lung resection surgery may improve preoperative algorithms as they stratify risk based on more relevant parameters https://bit.ly/3xUzdyY
Introduction
Surgery remains the preferred treatment of early-stage (IA–IIIA) lung cancer due to benefits on long-term survival [1]. However, postoperative pulmonary complications (PPC) remain a major problem within the 30-day postoperative period, contributing to increased morbidity and mortality, decreased quality of life, prolonged duration of hospitalisation and intensive care unit (ICU) stay, and economic burden [2–4]. 30-day postoperative mortality rates after lung resection range from 2.3% to 3% [5, 6] and from 6.6% to 7.5% [4, 7], depending on surgical technique and extent of resection.
A recognised factor associated with PPC is patient preoperative functional status. Guidelines of the European Respiratory Society (ERS) and the American College of Chest Physicians (ACCP) [8, 9] emphasise the important role of spirometry and assessment of diffusing capacity of the lungs for carbon monoxide (DLCO). Patients with impaired lung function may also benefit from risk stratification by cardiopulmonary exercise testing (CPET) [8, 9]. Despite adherence to the ERS/ACCP guidelines, postoperative morbidity and mortality rates remain high, especially when compared to other elective surgeries including reported 30-day mortality rates of 0.15% for cholecystectomy [10] and 0.08% for appendectomy [11].
In recent years, it has been demonstrated that for patients evaluated by CPET, peak oxygen consumption (peak V′O2), previously considered to be the gold standard exercise parameter for risk assessment, is a suboptimal predictor of PPC [7, 12–14]. In contrast, ventilatory efficiency (minute ventilation (V′E)/carbon dioxide production (V′CO2) slope) and partial pressure of end-tidal carbon dioxide (PETCO2) have been demonstrated to be independent predictors of PPC [7, 12, 15, 16]. Indeed, V′E/V′CO2 slope has been proposed to be included in preoperative functional assessments [17].
Single parameters for clinical risk prediction may give only partial information regarding periprocedural hazards. Accordingly, our aim was to identify factors associated with the development of PPC and create prediction models with improved ability to stratify PPC risk in patients scheduled for elective lung resection surgery. We created two scores – the first for patients able to undergo CPET (for patients with available CPET data) and the second for patients unable (or unwilling) to undergo CPET (for patients with no available CPET data).
Methods
Subjects
This post hoc analysis comprised consecutive lung resection candidates (with confirmed or highly suspected lung tumour) from two prior prospective trials conducted at two sites in the Czech Republic (St. Anne's University Hospital in Brno and University Hospital Brno). Inclusion requirements were ability to undergo CPET and age ≥18 years. Exclusion criteria included inoperable tumour (lung resection not performed) and/or contraindication for lung resection due to low predicted postoperative peak V′O2 [8]. Both studies were registered at ClinicalTrials.gov (NCT03498352 and NCT04826575), conducted in accordance with the Declaration of Helsinki, and approved by the local Ethics Committees of St. Anne's University Hospital in Brno (reference No. 19JS/2017; reference No. 2G/2018; reference No. 03G/2021) and University Hospital Brno (references No. 150617/EK and No. 14-100620/EK). All participants provided written informed consent for participation in the study.
Pulmonary function tests
All patients completed pulmonary function tests as reported previously [12] in accordance with the ERS and American Thoracic Society (ATS) technical standards [18, 19]. The tests included DLCO with PowerCube Diffusion+ (Ganshorn Medizin Electronic GmbH, Germany) and spirometry with the ZAN100 device (nSpire Health, Inc., USA). The analysed spirometry measures included forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC) and the FEV1/FVC ratio. FEV1 and FVC were analysed as % of the predicted values, with FEV1/FVC analysed as a ratio. In the 2017–2021 period, the ECCS93 reference values were used [20], while after 2021, the Global Lung Function Initiative (GLI) equations were introduced in both centres [21]. DLCO values were recorded as % of the predicted values; in the 2017–2021 period, the Crapo reference values were used [22], while after 2021, the GLI equations were introduced in both centres [23].
Z-scores for FEV1, FVC and FEV1/FVC were calculated using the official ERS online calculator [24].
Cardiopulmonary exercise testing
All individuals had a symptom-limited CPET as previously reported [12] prior to surgery on an electronically braked bicycle ergometer (Ergometrics 800®, Ergoline, Bitz, Germany) with a 12-channel electrocardiography unit (AT-104®, Schiller AG, Baar, Switzerland). For the analysis of expired gases and quantities, a PowerCube-Ergo® cardiopulmonary exercise system (Ganshorn Medizin Electronic GmbH, Niederlauer, Germany) was used. At the end of the rest period and during maximal effort, blood gas analyses were performed using the ABL90 Flex Plus® (Radiometer Medical ApS, Denmark) system. Carbon dioxide output (V′CO2), V′O2, tidal volume (VT), dead space to tidal volume ratio (VD/VT), breathing frequency (fb), V′E, PETCO2, respiratory exchange ratio and V′E/V′CO2 slope were among the analysed CPET measures.
Postoperative pulmonary complications
Pulmonary complications were defined as in previous studies [4, 12, 15, 25, 26] and assessed from the first 30 postoperative days or from the hospital stay, and included the following conditions: pneumonia (chest radiograph infiltrates+fever and/or leukocytosis/leukopenia and/or purulent sputum production); atelectasis (chest radiograph signs+bronchoscopy with plug being removed); respiratory failure requiring mechanical ventilation (noninvasive or invasive ventilation); adult respiratory distress syndrome (bilateral chest radiograph infiltrates+arterial partial pressure of oxygen/fraction of inspired oxygen <300) [27], prolonged air leak (presence of air leak from the chest tube on the 5th postoperative day) [28] and tracheostomy. In addition, hospital length of stay (LOS), ICU LOS and type (thoracotomy versus video-assisted thoracic surgery (VATS)) and extent of surgical procedure (atypical resection, lobectomy, bilobectomy, pneumonectomy) were recorded. In the multivariate model, “atypical resection” included wedge resection or segmentectomy, while “atypical resection – no” included lobectomy or larger anatomical resection.
Statistical analyses
Categorical parameters were described by absolute and relative frequencies. Independence of two categorical parameters was tested by the Pearson chi square test or the Fisher exact-test. Numerical parameters were described by valid n, mean±SD and median supplemented by the 5th and 95th quantile. Differences between the two groups were tested by the independent t-test or Mann–Whitney U-test, depending on normality of the data that was evaluated by the Shapiro–Wilk test.
Univariate and multivariate logistic regression were used for identification of risk factors for PPC. Statistically significant parameters from basic description (potential confounding factors) were included in univariate regressions. Multivariate models included statistically significant parameters identified by univariate regression. The forward stepwise method for selection was used. Odds ratios (OR) were supplemented by 95% confidence intervals (CI). Spearman correlation coefficients were calculated to assess correlation of these parameters. Statistically significant parameters (p<0.05) from multivariate logistic regression were entered into the final risk prediction models. Two risk models were developed: the first used rest PETCO2 (for patients with no available CPET data); the second used V′E/V′CO2 slope (for patients with available CPET data). Receiver operating characteristic (ROC) analysis with the DeLong test and area under the curve (AUC) were used for comparison of these models (scores). ROC analysis was also used for comparison of results from the derivation and validation cohorts.
Due to the low number of events (deaths) in both cohorts, we were unable to develop any model for prediction of death risk. However, 30-day and 90-day mortality rates were summarised by survival analysis methods. Kaplan–Meier curves were constructed for visualisation of survival probability. Cox models for proportional hazards were used for hazard ratio (HR) calculation.
Analysis was performed using SPSS Statistics 24 and R 4.2.0 software; p-values <0.05 were considered statistically significant.
Results
In total, data from 423 patients were included in the analysis. This dataset was randomly split into the derivation (n=310) and validation (n=113) cohorts on a 3:1 ratio basis. The derivation cohort (three-quarters of dataset) was used for models and risk scores development while the validation cohort (a quarter of dataset) was used to test the models.
Basic characteristics of study cohorts
Basic demographic characteristics of the derivation cohort are presented in table 1. 75 (24.2%) patients from the derivation cohort (n=310) had PPC. Patients with PPC were significantly more likely to have COPD, lobectomy or thoracotomy (all p<0.001) compared to the group without PPC. In contrast, patients without PPC more frequently underwent atypical lung resection. Patients with PPC had significantly worse lung function, lower rest PETCO2, higher V′E/V′CO2 slope and a longer hospital LOS and ICU LOS (table 1).
Univariate logistic regression and ROC analysis – identification of risk factors
A total of 44 factors or parameters were entered in the univariate logistic regression and ROC analysis. Of these, a total of seven factors were included in the multivariate analysis; the other parameters were omitted due to collinearity or correlation of similar parameters (e.g., V′E/V′CO2 slope and V′E/V′CO2 ratio or FEV1 % predicted, FEV1 Z-score and FEV1/FVC). The results of ROC analysis are presented in supplementary table S1.
Multivariate logistic regression – risk scores development
Two final models (multivariate logistic regression) are presented in table 2. Numerical parameters were categorised according to the best cut-off value (FEV1/FVC ≤75%, rest PETCO2 ≤28 mmHg and V′E/V′CO2 slope ≥33) based on ROC analysis and our previous studies [12]. Both models included sex, thoracotomy, resection other than wedge (“atypical“) and FEV1/FVC ratio as risk factors. In addition, the first model also included rest PETCO2, while the second model used V′E/V′CO2 slope from CPET. ORs for risk of PPC for all included factors are presented in table 2.
Calculated risk of PPC during the first 30 postoperative days is summarised in figure 1. The two prediction tools yielded similar results. Women with higher values of FEV1/FVC and rest PETCO2 (or lower values of V′E/V′CO2 slope, respectively) undergoing a wedge resection via VATS approach had the lowest risk of PPC – around 2%. In contrast, men with decreased FEV1/FVC and rest PETCO2 (or higher values of V′E/V′CO2 slope, respectively) undergoing lobectomy or bilobectomy via open thoracotomy had the greatest risk of PPC development – >75% (figure 1). The concordance of the two risk scores was also shown by ROC analysis (figure 2), where AUCs of risk scores were 0.795 (95% CI: 0.739–0.851) and 0.793 (95% CI: 0.737–0.849), respectively (figure 2). There was no significant difference between the AUCs of the two models (p=0.867).
Validation of risk scores
Validation of risk scores was performed on the independent validation cohort (n=113). Comparison of parameters included in the scores for both cohorts is presented in table 3. There were no statistically significant differences between the cohorts. Figure 3a,b summarises the comparison of the discrimination ability of these scores between derivation and validation cohorts. There were no statistically significant differences in AUCs on comparison of the derivation and validation cohorts (p=0.063 and p=0.173, respectively).
90-day overall survival according to PPC
As an additional exploratory analysis, 90-day overall survival was evaluated and patients without PPC were compared with those with PPC. Only two patients (0.9%) without PPC died within the first 90 postoperative days, while eight patients (10.7%) died in the group with PPC. Kaplan–Meier curves are presented in supplementary figure S1. PPC were observed to be a statistically significant risk factor of 90-day postoperative mortality (HR 13.04; 95% CI: 2.77–61.40; p=0.001).
Discussion
We developed two novel risk models for prediction of PPC in lung resection candidates. The two models had comparable predictive properties for both the derivation and validation cohorts. Both models are composed of five parameters (sex, FEV1/FVC, extent of resection, surgical technique and V′E/V′CO2 slope for patients with CPET, and sex, FEV1/FVC, extent of resection, surgical technique and rest PETCO2 if CPET is not available).
As PPC have been found to be the main determinant of 30-day postoperative mortality [3], identification of patients at risk is a key strategy to improve postoperative outcomes. The current ERS/ACCP guidelines recommend FEV1 and DLCO measurements as part of routine preoperative evaluation [8, 9]. While spirometry and DLCO assessment have a reasonable negative predictive value in patients with preserved lung function [29], patients with a predicted postoperative (ppo) FEV1 of <40% or <30% were observed with 16–50% [30, 31] and approaching 60% rates of postoperative mortality [32], respectively. FVC wasn't significantly different between groups with and without PPC. Indeed, FVC is not included in any of the guidelines on preoperative functional assessment [8, 9]. Similarly, a decreased ppo DLCO has a fair predictive value [33], even in patients with a normal FEV1 [34]. In our study, both FEV1 and DLCO were associated with PPC in the univariate model, but compared to FEV1, the Tiffeneau index (FEV1/FVC) had better predictive properties. For this reason and considering the limitations of FEV1 mentioned above, we used FEV1/FVC in the multivariate model instead of FEV1. DLCO failed to predict PPC in the multivariate model.
Peak V′O2 predicts PPC and mortality only if calculated ppo peak V′O2 is <10 mL·min−1·kg−1. Patients with such values are considered inoperable [8, 9]. Values of ppo peak V′O2 >10 mL·min−1·kg−1 are disputed regarding their role in prediction of PPC [7, 12–17]. However, a number of previous studies have demonstrated that both V′E/V′CO2 slope and resting PETCO2 are independent predictors of PPC for lung resection surgery candidates [7, 12, 15–17]. These parameters relate more directly to ventilation, sharing nearly the same physiological determinants, i.e., dead-space ventilation and hyperventilation [7, 12, 15]. In our study, V′E/V′CO2 slope and resting PETCO2 were independently associated with PPC and constituted important components of the newly developed prediction tools.
In previous years, various composite scores have been introduced for PPC prediction based on demographic, lung function and other score-based data [9, 35], the former having an insufficient focus on PPC if viewed from the perspective of current knowledge. In addition, in prior studies scores were constructed with retrospective or registry-based data. In contrast, the new predictive models described herein are based on prospective multicentre data from two separate cohorts/studies. In addition, the incorporation of V′E/V′CO2 slope and PETCO2 measurement introduces more specific parameters associated with PPC; the predictive value of both parameters has been demonstrated in a number of previous studies and replicated in different centres [7, 13–17]. V′E/V′CO2 slope is a potential therapeutic target, though there remains some potential controversy about its role in lung resection surgery [36]. Due to the shared underlying pathophysiology of V′E/V′CO2 slope and rest PETCO2, the two parameters can be used as mutual surrogates [7, 37]. Despite this fact, rest PETCO2 shouldn't be viewed as superior to V′E/V′CO2 slope. Indeed, CPET still has an important role in preoperative assessment as calculated values of ppo peak V′O2 <10 mL·min−1·kg−1 are strongly indicative of PPC and mortality risk [8, 9]. Therefore, the most appropriate role of rest PETCO2 appears to be in those patients unable or unwilling to undergo the test or if CPET is unavailable.
We incorporated surgery-related factors, including technique and extent of resection. The introduction of modern surgical techniques (VATS and robotic-assisted thoracic surgery (RATS)), decreasing volume of lung tissue loss with lung parenchyma-sparing surgery (segmentectomy or atypical/wedge resection) [38] or implementation of the Enhanced Recovery After Thoracic Surgery (ERATS) protocol has clearly resulted in a decreased rate of PPC and postoperative mortality in the last decade [39]. Robotic surgery alone has also decreased the rate of in-hospital postoperative mortality by ∼50%, compared to open thoracotomy [40]. An important consideration is that surgery-related factors are modifiable, i.e. the type of surgery and extent of resection can be discussed within the multidisciplinary team and with the patient.
Interpretation of the calculated risk
An important question is what extent of calculated risk should be perceived as acceptable, increased or prohibitive for lung resection. In a recent study, the risk of PPC in patients with preserved lung function was 9% [29], while it was 14.5–17% or even 25% in less selected populations [3, 4, 12]. In view of these data, we suggest that acceptable risk of PPC shouldn't exceed 30%, while a >50% risk is prohibitive for surgical treatment. Patients with a calculated risk of PPC between 31% and 50% should participate in risk-reducing programmes/strategies. However, we can't give an exact guidance, as the latter should routinely be discussed with the patient and within the interdisciplinary team.
Limitations
The main limitation of our work is that the predictive tools have not yet been evaluated in a larger external validation cohort. A second limitation is that the tools are not suitable for mortality risk assessment as they relate only to PPC, since the limited number of deaths in our cohorts did not permit development of suitable models for mortality prediction. Third, patients with ppo peak V′O2 < 10 mL·min−1·kg−1 have been excluded as they are considered inoperable in agreement with the current guidelines [8, 9]. This might reduce the predictive value of peak V′O2. Fourth, in ∼30% of patients, we lack data on arterial blood gases (ABG). Due to the data incompleteness, we didn't include ABG parameters in the multivariate models. As data for DLCO were recorded only as % of predicted, we were not able to calculate Z-scores for DLCO. Despite these limitations, we believe the models described represent an incremental improvement in PPC prediction and merit consideration for routine use in clinical practice.
Conclusion
We created two multicomponental models/scores for PPC risk prediction, both having excellent predictive properties. We suggest that the models be used in daily clinical practice.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Table S1 00978-2023.SUPPLEMENT
Figure S1 00978-2023.SUPPLEMENT2
Data availability
The datasets generated and analysed during this study are available from the corresponding author on reasonable request.
Acknowledgements
We would like to thank all the participating physicians for their effort and all recruited patients for their willingness to share their data with the scientific community.
Footnotes
Provenance: Submitted article, peer reviewed.
This study is registered at www.clinicaltrials.gov with identifier number NCT03498352.
Ethics statement: This study was approved by the local Ethics Committees of St Anne's University Hospital in Brno (reference numbers 19JS/2017, 2G/2018 and 03G/2021) and University Hospital Brno (reference numbers 150617/EK and No. 14-100620/EK).
Conflict of interest: K. Brat received lecture and consulting fees from Chiesi CZ, Boehringer Ingelheim CZ, Novartis CZ, AstraZeneca CZ and Angelini CZ, outside the submitted work.
Conflict of interest: The other authors (I. Cundrle Jr, P. Homolka, M. Svoboda, L. Mitas, M. Plutinsky, Z. Chovanec and L.J. Olson) have nothing to disclose.
Support statement: The study was funded by Ministry of Health of the Czech Republic research grant number NU21-06-00086. Further support received from Ministry of Health of the Czech Republic – Conceptual Development of Research Organisations (MH CZ-DRO FNBr 65269705). The sponsors had no role in the study design, data collection or analysis and preparation of the manuscript. Funding information for this article has been deposited with the Crossref Funder Registry.
- Received December 8, 2023.
- Accepted April 16, 2024.
- Copyright ©The authors 2024
This version is distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0. For commercial reproduction rights and permissions contact permissions{at}ersnet.org