Abstract
Introduction Non-small cell lung cancer (NSCLC) is often associated with compromised lung function. Real-world data on the impact of surgical approach in NSCLC patients with compromised lung function are still lacking. The objective of this study is to assess the potential impact of minimally invasive surgery (MIS) on 90-day post-operative mortality after anatomic lung resection in high-risk operable NSCLC patients.
Methods We conducted a retrospective multicentre study including all patients who underwent anatomic lung resection between January 2010 and October 2021 and registered in the Epithor database. High-risk patients were defined as those with a forced expiratory volume in 1 s (FEV1) or diffusing capacity of the lung for carbon monoxide (DLCO) value below 50%. Co-primary end-points were the impact of risk status on 90-day mortality and the impact of MIS on 90-day mortality in high-risk patients.
Results Of the 46 909 patients who met the inclusion criteria, 42 214 patients (90%) with both preoperative FEV1 and DLCO above 50% were included in the low-risk group, and 4695 patients (10%) with preoperative FEV1 and/or preoperative DLCO below 50% were included in the high-risk group. The 90-day mortality rate was significantly higher in the high-risk group compared to the low-risk group (280 (5.96%) versus 1301 (3.18%); p<0.0001). In high-risk patients, MIS was associated with lower 90-day mortality compared to open surgery in univariate analysis (OR=0.04 (0.02–0.05), p<0.001) and in multivariable analysis after propensity score matching (OR=0.46 (0.30–0.69), p<0.001). High-risk patients operated through MIS had a similar 90-day mortality rate compared to low-risk patients in general (3.10% versus 3.18% respectively).
Conclusion By examining the impact of surgical approaches on 90-day mortality using a nationwide database, we found that either preoperative FEV1 or DLCO below 50% is associated with higher 90-day mortality, which can be reduced by using minimally invasive surgical approaches. High-risk patients operated through MIS have a similar 90-day mortality rate as low-risk patients.
Tweetable abstract
Studying the impact of surgical approaches on 90-day mortality on a nationwide database, preoperative FEV1 and/or DLCO <50% is associated with higher 90-day mortality that can be mitigated by the use of minimally invasive surgical approaches. https://bit.ly/3QOl4Jn
Introduction
Non-small cell lung cancer (NSCLC) accounts for around 80% of all lung cancers, with surgical resection being the optimal treatment for early-stage disease [1]. Surgical treatment of NSCLC is associated with 90-day mortality rates that vary between 0 and 5%, depending on the patient's characteristics, tumour extension and surgical management [2]. Previous studies have identified pre-operative forced expiratory volume in 1 s (FEV1) and diffusing capacity of the lung for carbon monoxide (DLCO) as a simple and effective tool for stratifying surgical risk of operable patients, with preoperative FEV1 and/or DLCO below 50% of the predicted value, thus defining high-risk operable patients [3].
After surgical resection of NSCLC, post-operative complications are usually due to post-operative pain, leading to atelectasis and pneumonia, which can lead to acute respiratory distress syndrome and prolonged stay in the intensive care unit, ultimately altering survival and quality of life [2]. Minimally invasive surgery (MIS), including video-assisted thoracic surgery (VATS) and robotic-assisted thoracic surgery (RATS), has been associated with significant improvements in post-operative pain, complications and quality of life compared with open surgery in low-risk patients [4, 5]. These improvements are associated with a similar overall survival and relapse-free survival [5].
In the specific context of high-risk operable patients, MIS may reduce operative risk and allow for anatomic lung resection rather than non-anatomic resection, but real-world data are currently lacking. We therefore hypothesised that MIS might be associated with a survival benefit at 90 days in operable high-risk patients. The objective of this study is to evaluate 90-day mortality as a function of patient surgical risk using the French Epithor database and then to evaluate the impact of MIS on post-operative mortality after anatomic lung resection in high-risk operable patients.
Patients and methods
Study design
We conducted a multicentre retrospective study of the French nationwide prospective database Epithor [6], including all adult patients who underwent anatomical lung resection for primary NSCLC from January 2010 to October 2021 in 83 French thoracic centres (supplementary table S1). Adult patients who underwent an anatomical lung resection for primary NSCLC were included. Patients who had wedge resection or extended resection (to the superior vena cava, superior sulcus, carina, ribs, spine and subclavian vessels) were excluded. Patients with preoperative FEV1 and/or DLCO value below 50% were defined as high-risk operable patients [3]. Patients with both preoperative FEV1 and DLCO above 50% were defined as low-risk operable patients. The co-primary end-points were: 1) the impact of risk status on 90-day mortality and 2) the impact of the surgical approach on 90-day mortality in high-risk operable patients.
Ethical consideration
All data came from the French registry Epithor managed by the French Society of Thoracic and Cardiovascular Surgery [6]. Created in 2002 under a specific authorisation by the National Commission of Information and Liberties (CNIL # 809833), this registry aims at collecting data from every patient who underwent thoracic surgery in France. The study was approved by the Ethical Committee of the French Society of Thoracic and Cardiovascular Surgery (Société Française de Chirurgie Thoracique et Cardiovasculaire, SFCTCV IRB DELIBERE_CS-SFCTCV-2023–05-11_28684_Harry Etienne). Since this was a retrospective study, all data were anonymised and direct patients’ consent was waived.
Surgery
All patients underwent a complete work-up including chest computed tomography (CT), brain imaging (either magnetic resonance imaging or enhanced CT), abdominal imaging (enhanced CT), positron emission tomography–computed tomography (PET-CT) and bronchoscopy. Preoperative assessment was made according to the 2009 European Respiratory Society (ERS)/European Society of Thoracic Surgery (ESTS) guidelines and included a pulmonary function test (PFT) and echocardiography when indicated [1]. After assessing cardiac function, operability was decided on lung function testing (FEV1 and DLCO). If their values were not satisfying to authorise a major lung resection, the assessment would be completed by spiroergometry and/or a single-photon emission computed tomography scan. All patients were then discussed during multidisciplinary team meetings including a pulmonologist, an oncologist, a radiation therapist, a radiologist, a pathologist and a board-certified thoracic surgeon. The surgical approach and type of anatomical resection were left at the discretion of the senior surgeon responsible for the patient. Systematic lymph node dissection was associated with the anatomical lung resection. Post-operative management included daily chest radiograph for early detection of atelectasis and assessment of drain removal.
Data collection
The following data were gathered: age, sex, past medical history, surgical approach, type of anatomical lung resection, peri-operative management (induction chemotherapy or radiotherapy), PFT (FEV1 and DLCO), preoperative staging, post-operative staging, upstaging, pathology results and 90-day mortality (supplementary table S2). The anatomic extent of the tumour was based on the IASLC (International Association for the Study of Lung Cancer) 7th edition tumour, node, metastasis (TNM) classification for NSCLC in patients operated before 2016, and it was based on the IASLC 8th edition TNM classification for patients operated after 2016 [7, 8]. Missing data are summarised in supplementary table S3. The missing data were kept in the database, except for the variable smoking status, which has been replaced by 0 and for the variable age, which has been replaced by the mean age of all patients.
Statistical analysis
Univariate analysis of baseline characteristics by type of risk was performed. Categorical variables were reported as percentages and analysed with Chi-square tests. Continuous data were reported as mean±sd and analysed using independent sample t-tests for normally distributed variables and using Mann–Whitney U-tests otherwise. Survival curves were plotted during the first 90 days following surgery and compared with the logrank test. Univariate analyses were performed by surgical approach between RATS, VATS and open thoracotomy, with one analysis considering all patients and another one considering only high-risk patients. As before, categorical variables were reported as percentages and analysed with Chi-square tests. Continuous data were reported as mean±sd and analysed using one-way ANOVA, for normally distributed variables, and using Kruskal–Wallis tests otherwise. Stepwise multivariable logistic regression analysis was conducted to identify the factors predicting 90-day mortality. Interaction terms between variables were considered and reported whenever needed.
To minimise the effects of confounding factors, a 1:1 propensity score matching (PSM) with a caliper at 0.1 was performed using logistic regression analysis. The surgical approach was grouped into two classes: open surgery and MIS (including VATS and RATS) and used as a dependent variable while the baseline characteristics were included as independent variables. The variables included were sex, age, body mass index (BMI), dyspnoea, smoking status, FEV1 %, DLCO %, type of intervention, tumour size, nodal status and cancer pathology. The c-statistics were calculated as a measure of accuracy of the propensity score prediction. The histogram of the propensity score by type of surgery was plotted (supplementary figure S1). The standardised mean differences before and after matching were reported (supplementary figure S2). The logistic regression model was then used to estimate the association between baseline characteristics, including type of surgery (MIS versus open surgery), and 90-day mortality rate in the PSM sample. Odds ratios, 95% confidence intervals (CIs) and p-values were reported.
We then used artificial intelligence algorithms to investigate whether this method could be used to build a predictive model for 90-day mortality in high-risk patients. The data were randomly divided into training (70% of the data, 3286 observations) and test (30% of the data, 1409 observations) samples. The split was done while maintaining the 90-day mortality rate in each sample. Because the variable of interest is binary, we chose logistic regression, an interpretable random forest classifier, and an optimised random forest classifier [9, 10]. For the interpretable random forest, we set the number of trees to 10 and the maximum depth of each tree to 3 to make its predictions interpretable. The parameters of the optimised random forest were determined by cross-validation search over the parameter settings. The models are fitted to the training set and tested on the test set. The deployment of these algorithms was done in Python using the Scikit-Learn packages.
Results
Overall population
A study flowchart is shown in figure 1. All together 52 240 patients were included in the Epithor database during the study period. After exclusion of 5331 patients who underwent non-anatomical or extended surgery, data from 46 909 patients were used for analysis. There were 7391 patients aged 75 years or more (15.75%). The high-risk group included 4695 patients (10.0%) with preoperative FEV1 and/or preoperative DLCO below 50%, and the low-risk group included 42 214 patients (90.0%) with preoperative FEV1 and DLCO above 50%.
High-risk group
The characteristics of the overall population according to risk groups are shown in table 1. Briefly, as compared to low-risk patients, high-risk patients were characterised by a younger age (63.8±8.8 versus 65.0±9.3 years; p<0.0001), more squamous cell carcinoma (37.6% versus 24.9%; p<0.0001), more open surgery (71.8% versus 61.9%; p<0.0001) and more pneumonectomies (18.7% versus 6.4%; p<0.0001) but a similar rate of nodal upstaging (6.4% versus 6.8%). The rate of 90-day mortality was higher in the high-risk group compared to the low-risk group (5.96% versus 3.18%; p<0.0001). Between 2010 and 2020, 90-day mortality remained higher in the high-risk group throughout the years (figure 2). Survival curves during the first 90 days are shown in figure 3.
Surgical approaches
We then investigated the high-risk group according to the surgical approaches, classified as three categories (open surgery, VATS, RATS, table 2) and then two categories (open surgery, MIS including VATS and RATS, table 3). As compared with open surgery, MIS group was characterised by a lower 90-day mortality (3.1% versus 7.09%, respectively, p<0.0001).
Logistic regression
We then performed a multivariable logistic regression in the high-risk group as shown in table 4. In univariate analysis, factors associated with 90-day mortality in the high-risk group included sex, age, BMI, FEV1, DLCO, Ipal score and surgical approach. In multivariable analysis, the factors associated with 90-day mortality in the high-risk group included sex, DLCO and surgical approach.
Propensity score analysis
We then performed a propensity score analysis (PSM) to investigate the impact of MIS on 90-day mortality of high-risk patients. The results are summarised in table 5. The PSM sample had 2168 patients. The area under the receiver operating curve (ROC) for the predicted propensity score was 0.82. The distribution of propensity score was similar between MIS and open surgery (supplementary figure S1) with a mean±sd score of 0.57±0.20 for MIS and 0.58±0.20 for open surgery. In the PSM sample the 90-day mortality rate was 3.0% for MIS and 6.0% for open surgery. The propensity score matched odds ratio (OR) of 90-day mortality was lower in high-risk patients who received MIS compared to those who received open surgery (OR 0.46; 95% CI 0.30–0.69, p<0.001). The area under the ROC for the predicted 90-day mortality rate (fitted on the PSM sample) was 0.66. High-risk patients operated through MIS had a 90-day mortality rate similar to low-risk patients in general (3.10% versus 3.18% respectively).
In the low-risk group, we did a similar analysis to investigate the impact of MIS on 90-day mortality. The PSM sample had 24 408 patients. The distribution of propensity score was similar between MIS and open surgery with a mean score of 0.46±0.19 for MIS and 0.45±0.20 for open thoracotomy. In the PSM sample the 90-day mortality rate was 0.02±0.14 for MIS and 0.03±0.17 for open thoracotomy. The propensity matched odds ratio of 90-day mortality was, in low-risk patients, in favour of MIS compared to open thoracotomy (OR 0.709; 95% CI 0.602–0.835, p<0.001). The area under the ROC for the predicted 90-day mortality rate (fitted on the PSM sample) was 0.68.
We then performed a propensity score analysis to decipher the impact of MIS on the 90-day mortality of high-risk patients aged 75 years or more. The area under the ROC for the predicted propensity score was 0.77. The PSM sample had 282 patients (550 in the full sample). The distribution of propensity score was similar between MIS and open surgery with a mean score of 0.41±0.19 for MIS and 0.39±0.19 for open surgery. In the PSM sample the 90-day mortality rate was 0.06±0.25 for MIS and 0.11±0.32 for open surgery, but this difference was not statistically significant (p=0.16). Similarly, the propensity matched odds ratio of 90-day mortality was, in high-risk patients aged ≥75 years, in favour of MIS compared to open thoracotomy who received open surgery compared to those who received MIS, but this difference was not statistically significant (OR 0.456; 95% CI 0.172–1.213).
Machine learning algorithms
The ROC, the area under the curve (AUC) and the F1 score (recommended for imbalanced data sets) of the machine learning approaches performed in the high-risk group are summarised in figure 4. As compared with logistic regression associated with PSM, the interpretable random forest classifier was not associated with any improvement in F1 score or AUC, while the optimised random forest classifier was associated with a limited improvement in both values.
Discussion
Studying the impact of surgical approach on 90-day mortality following lung surgery for NSCLC on a nationwide database, we found that patients with FEV1 or DLCO values below 50% preoperatively were experiencing a higher risk of post-operative mortality that can be mitigated through MIS. High-risk patients operated through MIS achieved 90-day mortality rate similar to low-risk patients.
Previous randomised studies have shown the post-operative benefits of VATS surgery in post-operative outcomes in low-risk patients [4, 5, 11]. Bendixen et al. [4] have shown that VATS significantly decreased early post-operative pain and chronic post-operative pain compared to anterior thoracotomy. Long et al. [11] reported no difference in length of stay or any other clinical outcomes. Lim et al. [5] confirmed that compared to open surgery, VATS lobectomy results in better physical function at 5 weeks, shorter post-operative hospital stay, fewer serious adverse events after discharge and less pain. Using a data set of nearly 10 000 patients from the Veterans Health Administration, Heiden et al. [12] developed a surgical quality score named VALCAN-O for patients diagnosed with resectable early-stage NSCLC. The score reflects the risk-adjusted association between five quality metrics including the surgical approach (MIS and open surgery) and the overall survival and recurrence-free survival. In all these trials, description of the patients included suggests that their expected operative mortality was low, as was the observed mortality in both arms. No secondary analysis was done in patients with compromised pulmonary function. Our study was designed to investigate the benefits of MIS in this subgroup of patients.
No standard definition exists for patients at high risk of post-operative mortality following thoracic surgery. Guidelines advocate that patients being considered for thoracic surgery should undergo a comprehensive preoperative risk assessment to explore possible pulmonary and cardiac comorbidities. The ERS/ESTS 2009 guidelines on preoperative assessment before lung resection insist on both FEV1 and DLCO as key markers of post-operative functional status, morbidity and mortality [1]. A multivariable analysis by Ceppa et al. [13] showed that use of thoracotomy, decreasing predicted post-operative (ppo) FEV1 and ppoDLCO were independent predictors of increased post-operative morbidity after major lung resection. Risk prediction models have been used to define high-risk patients, but they often lack external validation and their discrimination ability is questionable [14, 15]. Among risk prediction models, the Thoracoscore was constructed using the Epithor database [16, 17]. Looking for a simple definition of high-risk patients, we decided to use the main criteria of the ACOSOG Z4099/RTOG 1021 and ACOSOG Z4032, both randomised controlled trials [3, 18]. We did not include the minor criteria, as not all those data were necessarily available in the Epithor database. Using this simple risk stratification, we found a significant difference in 90-day mortality between high-risk patients and low-risk patients. The fact that there was a high rate of pneumonectomy in the high-risk group compared to the low-risk group could be a potential bias. It is possibly explained by a higher proportion of squamous cell carcinoma and large cell carcinoma present in the high-risk group: we know that those tumours tend to be more central compared to adenocarcinoma, which are more peripheral. The extent of parenchymal resection was not associated with 90-day mortality in univariate or multivariate analysis in the high-risk group thus relativising the impact of this potential bias. Our study corroborates the results from a study by Donahoe et al. [19] with a much larger sample size. In this retrospective study, 72 high-risk patients were compared to 536 low-risk patients. Rates of overall and pulmonary complications were significantly higher in high-risk patients compared to low-risk patients (p=0.0001). Overall, 90-day survival was significantly lower for high-risk patients who had an open thoracotomy compared to the high-risk patients who had MIS. We found similar trends on a larger scale, reinforcing the generalisability of the results.
Propensity matching reduces the confounding bias when comparing two surgical approaches, but it does not replace a randomised controlled trial. It is a good compromise when study groups differ not only in numbers but also in various demographic characteristics. Using this technique, an analysis of the ESTS database found a significant reduction in post-operative morbidity and mortality in frail patients (older than 70 years, BMI <18.5 kg·m−2 or ASA >2) who had VATS lobectomy instead of open surgery [2]. A subgroup of patients with ppoFEV1 <40% had the same tendency but no mention was made of the DLCO, although it remains a key element in operability assessment as previously mentioned. In our study use of the machine learning approach has been disappointing. As compared to the results for the PSM sample, AI algorithms did not reach any additional predictive value when using the interpretable random forest including 10 trees and a maximum depth of each tree to 3. AI algorithms reached a modest additional predictive value with the optimised random forest, but the results cannot be interpreted due to unintelligible variables. Even though the AUC of the logistic regression is good, the F1 score shows that the raw data is not sufficient to build an effective predictive model. It might be interesting to add a preprocessing step to create more informative feature data and then to test more algorithms. However, the machine learning approach is usually limited by the difficulty to generate synthetic data that prove to be realistic, accurate and to represent the complex state. In this particular field of medicine, the interpretability of a predictive model is required to allow its use in clinical practice.
The main limitation of this study is its retrospective design. The data come from the French national database Epithor, and its quality depends on the accuracy of the submitted data from the individual participating centres. Depending on the centres, the open surgical approach (postero-lateral thoracotomy, axillary thoracotomy or anterior thoracotomy), VATS approach (anterior approach, totally thoracoscopic approach or subxyphoid approach) or RATS approach (three-arm or four-arm) differ. Moreover, no information was available on the conversion rate from VATS to thoracotomy. Lastly, the 90-day mortality might be underreported in surgical databases. The 90-day mortality has indeed been reported to be less important in the surgeon-based Epithor database than in the administration-based PMSI database, but the difference is less pronounced in high-volume centres, and this bias might affect both the low-risk and the high-risk groups in a similar way [20]. We performed a sensitivity analysis using only the high-volume centres and the results were not changed. The large sample size of our study allows us to have valuable information on the management of high-risk patients. Further randomised controlled studies would still be necessary to confirm these results.
Conclusion
Examining the impact of surgical approaches on 90-day mortality using a nationwide database, we found that preoperative FEV1 and/or DLCO below 50% were associated with higher 90-day mortality, which can be reduced by using minimally invasive surgical approaches. High-risk patients who underwent MIS had a 90-day mortality rate similar to that of low-risk patients.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Figure S1 00653-2023.SUPPLEMENT
Figure S2 00653-2023.SUPPLEMENT2
Table S1 00653-2023.SUPPLEMENT
Table S2 00653-2023.SUPPLEMENT2
Table S3 00653-2023.SUPPLEMENT3
Acknowledgement
The authors would like to thank David Grimbert from Mazars for his precious advice on the statistics and for proofreading the manuscript.
Footnotes
Provenance: Submitted article, peer reviewed.
Conflict of interest statement: The statistical analysis in this work was financially supported by the Marc Laskar grant. J-M. Baste and P-B. Pagès receive consulting fees from Medtronic and from Intuitive Surgical. P-A. Thomas is a consultant for Ethicon Endosurgery, AstraZeneca and Europrisme. A. Seguin-Givelet is a speaker for AstraZeneca and Medtronic. H. Etienne, J. Iquille, P.E. Falcoz, L. Brouchet, J-P. Berthet, F. Le Pimpec Barthes, J. Jougon, M. Filaire, V. Anne, S. Renaud, T. D'Annoville, J.P. Meunier, C. Jayle, C. Dromer, A. Legras, P. Rinieri, S. Jaillard, V. Margot, M. Dahan and P. Mordant have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Support statement: This study was supported by the Société Française de Chirurgie Thoracique et Cardio-Vasculaire Marc Laskar grant. Funding information for this article has been deposited with the Crossref Funder Registry.
Ethics statement: The study was approved by the Ethical Committee of the French Society of Thoracic and Cardiovascular Surgery (Société Française de Chirurgie Thoracique et Cardio-Vasculaire, SFCTCV IRB DELIBERE_CS-SFCTCV-2023-05-11_28684_Harry Etienne).
- Received September 4, 2023.
- Accepted November 15, 2023.
- Copyright ©The authors 2024
This version is distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0. For commercial reproduction rights and permissions contact permissions{at}ersnet.org