Abstract
We assessed efficacy and effectiveness of pharmacological and nonpharmacological interventions in improving symptom control, functional exercise capacity and quality of life (QoL) in people living with fibrotic interstitial lung disease (ILD).
We summarised evidence from three previous reviews (to June 2014) and conducted an updated search of nine databases and grey literature (2011–2019) (registration: CRD42017065933) for prospective studies of interventions aimed to alleviate symptoms, improve QoL or functional exercise capacity in fibrotic ILD. Data were synthesised through narrative synthesis or meta-analysed as appropriate.
Forty-seven studies with 2527 participants were included. From 22 pharmacological studies of 11 different interventions (n=1683), the most tested interventions were bosentan and sildenafil. From 25 nonpharmacological studies, the most tested intervention was for pulmonary rehabilitation/exercise training (PR) (22 studies, n=748). There was an improvement in 6-min walk distance immediately following PR (six studies; n=200, mean difference (MD) (95% CI) 39.9 m (18.2 to 61.5)), but not longer term (3 or 6 months, four studies; n=147, MD 5.3 m (−12.9 to 23.4). Multiple, varied outcome measures were used (e.g. 37 studies assessing dyspnoea used 10 different scales with a lack of reporting of rate of deterioration in outcomes). Evidence gap mapping highlighted the most and least researched symptoms, as dyspnoea and cough, respectively.
This evidence synthesis highlights overwhelmingly that the most researched symptom is dyspnoea and the strongest evidence base is for short-term PR. The least researched symptom was cough. Research going forward must focus on prioritising and standardising meaningful outcomes and focussing interventions on neglected symptoms.
Abstract
Cough is a neglected symptom in ILD. Future research must prioritise and standardise meaningful outcomes, and focus interventions on neglected symptoms while ensuring dyspnoea is prioritised as a primary end-point for future studies. https://bit.ly/2RNg9KH
Introduction
Rationale
Patients with interstitial lung disease (ILD) can live for many years, with some ILDs being responsive to treatment. However, fibrotic ILDs (such as idiopathic pulmonary fibrosis (IPF)) have a shorter disease trajectory, which can be rapidly advancing [1]. The resulting physical and psychological burden with impact on quality of life (QoL) can be substantial for both patients and carers [2]. A recent pivotal ILD position statement stressed the need to deliver early and effective palliative care and the importance of “living as well as possible” as disease advances [3]. Essential to achieving this is to improve symptom management and QoL as well as improving functional exercise capacity. Previous systematic reviews have summarised interventions to improve symptoms and QoL in ILD [4–6] (see online supplement 1 for further details). As the importance of improving the symptoms, functional exercise capacity and QoL of these patients has become increasingly recognised [7, 8], there has been a surge in intervention studies. There is therefore a need to synthesise previous research with more recently published studies and highlight areas in which we may move research forward in a meaningful way.
Objectives
We aimed to synthesise relevant studies from three previous systematic reviews with more recently published studies and highlight gaps in research through an evidence gap map.
Methods
Protocol and registration
The systematic review was carried out in line with Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines and the protocol is registered on the PROSPERO database (CRD42017065933). The protocol stated that participants with a confirmed diagnosis of ILD would be included. However, after reflection within the Project Advisory Team, to limit heterogeneity, it was decided to focus on fibrotic ILDs only as they all shared the commonality of shorter disease trajectory, severity of symptoms and poor functional exercise capacity.
Eligibility criteria
We included studies recruiting participants with a confirmed clinical or pathological/radiological diagnosis of a fibrotic ILD (including, but not restricted to IPF and nonspecific interstitial pneumonia). We excluded studies in people with connective tissue ILDs or obstructive sleep apnoea and those for which the breakdown of ILD diagnoses was not available. Eligible interventions were any pharmacological or nonpharmacological intervention aimed at managing symptoms or improving QoL (such as oxygen therapy, opioids, corticosteroids and noninvasive ventilation) or functional exercise capacity (as measured by 6-min walk distance (6MWD)). Radical disease modifying interventions, that were evaluated in studies primarily focused on improving survival and lung function, were deemed out of scope. We accepted any comparators/controls. Eligible outcomes were all symptom control outcomes (such as breathlessness, cough and fatigue), QoL outcomes and 6MWD. Prospective efficacy and effectiveness studies of any design were eligible including observational (e.g. cohort studies) and interventional prospective studies (e.g. randomised controlled studies, controlled clinical trials, before-and-after studies).
Information sources
Two authors (AO and SB) independently screened studies that were included in the previous reviews [4–6] for inclusion. In addition, nine electronic databases were searched for journal articles, meeting abstracts, ongoing studies and reviews from 2011 to January 2019 with no language restrictions: Ovid MEDLINE, Ovid Embase, Web of Science Core Collection, Cochrane Library, Centre for Reviews and Dissemination (CRD) databases, National Institutes of Health (NIH) ClinicalTrials.gov, World Health Organization International Clinical Trials Registry Platform (ICTRP), UK Clinical Trials Gateway and PROSPERO.
Reference lists of included studies were screened and experts in the field contacted to identify additional studies. Searches of grey literature, conference proceedings and research in progress databases were also conducted.
Search
A combination of MeSH headings and keywords were used. For example in MEDLINE, the following terms were combined: Lung Diseases, Interstitial/dt, pc, rh, su [Drug Therapy, Prevention & Control, Rehabilitation, Surgery] OR Idiopathic Pulmonary Fibrosis/dt, pc, rh, su [Drug Therapy, Prevention & Control, Rehabilitation, Surgery] OR Idiopathic Interstitial Pneumonias/dt, pc, th [Drug Therapy, Prevention & Control, Therapy] OR interstitial lung disease*.tw OR idiopathic pulmonary fibrosis.tw OR non-specific interstitial pneumonia*.tw OR idiopathic interstitial pneumonia*.tw OR cryptogenic organi* pneumonia*.tw NOT (comment or letter or editorial).pt. The searches were then limited to 2011 to current year. The full search strategy can be found in online supplement 1.
Study selection and data collection process
Studies were selected for inclusion in a two-stage process using predefined criteria. The full literature search results were screened independently by at least two reviewers (JC, EL, AO and SB) in order to identify all citations that potentially met the inclusion criteria, following which the full manuscripts of all selected citations were retrieved and assessed, again by two reviewers, against the inclusion criteria. Studies published as abstracts or conference presentations were only included if there were wsufficient details to allow an appraisal of the methodology and an assessment of the results. Study authors were contacted for missing information and authors of the abstracts published after 2015 were contacted if there were insufficient details to allow its inclusion in the review. Where authors did not respond, only abstracts with enough details of study participants, intervention and results were included. Where studies included a proportion of ineligible lung diseases, or where diagnoses were unclear, we contacted the authors for further information and only included these studies if this information became available. A list of potentially relevant but excluded studies is provided in online supplement 2. An inclusion flow chart and data extraction sheet were developed and used for each paper assessed. All data extraction was conducted independently by at least two reviewers. Disagreements were resolved by consensus or, if necessary, through arbitration by a third reviewer.
Data items
Data extracted included authors, intervention, comparator, diagnostic criteria, study design, quality rating and summary of results.
Risk of bias in individual studies
In assessing study quality, we used the Cochrane risk of bias tool [9] for randomised controlled trials (RCTs) and controlled clinical trials (CCTs), with the risk of selection bias used to establish the overall risk of bias for each study. We used tools developed by the NIH National Heart, Lung and Blood Institute (NHLBI) to assess the quality of other prospective studies, including cohort studies [10].
Summary measures, synthesis of results and analysis
Studies were synthesised through a narrative synthesis, with tabulation of the results of included studies. Data were combined using Review Manager (RevMan, version 5.3) [11].
Studies were examined for clinical, methodological and statistical heterogeneity. Only if sufficient homogeneity existed, was meta-analysis considered appropriate. If data permitted meta-analysis, we assessed statistical heterogeneity of the pooled effect size using the I2 statistic, the Chi-squared test for heterogeneity, and through a visual inspection of forest plots. Possible reasons for heterogeneity were explored, and fixed-effects and random-effects meta-analyses were undertaken as appropriate. The choice of model was determined by the degree of heterogeneity, as judged by the I2 statistic and p-value for the Chi-squared test (a random-effects model was used if p<0.10 and/or I2>50%).
Dichotomous outcome data were expressed as odds ratios or risk ratios. Continuous outcome data were expressed as mean differences (MDs) or standardised mean differences (SMDs) as appropriate. For continuous variables, we considered presenting the results for end-point data or for the change from baseline. All meta-analyses were presented in forest plots, with point estimates and measures of variance (95% confidence intervals).
Results
Study selection
The updated searches identified 18 133 publications after removal of duplicates. Titles/abstracts of these publications were screened for eligibility resulting in exclusion of 17 902. In total, 231 full-text articles were assessed for eligibility. Of these, 58 records (30 studies) were included in data synthesis. The reasons for exclusions of the remaining 173 articles are detailed in figure 1. Overall, 18 records (17 studies) were included from the previous reviews. In total, we included 47 primary studies reported in 76 articles (figure 1).
Study characteristics and risk of bias
A detailed summary of study characteristics is included in online supplement 3. A total of 22 (46.8%) studies were RCTs, 3 (6.4%) were CCTs, 20 (42.3%) were before-and-after studies and 2 (4.3%) were cohort studies.
The included studies analysed 2527 participants in total, 27% to 85% were males and mean ages were from 54 to 86 years. Overall, 33 studies included only people with IPF [12–44]. Of the remaining studies, 23% to 85% had IPF and 7% to 46% had nonspecific interstitial pneumonia (NSIP) (where reported). The mean percentage predicted forced vital capacity ranged from 49.8%–103.6%, and the 6MWD was from 71 to 906 metres.
A total of 22 (46.8%) studies assessed 11 different pharmacological interventions [12–18, 20–25, 39, 42, 43, 45–50] and 25 (53.2%) studies assessed nonpharmacological interventions. Pulmonary rehabilitation (PR) or exercise training was evaluated in 22 studies [27–32, 34–37, 40, 41, 44, 51–59], while 3 studies used other nonpharmacological interventions [2, 26, 38].
Among 22 studies that assessed pharmacological interventions, the comparator was placebo in 10 (45.5%) studies [13, 14, 21–23, 39, 42, 45, 48, 49], while in 3 (13.6%) studies the comparator was another treatment [17, 25, 43]. In six (27.3%) studies, there was no control group [12, 15, 18, 20, 24, 46], while one (4.5%) study had a healthy control group [16]. One study that assessed pharmacological interventions had room air as the control [47], while another study had “no oxygen” as control [50].
Three studies compared nonpharmacological complex interventions with usual care [2, 26, 38]. Among 22 studies that assessed PR or exercise training, one study compared PR in participants with IPF to PR in those with other ILD diagnoses [54], while another study compared PR in participants with IPF with those with sarcoidosis [31]. One study compared PR in participants with IPF with PR in those with COPD from a previous study [58]. Eleven (50%) PR studies had no control group. Three (13.6%) studies compared PR with usual care [28, 32, 35], one study compared PR with a control group that did not involve PR or addition of new medicine [27], one study compared PR with a control group that did not involve any PR [59], and one study compared PR with a control group that only involved regular medical treatment [34]. Two (9.1%) studies compared PR with another intervention [40, 41].
A total of 25 (53.2%) studies did not specify their primary outcome; 12 (25.5%) studies used 6MWD as their primary outcome [13, 22–24, 28, 32, 34, 41, 48, 52, 53, 59]; 4 (8.5%) studies measured cough as primary outcome [21, 39, 42, 49]; 1 (2.1%) study measured QoL as primary outcome [50] and 1 (2.1%) study had both QoL and 6MWD as primary outcomes [29]; 4 (8.5%) studies used other primary outcomes besides cough, QoL and 6MWD [2, 14, 45, 46]. None of the studies examined dyspnoea as their primary outcome.
Table 1 presents study characteristics (study size; population, intervention, comparison, outcomes, study design (PICOS); and follow-up duration) with both a summary of risk of bias and outcomes. Online supplement 3 describes detailed PICOS characteristics of included studies, supplement 4 provides detailed baseline characteristics of included participants, while supplement 5 presents a more detailed account of their quality assessment.
Main findings
Detailed results are available in online supplement 6 with a summary of the main findings presented here.
Pharmacological interventions
Bosentan
Bosentan was assessed in three RCTs (reported in five papers [13, 14, 45, 60, 61]) and 808 participants: BUILD 1 [13, 61] showed no benefit of bosentan orally twice daily (62.5 mg up-titrated to 125 mg twice daily after 1 month) compared with placebo for 6MWD or dyspnoea in 154 participants followed-up to 12 months. These findings were supported by a second larger RCT (BUILD 3) with 616 participants [14]. BUILD 1 [13] found no difference for any domain of the St George's Respiratory Questionnaire (SGRQ) at 12 months, 42% of bosentan-treated participants had an improved 36-item Short-Form Health Survey (SF-36) health transition score compared with 28% of placebo (p=0.084). Subgroup analysis of participants who had undergone diagnostic biopsy, however, favoured bosentan showing a significant beneficial effect on QoL with mean total SGRQ scores favouring bosentan [61]. Significant treatment effects were observed at 12 months in the impact domain of the SGRQ (median treatment effect (MTE) −7.0, p=0.03), physical functioning (MTE 9.3, p=0.04), general health (MTE=9.4, p=0.01) and role emotional domains of the SF-36 (MTE 0, p=0.04) [13]. However, none of these findings were supported in the larger BUILD 3 study [14]. A third RCT reported in two papers [45, 60] in 39 participants receiving similar doses of bosentan or placebo for 16 weeks, found no statistically significant difference in 6MWD or QoL.
Sildenafil
In the review by Bajwah et al. [4], sildenafil was assessed in four studies (one before-and-after study [24] and two RCTs [22, 23], of which one of the RCTs [22] was followed by an open-label study) and 378 participants: the before-and after study [24] found a significant mean (90% CI) improvement in 6MWD of 49.0 m (95% CI 17.5 to 84.0) in 11 participants (receiving between 20 and 50 mg sildenafil three times daily for 12 weeks). However, meta-analysis of data from the two RCTs [22, 23] with 209 IPF participants did not support this finding (effect size 5.25 (95% CI −8.90 to 19.40) I2=56%, p=0.467). One RCT [22] showed less deterioration in dyspnoea in the sildenafil group (receiving 20 mg three times daily for 12 weeks) compared to the placebo group but overall benefit was not supported by meta-analysis of data from Zisman et al. [22] and Jackson et al. [23]: effect size -0.34 (95% CI −0.82 to 0.13, I2=39.5%, p=0.198. Zisman et al. [22] found that SGRQ total score remained stable in the sildenafil group (receiving 20 mg three times daily for 12 weeks), while worsening in placebo group (mean difference (95% CI) −4.08 (−7.3, −0.86)). The SF-36 general health subscore was better preserved in the sildenafil group than placebo (MD (95% CI) 2.86 (0.76, 4.95)). This was not seen during the 12-week open-label phase.
Traditional Chinese medicine
Traditional Chinese medicine (TCM) was assessed in two RCTs [25, 43] and 197 participants: one RCT [25] assessed the therapeutic effects of Feiwei granules in 77 participants. The intervention group (n=62) received Feiwei granules twice a day for 6 months, while the control group (n=15) received Jinshuibao capsules, also a TCM. Participants were followed-up for 6 months. Dyspnoea and SGRQ total score were significantly worse in the Feiwei granules group at baseline, and between-group change in dyspnoea scores from baseline to 3 months was not significant (TCM group MD (sd) −0.6 (0.7), control group MD (sd) −0.3 (0.7); p=0.111). However, between-group change in dyspnoea scores from baseline to 6 months (TCM group MD (sd) −0.8 (0.8), control group MD (sd) 0.1 (1.0); p=0.001) and from 3 months to 6 months (TCM group MD (sd) −0.2 (0.7), control group MD (sd) 0.3 (0.7); p=0.009) showed significant improvement in the intervention group. SGRQ total score change from baseline to 3 months also showed significant improvement (TCM group MD (sd) −8 (14), control group MD (sd) −1 (6); p=0.011). There was no difference in change in 6MWD from baseline to 3 months (TCM group MD (sd) 17 (124), control group MD (sd) 19 (38); p>0.05), baseline to 6 months (TCM group MD (sd) 48 (107), control group MD (sd) 30 (54); p>0.05) and from 3 months to 6 months (TCM group MD (sd) 31 (82), control group MD (sd) 11 (41); p>0.05). The other RCT [43] evaluated the impact of TCM foot bath combined with traditional Chinese and Western medicine nursing in 120 participants. Zeng et al. 2015 [43] randomly assigned participants to four groups. There was significant improvement in anxiety and depression of participants in the combined group (traditional Chinese and Western medicine nursing+TCM foot bath) compared to individual treatments at 6 months (p<0.05).
Riociguat
Riociguat was assessed in one RCT [48] and one before-and-after [46] study and 169 participants: Hoeper et al. [46] carried out a small study involving 22 participants and found a nonsignificant increase in 6MWD (difference (sd): 25 (64), 95% CI −8 to 58) at 12 weeks in those receiving riociguat three times daily. Authors stated that no significant changes compared with baseline were seen for other relevant outcomes but no data were presented. Nathan et al. [48] conducted an RCT involving 147 participants receiving either riociguat three times daily for 26 weeks or placebo. No significant improvement was noted in 6MWD for the intervention group at 26 weeks (MD 21 m, 95% CI: −9 to 52; p>0.2).
Ambulatory oxygen
Oxygen was assessed in two studies (both crossover RCTs) and 87 participants: Visca et al. [50, 62] compared 2 weeks of ambulatory oxygen to no ambulatory oxygen in 76 participants with fibrotic ILD. Health status measured by the King's Brief Interstitial Lung Disease (KBILD) questionnaire was significantly improved in the intervention group for the total score (MD 3.7; 95% CI 1.8, 5.6; p<0.0001), breathlessness and activity domain (MD: 8.6; 95% CI 4.7, 12.5; p<0.0001) and chest symptoms (MD: 7.6; 95% CI 1.9, 13.2; p=0.009) but not for psychological symptoms. Significant improvements were also seen on the University of California San Diego (UCSD) shortness of breath (SOB) questionnaire (p<0.0001) and SGRQ total score (p=0.018); Schaeffer et al. [47] compared oxygen with room air (duration and length of follow-up not reported) in 20 enrolled participants (11 with IPF), and did not find any significant difference in dyspnoea. However, it is unclear how many participants were included in analysis.
Corticosteroids
Corticosteroids were assessed in three nonrandomised studies [15–17] and 60 participants. Hope-Gill et al. [16] conducted a small open-label study of prednisolone (40–60 mg·day−1 for at least 4 weeks) in six participants with capsaicin-induced cough and found a significant reduction in mean (se) visual analogue scale (VAS) score from 7.2 (0.8) to 2.2 (2.5) at 4 weeks (p<0.05). Fiorucci et al. [17] conducted a three-arm before-and-after study of 30 participants receiving prednisolone alone (group 1), prednisolone and cyclophosphamide (group 2) and prednisolone and colchicine (group 3) for 18 months. There were significant improvements in dyspnoea in the prednisolone and colchicine group compared to the other two groups. Baseline dyspnoea mean (sd) was 8.4 (2.5) versus 18 months 6.3 (2.2) (p=0.001). Two participants of group 1 (18%), one patient of group 2 (11%) and eight participants of group 3 (80%) showed a significant decrease in dyspnoea (p=0.001). Papiris et al. [15] conducted a cohort study of 24 participants and found no difference in 6MWD between participants with an acute exacerbation of IPF “ever treated” and “never treated” with steroids. The follow-up duration was not stated.
Proton pump inhibitor therapy
Proton pump inhibitor therapy was assessed in two studies (one RCT [42] and one before-and-after study [20]) and 68 participants: the before-and-after study [20] assessed the effects of omeprazole 40 mg twice daily or lansoprazole 30 mg twice daily, plus ranitidine 300 mg at night for 8 weeks in 14 participants. The median 24-h cough count showed a nonsignificant increase from baseline (p=0.7). The matched placebo RCT assessed the effects of 20 mg omeprazole twice daily in 40 participants. This study found that geometric mean cough frequency at the end of treatment, adjusted for baseline, was 39.1% lower (95% CI 66% lower to 9.3% higher) in the omeprazole group compared with placebo. Similar results were obtained for daytime and night-time cough frequency. There was no clinically meaningful difference for patient-reported symptoms of cough and reflux. There was no significant difference in change in 6MWD.
Thalidomide
Thalidomide was assessed in two studies (an open-label prospective cohort trial and a crossover RCT [12, 39]) and 34 participants. Horton et al. [12] presented data in 11 participants with chronic cough caused by IPF. There was a marked to complete resolution of cough in 10 participants who received thalidomide administered daily in 100–400 mg doses for 3 months. SGRQ data showed significant decrease on the cough question from baseline 4.9 (0.3) to 2.2 (1.6) at 3 months (p=0.03). A crossover RCT of 23 participants was conducted by Horton et al. [39] where 50 mg (increased to 100 mg after 2 weeks if no improvement in cough) of thalidomide was administered orally once daily for 12 weeks in a crossover design with a 2-week washout. There was a significant improvement in cough-specific QoL MD ((95% CI) −11.4 (−15.7 to −7.0) p<0.001) for those taking thalidomide compared with placebo.
PA101 (cromolyn sodium)
PA101 was assessed in one crossover placebo-controlled RCT [19, 49] and 23 IPF participants. A total of 40 mg PA101 (cromolyn sodium) was administered via a high-efficiency nebuliser three times daily for 14 days with washout for 14 days before crossover. The intervention group had significantly reduced cough frequency (ratio of least-squares means 0.67, 95% CI 0.48, 0.94; p=0.0241) but nocturnal cough, cough-specific QoL and KBILD total score were not significantly improved.
VRP700
VRP700 was assessed in one crossover placebo RCT [21] and 20 participants. Participants with cough received either a single inhaled dose of VRP700 (100 mg) or placebo as a single dose intervention with a 7-day washout. VRP700 significantly increased cough frequency within 4 h of treatment compared with placebo (136.8 (95% CI 80.3, 233.1) versus 64.9 (95% CI 38.1, 110.6); p<0.001). Reported cough severity and urge to cough were not significantly different. The authors state that dyspnoea severity at 24 h was significantly better in the VRP700 group but the data were not presented.
Opioids
Opioids were assessed in one before-and-after study [18] and 11 participants: Allen et al. [18] administered single s.c. diamorphine doses to 11 IPF participants and showed a substantial fall in dyspnoea score from mean (sd) baseline 83 (13) to 36 (11) at 15 min and 36 (12) at 30 min (p<0.001). The authors also reported decreased observed anxiety (no details given).
Nonpharmacological interventions
Pulmonary rehabilitation and exercise training
PR and exercise training were assessed by 22 studies (6 RCTs [28, 32, 34, 35, 41, 59], 1 controlled clinical trial [40], 1 prospective cohort study [27] and 14 before-and-after studies [29–31, 36, 37, 44, 51–58]) with 748 participants in total. Among studies assessing PR, six had participants with severe disease (diffusing capacity of the lung for carbon monoxide (DLCO) % pred <45) [31, 37, 40, 55, 57, 59], while one study comprised a mixture of participants with severe and nonsevere disease [29].
The Cochrane Review [6] included five RCTs measuring dyspnoea and combined three of these in a meta-analysis. These were screened and relevant data were extracted. We identified five additional controlled trials with new data on dyspnoea, of which only one could be added to the three RCTs retrieved from the Cochrane meta-analysis, and one prospective cohort study; this was considered inappropriate, as a large proportion of evidence would be omitted.
The PR studies assessed dyspnoea using seven different scales. These included four unidimensional scales such as the Borg scale [86] and modifications of it, activity domain of the SGRQ [87], dyspnoea domain of the chronic respiratory disease questionnaire (CRDQ) [88] and the oxygen cost diagram [89] as well as three multidimensional scales including baseline/transition dyspnoea index (BDI/TDI) [90], the Medical Research Council (MRC) scale [91] and modifications of it, and the UCSD SOB questionnaire [92].
Of the seven controlled studies, three had inconsistent results from the different dyspnoea scales they used [28, 34, 41], while one study [35] found no significant improvement. The remaining three studies did not report any significant between-group difference [32, 40, 59]. Twelve before-and-after studies [29–31, 36, 37, 44, 51–56] assessed dyspnoea. One study [29] found significant differences in the magnitude of change in dyspnoea immediately after PR in participants with MRC dyspnoea grades 2,3, and 5 but not in participants with MRC dyspnoea grade 4. No significant improvement was found on longer follow-up. Five of these studies [30, 36, 44, 54, 56] found a statistically significant improvement in dyspnoea immediately after the intervention but did not assess its long-term sustainability. Chehere et al. [55] used one unidimensional (Borg scale) [86] and one multidimensional (BDI/TDI) [90] method to assess dyspnoea and found contrasting results: results from the BDI/TDI showed significant improvement while that with the Borg scale was not significant. Three other studies [31, 37, 51] found no significant improvement in dyspnoea. Only two studies [52, 53] reported longer follow-up (6 months) with inconsistent results.
Six RCTs measured health-related QoL (HRQoL) immediately following PR [28, 32–35, 41], while four of these studies involved longer follow-up [28, 32, 33, 41]. However, meta-analysis was considered inappropriate. Overall, results were inconsistent with some RCTs finding no statistically significant improvement effect. Longer follow-up showed PR had no significant improvement effect. Ten before-and-after studies measuring HRQoL [29, 30, 36, 37, 44, 53–56, 58] showed inconsistent findings.
A statistically significant difference between PR and control was found for change in 6MWD immediately following the intervention (n=6 studies with 200 participants, MD 39.85 m, 95% CI 18.17 to 61.53, I2 48%, random effects; figure 2). At longer-term follow-up (3 or 6 months), there was no significant difference in change in 6MWD between groups (n=4 studies with 147 participants, MD 5.26 m, 95% CI −12.88 to 23.40, I2 6%, fixed effects; figure 3). 11 of the 14 before-and-after studies reported 6MWD; most studies found improvement in 6MWD immediately following PR. Only three studies [52, 53, 84] reported longer (6-month) follow-up and results were contradictory.
Case conference
One fast-track RCT [2] evaluated the effects of a palliative care case conference in 53 participants. The intervention was a multi-professional, holistic case conference, including symptom control and developing a home-based nurse-led individualised care plan. Mean (sd) palliative care outcome scores at 4 weeks were −5.7 (7.5) fast-track versus −0.4 (8.0) control (mean change difference between the two arms was −5.3 (95% CI −9.8, −0.7) (p=0.02); effect size (95% CI) −0.7 (−1.2, −0.1)). The secondary outcomes of QoL, anxiety and depression were superior in the intervention arm, and none was worse. Qualitative findings corroborated these data.
Patient and Partner Empowerment Programme
A controlled clinical trial [26, 85] evaluated the effect of a short multidisciplinary empowerment programme, Patient and Partner Empowerment Programme in 23 participants. Results showed no statistically significant change in health status, dyspnoea or HRQoL at 3 weeks and at 3 months.
Disease management programme
One RCT [38] evaluated a disease management program in 19 participants with mixed results. Quantitative analysis showed the intervention negatively affected perceptions of physical QoL and a tendency for greater anxiety. The mean end Beck Anxiety Index scores for the intervention were 15.13 (6.92) and control were 8.56 (6.95) (p=0.077) reflecting a nonsignificant increase in anxiety in the intervention group. Mean (sd) end score for the SF-36 physical component showed a statistically significant difference, intervention 31.06 (4.61) versus control 36.04 (4.63) p=0.038, reflecting a negative impact on perceptions of physical HRQoL. However, qualitative analysis found participants valued the intervention and did not feel isolated.
Adverse events
Adverse events were reported in 15 studies assessing pharmacological interventions, namely, oxygen, bosentan, corticosteroids, opioids, proton pump inhibitors, PA101, riociguat, sildenafil and thalidomide [12–15, 17, 22–24, 39, 42, 45, 46, 62, 93, 94] (see online supplement 7 for details). In particular, seven studies described serious adverse events in studies of bosentan [13, 14, 45] sildenafil [22], thalidomide [39], riociguat [46] and oxygen [50], while three studies reported deaths in studies of corticosteroids [15], bosentan [45] and oxygen [50]. Birring et al. [49] and Yu et al. [25] reported that there were no serious adverse events in studies of cromolyn sodium and Feiwei granules respectively. Five PR studies reported that there were either no serious adverse events [34, 57] or no adverse events [29, 36, 41] at all.
Discussion
We have conducted the most comprehensive evidence synthesis to date providing up-to-date evidence on pharmacological and nonpharmacological interventions to improve symptom control, functional exercise capacity and QoL for people living with fibrotic ILD. To aid reflection on the evidence base for practitioners, policy makers and researchers, we have mapped our findings on an evidence gap map (figure 4).
Pharmacological
The most tested pharmacological interventions were bosentan and sildenafil. Data on pharmacological interventions to improve symptoms, particularly dyspnoea, continue to be limited. Of the drug studies, there were only four studies (n=106 participants) with a primary outcome of cough [19, 21, 39, 42], and no study with a primary outcome of dyspnoea.
Interestingly, current ILD clinical guidelines [7, 95] recommend the use of a number of pharmacological measures (including opioids and benzodiazepines) for symptomatic management of ILD despite limited evidence in this group. However, a recent important study by Faisal et al. [96] suggests that despite very different pathological processes, ILD and COPD patients’ perceptual responses (including intensity and quality) to dyspnoea are similar. In addition, there are likely to be common mechanisms of dyspnoea. Therefore, we would suggest that it is reasonable to extrapolate evidence of effectiveness of pharmacological measures that act centrally to improve dyspnoea (e.g. opioids and benzodiazepines) from COPD to ILD. However, we would recommend a word of caution; while evidence of effectiveness may be transferred, this does not mean that safety profiles of the drugs are the same in both populations, as demonstrated by analyses of opioids and benzodiazepine safety data in COPD [97] and ILD [98].
The British Thoracic Society recently endorsed the occasional use of oxygen by specialist teams when breathlessness is unresponsive to all other treatment [99]. This review shows that there is now some evidence for oxygen in improving symptoms and QoL in ILD patients who desaturate on exercise. However, it remains unclear whether oxygen is any more effective than nonpharmacological interventions such as fans or cognitive behavioural therapy. In light of the many negative psychological, social and financial aspects of oxygen, further research is needed before widescale adoption of oxygen in this group.
Nonpharmacological
As shown by figure 4, the most tested intervention (and for which there is the strongest evidence base) was PR, with 47% of all studies being PR-related. Studies with PR achieved short-term improvements in 6MWD. This contrasts with COPD, where a reduction in dyspnoea is more consistent [100]. The difference in effect for PR in ILD may be related to the more fluctuating and declining nature of the disease. Among studies assessing PR, six had participants with severe disease (DLCO, % pred <45) [31, 37, 40, 55, 57, 59], while one study comprised a mixture of participants with severe and nonsevere disease [29]. Participants with more advanced disease may be less able to participate in PR and therefore it is unreasonable to expect to see the improvement seen in COPD. In ILD, the more appropriate outcome is preservation in symptoms and QoL rather than improvement per se. While no clear improvement in symptoms and QoL of life is seen, a slowing of deterioration in rapidly depreciating disease or the attainment of individualised goals are more meaningful clinical outcomes [101]. We attempted to analyse rate of deterioration of outcomes but were unable to do so, limited by the values presented. Furthermore, a plethora of dyspnoea assessment tools limited generalisability. A total of 37 studies assessed dyspnoea using 10 different scales. Of these, 19 studies were in PR using seven different scales to measure dyspnoea. As noted by Similowski and Serresse [102], dyspnoea is a complex and multidimensional experience that warrants a multidimensional tool to assess the effect of complex interventions. PR may be having a positive effect on the affective dimensions of dyspnoea, which are not captured by unidimensional tools focussing on the symptom of dyspnoea alone (e.g. VAS). In contrast, multidimensional tools (e.g. CRDQ) assess the impact of dyspnoea on various domains such as activities of daily living, emotional and mental functioning, sense of mastery or other person related outcomes. Of note, for the 19 PR studies that assessed dyspnoea, 4 unidimensional and 3 multidimensional dyspnoea measures were used. A review of breathlessness tools by Bausewein et al. [103] notes that there is no one scale that can accurately reflect the far-reaching effects of breathlessness in a patient with advanced disease. They recommend combining a unidimensional scale with either a disease-specific scale or a multidimensional scale in conjunction with other methods such as qualitative methods to fully gauge psychosocial impact of interventions to improve dyspnoea. In this evidence synthesis, only seven PR studies used both unidimensional and multidimensional measures.
Recommendations
The evidence gap in figure 4 demonstrates graphically that dyspnoea is the most researched symptom and cough, the least. Research now needs to focus on pharmacological interventions to improve neglected symptoms such as cough while ensuring that dyspnoea is prioritised as a primary end-point for future studies.
The ongoing presentation of limited data and focus on improvement of outcomes is leading to discounting of the effectiveness of interventions. Given the problem of powering and the multiplicity of uncertainties listed above, an international consensus is needed on the goals of palliative interventions, the ranking of candidate primary end-points in each domain and the selection criteria for studied populations. Failure to reach such consensus may lead to similar difficulties to those in studies of acute IPF exacerbations and ILD associated with occult connective tissue disease. In both areas, lack of standardisation hindered integrating a decade of published data into an evidence base. In both areas, international consensus statements were formulated to confront fragmentation of research. Based on this review, research into palliative care in ILD is arguably in danger of following the same pathway. We call for a major initiative, starting with a Delphi exercise, to agree on an ongoing research agenda.
Strengths and limitations
Strengths of this work include a registered protocol, and a systematic and comprehensive search across multiple databases, inclusive of grey literature, with no exclusions by publication year. All screening, eligibility, data extraction and quality assessment was conducted independently by two authors, and multiple stakeholders (researchers, clinicians and patient representative) contributed to the analysis and interpretation of these data. In addition, we have for the first time, clearly mapped the evidence gap to provide a visual overview of the evidence gap for types of interventions evaluated and outcomes reported. This allows the reader to explore findings and quality of the existing evidence and facilitate informed judgement and evidence-based decision-making. Importantly, the evidence gap map has identified key areas where little or no evidence from research is available and where future research can be focused.
The main limitation of this review is the low quality of the studies included. Many studies were uncontrolled, with small sample sizes and unclear quality aspects. Follow-up was often short and few interventions provided evidence that could be meta-analysed. However, we have only used high-quality studies in the meta-analyses and presented studies clearly to allow readers to draw their own conclusions. To limit heterogeneity, we did not include connective tissue disease-associated ILDs and it is possible that there are further relevant studies available. In addition, we included studies looking at both efficacy and effectiveness as we thought it important to assess and present both. Moreover, although we did not assess for statistical evidence of publication bias, there was evidence of selective reporting where study authors did not provide data for statistically nonsignificant findings.
Conclusion and clinical implications
This evidence synthesis highlights overwhelmingly that the most researched symptom is dyspnoea and the strongest evidence base is for short-term PR. The least researched symptom was cough. The research priorities going forward must focus on prioritising and standardising meaningful outcomes and focussing interventions on neglected symptoms. We call for a sea change in the way we conduct research in this area starting in the first instance with a Delphi exercise concluding in recommendations of a core outcome measure set.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Supplement 1 00107-2020.supplement1
Supplement 2 00107-2020.supplement2
Supplement 3 00107-2020.supplement3
Supplement 4 00107-2020.supplement4
Supplement 5 00107-2020.supplement5
Supplement 6 00107-2020.supplement6
Supplement 7 00107-2020.supplement7
Footnotes
This article has supplementary material available from openres.ersjournals.com
Conflict of interest: S. Bajwah has nothing to disclose.
Conflict of interest: J. Colquitt has nothing to disclose.
Conflict of interest: E. Loveman has nothing to disclose.
Conflict of interest: C. Bausewein has nothing to disclose.
Conflict of interest: H. Almond has nothing to disclose.
Conflict of interest: A. Oluyase has nothing to disclose.
Conflict of interest: M. Dzingina has nothing to disclose.
Conflict of interest: M. Maddocks has nothing to disclose.
Conflict of interest: I.J. Higginson has nothing to disclose.
Conflict of interest: A. Wells has nothing to disclose.
Support statement: This paper presents independent research funded by the National Institute for Health Research (NIHR) under its Research for Patient Benefit Programme (grant reference no. PB-PG-0815-20030). The views expressed are those of the authors and not necessarily those of the National Health Service, the NIHR or the Dept of Health and Social Care. Funding information for this article has been deposited with the Crossref Funder Registry.
- Received February 28, 2020.
- Accepted September 13, 2020.
- Copyright ©ERS 2021
This article is open access and distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0.