Abstract
Chronic breathlessness occurs across many different conditions, often independently of disease severity. Yet, despite being strongly linked to adverse outcomes, the consideration of chronic breathlessness as a stand-alone therapeutic target remains limited. Here we use data-driven techniques to identify and confirm the stability of underlying features (factors) driving breathlessness across different cardiorespiratory diseases.
Questionnaire data on 182 participants with main diagnoses of asthma (21.4%), COPD (24.7%), heart failure (19.2%), idiopathic pulmonary fibrosis (18.7%), other interstitial lung disease (2.7%), and “other diagnoses” (13.2%) were entered into an exploratory factor analysis (EFA). Participants were stratified based on their EFA factor scores. We then examined model stability using 6-month follow-up data and established the most compact set of measures describing the breathlessness experience.
In this dataset, we have identified four stable factors that underlie the experience of breathlessness. These factors were assigned the following descriptive labels: 1) body burden, 2) affect/mood, 3) breathing burden and 4) anger/frustration. Stratifying patients by their scores across the four factors revealed two groups corresponding to high and low burden. These two groups were not related to the primary disease diagnosis and remained stable after 6 months.
In this work, we identified and confirmed the stability of underlying features of breathlessness. Previous work in this domain has been largely limited to single-diagnosis patient groups without subsequent re-testing of model stability. This work provides further evidence supporting disease independent approaches to assess breathlessness.
Abstract
Using machine learning techniques, four underlying factors of breathlessness in cardiorespiratory disease were identified. These underlying factors of the breathlessness experience were similar across diseases and remained stable over time. https://bit.ly/3brtD8c
Introduction
Chronic breathlessness, breathlessness persisting despite optimal treatment, is a central symptom in many conditions, especially in respiratory and cardiac diseases, but also in cancer, neurological diseases and for survivors of coronavirus disease 2019 (COVID-19) [1, 2]. Breathlessness is strongly linked to poorer clinical outcomes, including worse quality of life and increased rates of anxiety and depression [1, 3–7]. While cardiorespiratory physiological mechanisms undoubtably often play a key role in breathlessness, they fail to explain breathlessness in many situations, such as when two individuals with objectively similar disease severities report very different experiences of breathlessness [4, 8]. These discrepancies, alongside the multifaceted and subjective nature of chronic breathlessness, make its assessment and treatment challenging.
A multitude of assessment tools exist to quantify breathlessness. However, the focus of much recent research has been on identifying the “best” measurement tool rather than underlying features (or factors) and their relationship in driving the experience of breathlessness. Where domains of breathlessness have been explored, efforts have typically centred around distinctions between physical sensations of breathlessness i.e. work effort/air hunger, and the comparison of subjective intensity with unpleasantness [9–12]. While a deeper understanding of the sensation of breathlessness itself is clearly important, symptoms need to be contextualised within a person's broader lived experience. For example, clinical guidelines include “affective distress” and “symptom impact/burden” as domains of dyspnoea management [3], highlighting the relevance of anxiety, depression and fatigue, which may be addressed with mind–body interventions [13].
Multi-dimensional, data-driven models offer an opportunity to explore the bi-directional impact of breathlessness on both body and mind, revealing underlying “hidden” factors of breathlessness not previously considered. These underlying factors may not only form the basis of a common descriptive framework for breathlessness across cardiorespiratory disease but could enrich understanding of discordant breathlessness and become key therapeutic targets in their own right. Machine-learning techniques have already been used to identify baseline factors, which together predict treatment response in depression [14, 15] and pain [16, 17]. Similar approaches have been used to identify symptom-based phenotypes in asthma [18] and COPD [19, 20] where clusters of patients for whom breathlessness was linked with underlying mood and affect were identified. The findings of those studies supports our own previous findings, which revealed separable factors centring around mood and affect measures [21–23] and symptom burden measures [21, 22], with further important factors including anticipated and physical capability measures [22]. However, this work focused on identifying factors underling breathlessness within a single disease [22] or between a patient and a control group [21, 23]. The aim of the present study was to address some outstanding questions:
Does a shared description of breathlessness exist across disease diagnoses?
Do weightings on any identified factors predict primary disease diagnosis?
Are factors stable across time?
Methods
This was an analysis of data from a longitudinal study of patients suffering from cardiorespiratory disease and breathlessness in everyday life (approved by the Regional Ethical Board at Lund University (DNr: 2016/16). This body of work uses the dataset of which parts were used in published validation of the Swedish Multidimensional Dyspnea Profile (MDP) [24], the Dyspnoea-12 [25], and the instruments’ clinical feasibility and minimal clinically important differences [26]. The present analyses are novel and not previously reported.
Participants
182 participants (97 female, median age 72 years (range 19–91 years), asthma (21.4%), COPD (24.7%), heart failure (19.2%), idiopathic pulmonary fibrosis (18.7%), other interstitial lung disease (2.7%), and “other diagnoses” including depression, cancer, diabetes and renal failure (13.2%); table 1) were recruited from five outpatient clinics [24]. Inclusion criteria were: age 18 years or older, documented physician-diagnosed chronic cardiorespiratory disease, self-reported breathlessness during daily life defined as an answer “yes” to the question “Did you experience any breathlessness during the last 2 weeks?” and ability to give written informed consent to participate in the study. Exclusion criteria were: inability to write or understand Swedish adequately to participate, cognitive or other inability to participate in the study, or estimated survival of less than 3 months. Of the 182 participants who completed the baseline visit, 144 (79%) provided follow-up data at six months (79 female, median age 72 years (range 20–92 years)).
Demographic information per main cause of breathlessness (total subjects=182)
Measures
Participants attended the clinic for a baseline visit, while repeat data were collected six months after the first visit date via a postal questionnaire. Baseline data included demographics, smoking status, measured height and weight, and self-report questionnaires, which were scored according to their respective manuals and recorded as their appropriate domain scores: COPD Assessment Test (CAT) [27]; Dyspnea-12 (D12) [28]; EuroQol Five Dimensions, Five Levels (EQ-5D-5L) [29]; Functional Assessment of Chronic Illness Therapy (FACIT)–Fatigue Scale; Hospital Anxiety and Depression Scale (HADS) [30]; Multidimensional Dyspnea Profile (MDP) [9]; modified Medical Research Council (mMRC) Breathlessness Scale [31]. Average severity of pain (0–10 numerical rating scale (NRS)) and average severity of breathlessness were measured as “on average during the last two weeks” (Likert scale), along with current severity of breathlessness (0–10 NRS).
Six months after their first visit participants were asked to complete and return a postal questionnaire pack. Questionnaires remained the same as at baseline and participants were asked to additionally rate their change in breathlessness since the first assessment on a seven-point ordinal scale (Global Impression of Change (GIC); where 1=“very much better”, 4=“no change” and 7=“very much worse”) [26].
Analysis
A brief summary of analyses are provided here. Further information on technical details can be found within Supplementary materials.
Factor identification
Exploratory factor analysis (EFA) was used to identify and formalise any common structure underlying responses across clusters of questionnaire measures (table 2). EFA is a model-free process, allowing researchers to examine a dataset without applying a preconceived structure to the result [32–34]. This is sometimes called “unsupervised machine learning”. In EFA, measures are grouped or discarded depending on how much they contributed to any one cluster. The resulting composite scores of each group are classed as a factor. Models were fit using Lavaan version 0.6–1 in R Studio version 1.2.1.
A list of measures included in the exploratory factor analysis model
Participant stratification
Following exploratory factor analysis, each participant received a score (similar to the first component of a principal component analysis across measures within a factor) corresponding to each latent factor. To examine whether natural groupings of participants existed, the participants were stratified based on their factor scores using hierarchical cluster modelling techniques [35, 36]. Hierarchical models were used to reorder participants based on their correlation strengths [35]. Models were programmed using Matlab (MATLAB 2018b, Mathworks, Natick, MA, USA). To determine whether the hierarchical groupings corresponded to disease diagnosis, the percentage probability of each of the disease categories was calculated for each group proportional to group size and compared with chance as shown in Supplementary equation (1).
Longitudinal model validation
To determine whether the exploratory factor analysis model established at baseline was stable six months later we re-examined the factor structure using a confirmatory factor analysis on the follow-up data. Model fit criteria compared the proposed model to a null model.
Model refinement
Following the generation of the factor model we assessed whether low loading items could be removed from the model while maintaining a significant model fit. The process was carried out iteratively and after each item's removal the model fitting procedure was rerun and assessed for significance using the above model selection criteria. Items were removed until the model no longer significantly fit.
Results
Establishing a shared description of breathlessness across disease diagnoses
Of the 25 measures entered into the exploratory factor analysis (table 2), 16 fulfilled the model fit criteria and were retained for model validation. The measures retained were: the EQ-5D-5L index, average severity of pain (0–10 NRS), (FACIT)–Fatigue Scale, CAT total score, HADS anxiety, mMRC, D12 physical, current and average severity of breathlessness (0–10 NRS), MDP A1 (breathing discomfort), MDP A2 (frustration, anger, fear, anxiety and depression), MDP SQ4 intensity (mental breathing effort). The composite scores of the 16 retained measures formed four factors, which were validated according to the model fit criteria (TLI=0.97, root mean square error of approximation (RMSEA)=0.055, SRMR=0.03), shown in figure 1. We assigned descriptive labels to each of the factors as follows:
Factor 1: body burden: comprised EQ-5D-5L-index, pain and fatigue measures, CAT (a disease impact measure), and the SQ4 domain of the MDP (mental breathing effort).
Factor 2: affect/mood: was composed of the anxiety, fear and depression A2 domains of the MDP along with the anxiety subscale of HADS.
Factor 3: breathing burden: was made up of breathlessness at rest and on average, the mMRC, A1 domain of the MDP and the physical domain of the D12.
Factor 4: anger/frustration: consisted of two of the A2 domains of the MDP – anger and frustration.
a) Connectogram. Measures are plotted as a wheel with connections between each measure input to the model shown as a thread. The thickness of the thread provides a visual representation of the strength of the relationship between any two measures (R≥0.35). Coloured dots highlight the common factors underlying groups of measures. Pink dots correspond to Factor 2 (affect/mood), red to Factor 1 (body burden), blue to Factor 3 (breathing burden) and orange to Factor 4 (anger/frustration). b) Factor diagram. The retained measures are shown on the left hand side of the diagram while the contribution of each measure to the resulting factor is shown as a loading weight (rectangular boxes) Factors are shown as ovals with descriptive labels underneath. The covariance across factors is illustrated by the curved lines.
Factors 1 and 3, and factors 2 and 4 showed the strongest covariance (curved lines figure 1) but retained sufficient independence to not collapse into the same factor.
Do weightings on any identified shared factors predict disease diagnosis?
Participants were stratified based on their four composite factor scores from the EFA model fit. A two-group solution was confirmed by MATLAB's evalcluster algorithm as the most distinct and largely seems to correspond to high and low load across the four factors (figure 2).
Clustergram. A matrix of each participants score across the four factors. Factor score is measured in arbitrary units (au). Each of the four factors is displayed along the bottom, with participants forming the y-axis. Factors 1 and 3 correspond to body and breathing burden, while Factors 2 and 4 correspond to affect/mood and anger/frustration. A dendrogram is displayed along the left side and highlights the division of participants into the two groups, higher symptom burden and lower symptom burden.
The two groups were not found to correspond to primary disease diagnosis (figure 3), of which there were six categories: asthma, COPD, heart failure, idiopathic pulmonary fibrosis (IPF), other interstitial lung disease and “other” (including depression, cancer, diabetes and renal failure). However, participants with IPF, other interstitial lung disease or heart failure were more likely when compared to chance (65%, 60% and 61% respectively) to be classified into the lower symptom burden group. While participants with COPD, asthma or a diagnosis of “other” were more likely to be classified into the higher symptom burden group when compared to chance (43%, 50% and 47% respectively).
The relative proportion of each disease category belonging to the higher and lower symptom burden groups shown. Each participant is represented by a single square. Colours encode disease category as follows. Red: idiopathic pulmonary fibrosis; orange: other interstitial lung disease; yellow: COPD; blue: asthma; purple: heart failure; and pink: “other” (including depression, cancer, diabetes and renal failure).
Assessing the stability of factors over time
After six months, the factor model was found to have remained stable according to the confirmatory factor analysis model fit criteria (TLI=0.92, RMSEA=0.086 (marginal fit), SRMR=0.06).
Establishing the simplest informative model
The baseline model was then subjected to an iterative process in which the lowest loading variables were removed. After each cycle of variable removal, the model was retested. Figure 4 illustrates the final “compact” model. The MDP SQ4 (mental breathing effort) was removed along with mMRC, breathlessness at rest and CAT. The final model was found to be significant according to the model fit criteria (TLI=0.96, RMSEA=0.06, SRMR=0.03), and was found to remain stable after six months (TLI=0.93, RMSEA=0.084, SRMR=0.05), although RMSEA was considered to be marginal.
Factor diagram. A compact factor model in which low loading measures were sequentially removed. The retained measures are shown on the left-hand side of the diagram with the contribution of each measure to the resulting factor shown as a loading weight (rectangular boxes). Removed measures are shown in shaded boxes next to the factor to which they previously belonged. Factors are shown as ovals with descriptive labels underneath. The covariance across factors is illustrated by the curved lines.
Discussion
We aimed to answer the following questions: 1) does a shared description of breathlessness exist across disease diagnoses? 2) do weightings on any identified factors predict primary disease diagnosis? 3) are factors stable across time? Our findings go beyond our previous work by showing that underlying factors of the breathlessness experience are similar across diseases and remain stable over time. Using unsupervised machine learning techniques, we identified four key factors underlying the experience of patients with chronic breathlessness. We assigned the key factors the following descriptive labels: body burden, affect/mood, breathing burden and anger/frustration. Together these factors provide a common description of breathlessness across asthma, COPD, heart failure, idiopathic pulmonary fibrosis and other interstitial lung disease. These factors were found to be stable across time but were not predictive of the primary disease category. Instead participants fell into either high or low scorers across the four factors.
Applying machine learning techniques to investigate breathlessness
In this study, we used unsupervised machine learning techniques, a benefit of which is that hypotheses and relationships can be led by the data to reveal associations not previously considered [32, 36]. This exploratory approach does not, however, guarantee a statistically or clinically significant finding. Measures may have too little, or alternatively too much in common with all other measures to form separable factors. An example is the D12 affective score, which was removed from this model as it contributed strongly to both Factor 2 (mood/affect) and Factor 3 (breathing burden). In contrast the COPD assessment test was retained, suggesting that the constructs assessed are common across different cardiorespiratory conditions. These considerations lend confidence to our findings but reinforce the caveats of these techniques: factor analysis builds models based on shared variance and requires linear relationships between variables. Excluded variables not fulfilling those criteria may still be important descriptors of breathlessness. To address this, independent but relevant measures could be included at the point of participant stratification or as an independent validation of group differences.
Latent factors of breathlessness
Parallels can clearly be drawn between the four factors identified in this study and our previous work despite different assessment tools being utilised and its application in a new patient group. In an investigation of breathlessness in COPD we identified the most separable factors to be what a person felt they could or could not do, how their symptoms impacted their lives and their general mood [22]. Two of these factors, corresponding to mood and perceived symptom burden, were identified in a second investigation conducted in individuals with asthma [21]. In this current work, mood/affect and symptom burden were again important factors, but here we were able to separate symptom burden into two factors; one focused on body burden (Factor 1) and a second factor relating to breathing burden (Factor 3). Interestingly, Factor 4, which contained anger/frustration measures did not collapse into Factor 2 (mood/affect), despite strong covariance. Both our previous and current work show measures contributing to factors corresponding to mood and body burden as relevant and distinct, while their strong covariance shows they are not completely independent. This illustrates the value of mechanistic research into this bi-directional relationship, which may become overlooked when investigated using other methods [8, 22, 37].
Stratifying patients across diseases
Taking scores across the four-factor model we were able to split the participant population into two groups: one corresponding to higher scores across the four factors, and one lower scoring group. Again, this is consistent to our previous work which also found a two group structure corresponding to high and low scores across four factors [21, 22]. The current work extends our previous work, as group identity was found to be independent of primary disease diagnosis. This finding highlights that a common psychological reference frame for breathlessness burden could provide an opportunity to address the underlying mechanisms of breathlessness, over-come issues with comorbidities and drive treatment forwards in a more effective and personalised manner, particularly when medical therapies have been optimised. Additionally, the model could also help to contextualise other work that demonstrated a disconnect between physiological and subjective measures of breathlessness across both COPD and interstitial lung disease [10]. Stratification and cluster-based techniques have been used to good effect in other more clinically minded studies. Haldar et al. [18] and Leung et al. [38] were able to identify several different asthma phenotypes using similar methods, but their works were restricted to mainly cardiorespiratory measures such as dosage of inhaled corticosteroids, neutrophil count, chemokine levels and atopy markers in a single disease.
A repeatable and compact measure of breathlessness
A key requirement of any model is that it is stable across time. With this in mind we repeated our assessment of the factor structure on data acquired after six-months using confirmatory factor analysis and found that the factor structure remained stable. Having ascertained that the model was stable, we then sought to determine whether we could remove measures to create a more compact, less burdensome assessment, while maintaining a significant model fit. The iterative process of variable removal revealed only Sensory Quality 4 (SQ4 – mental breathing effort) and CAT could be removed from Factor 1 (body burden), while mMRC and breathlessness at rest could be removed from Factor 3 (breathing burden). The final compact model structure (figure 4) remained significant after testing on the six-month dataset despite the natural drop-out of participants over the 6-month period.
Clinical implications
Both the Global Initiative for Chronic Obstructive Lung Disease and American Thoracic Society statements [3, 13] have highlighted a clear need for a broader framework within which to describe and treat breathlessness. In introducing the rational for multidimensional models of breathlessness we highlighted the need to avoid burdening both participant and clinician with extensive questionnaires. Thus, a balance must be established, and the most relevant factors retained for clinical use. To achieve this, firstly the ability of factor scores to predict clinical outcomes should be assessed, and secondly, a randomised interventional study could target underlying factors and examine change scores across clinical outcome measures.
Further considerations and limitations
In this work, we have identified stable factors across different disease populations that we hypothesise capture important self-report aspects of the lived experience of breathlessness. However, before firm conclusions as to the utility of this model can be drawn, we must address several questions. Firstly, do different weightings across the factors link with or predict relevant outcome measures? Are there different mechanisms underlying group identity? And finally, are these groups a basis for personalised treatment pathways? To answer these questions we would need more detailed outcome measures and in depth physiological characterisation. In this work we were restricted in our ability to test for generalisability of the models. A larger sample size or second, independent dataset would have enabled us to divide the dataset into two and examine whether the models generalised to new datasets. Thus, the broad relevance of these models should be examined by future studies. Future models should also consider building in the multiple comorbidities common to patients with chronic breathlessness. In this work, we were restricted by sample size, and so individuals were labelled according to only their primary diagnosis, thus restricting the investigation of comorbidities influence on symptom burden. However, with a larger sample size it may be possible to examine whether particular comorbidities affect group identity or factor weightings. Additionally, physiology, duration of illness and patterns of breathlessness may be important contributors to any description of breathlessness or a relevant outcome measure, but of the measures collected, none were suitable for use across all the clinical groups. Those that were collected would have likely biased the model towards exclusively detecting disease, for example cardiac left ventricle ejection fraction would be more relevant for heart failure than asthma. Future work may consider creating disease specific longitudinal physiological burden scores, which could then be translated across different diseases. However, as demonstrated by Faisal et al. [10], differences in physiological burden across COPD and interstitial lung disease did not explain reported breathlessness. The difficulty of incorporating physiology into such models highlights the potential for compact patient-reported tools in examining the drivers of breathlessness across disease diagnoses.
Conclusions
We have shown using machine learning techniques that a shared description of breathlessness underlies patient reports of breathlessness. These latent factors were not related to primary disease diagnosis and remained stable over time. This structure should now be investigated for clinical utility by interventional studies focused on targeted treatments for specific domains of breathlessness.
Acknowledgements
The authors extend their warm thanks to the staff conducting the study, to Hans Bornefalk and Anna Hermansson Bornefalk who made important contributions regarding the statistical aspects of the project and database management, and to all patients who participated to make this research possible.
Footnotes
This article has supplementary material available from openres.ersjournals.com
Author contributions: S.L. Finnegan: study design for this analysis project, analysis of data, interpretation, and drafting, editing and approving manuscript. K.T.S. Pattinson: study design for this analysis project, interpretation, editing and approving manuscript, and supervision of analysis. J. Sundh, M. Sköld, C. Janson, A. Blomberg and J. Sandberg: data collection, and editing and approving manuscript. M. Ekström: original study design, data collection, interpretation, and editing and approving manuscript.
Conflict of interest: S.L. Finnegan has nothing to disclose.
Conflict of interest: K.T.S. Pattinson has a UK patent application entitled “Use of cerebral nitric oxide donors in the assessment of the extent of brain dysfunction following injury” pending.
Conflict of interest: J. Sundh has nothing to disclose.
Conflict of interest: M. Sköld has nothing to disclose.
Conflict of interest: C. Janson has nothing to disclose.
Conflict of interest: A. Blomberg has nothing to disclose.
Conflict of interest: J. Sandberg has nothing to disclose.
Conflict of interest: M. Ekström has nothing to disclose.
- Received November 3, 2020.
- Accepted February 15, 2021.
- Copyright ©The authors 2021
This version is distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0. For commercial reproduction rights and permissions contact permissions{at}ersnet.org