Abstract
Single nucleotide polymorphisms (SNPs) in various genes have been shown to associate with COPD, suggesting a role in disease pathogenesis. Sulfatase modifying factor (SUMF1) is a key modifier in connective tissue remodelling, and we have shown previously that several SNPs in SUMF1 are associated with COPD. The aim of this study was to investigate the association between SUMF1 SNPs and advanced lung function characteristics.
Never-, former and current smokers with (n=154) or without (n=405) COPD were genotyped for 21 SNPs in SUMF1 and underwent spirometry, body plethysmography, diffusing capacity of the lung for carbon monoxide (DLCO) measurement and impulse oscillometry.
Four SNPs (rs793391, rs12634248, rs2819590 and rs304092) showed a significantly decreased odds ratio of having COPD when heterozygous for the variance allele, together with a lower forced expiratory volume in 1 s (FEV1) and FEV1/forced vital capacity (FVC) ratio and an impaired peripheral resistance and reactance. Moreover, individuals homozygous for the variance allele of rs3864051 exhibited a strong association to COPD, a lower FEV1/FVC, FEV1 and DLCO, and an impaired peripheral resistance and reactance. Other SNPs (rs4685744, rs2819562, rs2819561 and rs11915920) were instead associated with impaired lung volumes and exhibited a lower FVC, total lung capacity and alveolar volume, in individuals having the variance allele.
Several SNPs in the SUMF1 gene are shown to be associated with COPD and impaired lung function. These genetic variants of SUMF1 may cause a deficient sulfation balance in the extracellular matrix of the lung tissue, thereby contributing to the development of COPD.
Abstract
This study shows that several SNPs in the SUMF1 gene are associated with COPD and impaired lung function. These SNPs appear in two blocks: block 2 is associated with COPD and airway obstruction, and block 1 is associated with impaired lung volumes. https://bit.ly/3IiZXcC
Introduction
COPD is mainly caused by long-term cigarette smoking [1], but multiple genetic factors may influence predisposition to lung damage and thereby the susceptibility for developing COPD [2]. Given the heterogeneous nature of COPD, it is most likely that COPD patients have different genetic patterns. The decreased lung function seen in COPD patients is mainly due to airway inflammation caused by oxidative stress, which subsequently leads to airways remodelling and tissue destruction [3–6]. Different gene polymorphisms related to these processes are important to investigate to better understand their role in disease development. The most well-known genetic factor associated with COPD is alpha-1 antitrypsin deficiency, which is associated with polymorphism in the SERPINA1 gene [7]. In addition, mainly single nucleotide polymorphisms (SNPs) associated with inflammatory processes [8] and biological stress pathways [9] have been identified to be linked to COPD. Some polymorphisms related to connective tissue remodelling have also been identified, such as matrix metalloproteinase (MMP)-7, which is suggested to influence early development of COPD [10], and MMP-12, which was found to be associated with severe/very severe COPD [11].
In a prior study, we have found several SNPs in sulfatase modifying factor 1 (SUMF1) to be associated with COPD [12]. SUMF1 is the main regulator of all known sulfatases in the body [13], and modifies them into their active state. The role of the different sulfatases is then to remove sulphate from specific sulfated carbohydrate chains, and they thereby have an important role in the delicate balance of connective tissue remodelling. SUMF1 is well known for its implication in multiple sulfatase deficiency [14–18], but Arteaga-Solis et al. [19] observed that Sumf1−/− mice exhibited an emphysema-like pattern in their lungs, due to post-natal alveolarisation arrest. In our previous clinical study, we showed that several SNPs in SUMF1 were associated with COPD, of which rs793391 was the most significant. 12 SUMF1 SNPs were found to be significant by expression quantitative trait loci (eQTL) analysis, and certain splice variants of SUMF1 exhibited decreased expression levels in sputum cells from COPD patients compared to control subjects. In association with the SUMF1 SNP rs11915920, which was a top hit in the eQTL analyses, we found decreased mRNA expression levels in sputum cells and lung fibroblasts in subjects with the variance allele, confirming the results of the lung tissue eQTL analysis [12]. A previous genome-wide association study has also identified SUMF1 to be associated with prominent emphysema, but was not studied further [20].
The main objective of the present study was to investigate the association between SUMF1 SNPs and advanced lung function characteristics. A secondary aim was to verify findings from our previous study in a larger, well-defined cohort and to investigate if additional SNPs in SUMF1 showed an association to lung function and COPD.
Material and methods
Study population
598 patients were recruited from two sites within Skåne University Hospital (Sweden): the lung clinic in Lund and at the clinical physiology department in Malmö. In Lund, smokers and former smokers with or without COPD were recruited [21]. In the Malmö cohort, never-smokers without any reported lung disease or lower respiratory symptoms, smokers and former smokers with and without COPD and subjects with a self-reported diagnosis of chronic bronchitis/emphysema/COPD were included [22].
All subjects signed written informed consent and the regional ethical review board in Lund approved the studies (431/2008 and 786/2003+amendment 101/2015).
Lung function measurements
Spirometry (MasterScreen; Erich Jaeger, Würzburg, Germany), body plethysmography (MasterScreen Body; Erich Jaeger), diffusing capacity of the lung for carbon monoxide (DLCO) (MasterScreen Diffusion; Erich Jaeger) and impulse oscillometry (MasterScreen) were performed after inhalation of β-agonist (Lund: 400 mg salbutamol, HandiHaler using a spacer device; Malmö: 1 mg terbutaline, Turbohaler). All measurements were performed according to manufacturers’ protocols and according to European Respiratory Society/American Thoracic Society recommendations [23] when applicable. Reference values from the Global Lung Function Initiative [24–26] were used to calculate percentage of predicted normal.
Blood samples and SUMF1 genotyping
Whole blood was drawn from all subjects and stored in EDTA tubes at −80°C until analysis. DNA was extracted and genotyped for 21 SNPs in SUMF1: rs11915920, rs12634249, rs1356229, rs137852846, rs137852848, rs137852849, rs137852854, rs2322683, rs2633852, rs2819561, rs2819562, rs2819590, rs304092, rs308739, rs3864051, rs4685744, rs748169616, rs793391, rs794185, rs794187 and rs807785. This was analysed using Agena iPLEX genotyping at the mutation analysis facility at Karolinska University Hospital (Huddinge, Sweden) using iPLEX Gold chemistry and MassARRAY mass spectrometry system [27] (Agena Bioscience, San Diego, CA, USA), as described previously [12].
Statistical analysis
Baseline difference in demographic data and lung function between COPD and controls were analysed using t-test or Chi-squared test. Associations of SNPs to COPD were analysed with multivariable logistic regression models including the SNP in an additive genetic model and were adjusted for confounding factors (age, sex and smoking status). Associations of SNPs to lung function parameters were done with multivariable linear regression, corrected for confounding factors (age, sex, height and smoking status). Figures 1–3 are presented as percentage predicted when applicable, to increase visualisation, and data are analysed with one-way ANOVA for overall comparison among the groups, followed by Bonferroni's multiple comparison test between separate groups.
Results
Subject characteristics
DNA from whole blood from a total of 559 subjects (405 controls and 154 COPD patients) were successfully genotyped and included in the final analysis (39 subjects were excluded due to other lung diseases, unsuccessful lung function performance or unsuccessful SNP analysis). Subjects were divided into two groups, COPD or healthy, according to the Global Initiative for Chronic Obstructive Lung Disease definition [28]. Subjects with COPD were significantly older and had more pack-years of smoking compared to the control group (table 1). In addition, the COPD group consisted of a larger proportion of males and former smokers, and fewer never-smokers and current smokers. As expected, all lung function variables were impaired in COPD compared to controls (table 1).
Genotype frequencies
16 out of the 21 analysed SNPs had a variation among our study subjects, while five SNP genotypes (rs137852846, rs137852848, rs137852849, rs137852854 and rs748169616) were monomorphic, i.e. presenting as homozygous for the reference allele in all subjects, and therefore not included in the analyses.
Genotype frequencies for COPD and controls are presented separately in table 2, and variant allele frequencies of all SNPs are presented in supplementary table S1.
Association to COPD
After adjusting for age, sex and smoking status, five of the SNPs were found to be significantly associated with COPD (table 2). Four of them (rs793391, rs12634248, rs2819590 and rs304092) exhibited a significantly decreased odds ratio of having COPD if heterozygous for the variance allele. Furthermore, when homozygous for the variance allele for any of the aforementioned SNPs, a decreased odds ratio of having COPD was not observed.
rs3864051 displayed the most significant association with COPD, with 11 out of the 13 subjects that were homozygous for the variance allele having COPD. In addition, a tendency towards an increased risk of having COPD was seen in subjects heterozygous for the variance allele rs3864051.
Association with advanced lung function
When examining the different lung function parameters, as expected, the forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC) ratio and FEV1 were similarly associated to the same SNPs as having COPD, showing a lower FEV1/FVC ratio and a lower FEV1 if homozygous for the reference allele of rs793391 (figure 1a and b), rs12634248, rs2819590 and rs304092 compared to being heterozygous for the variance allele (table 3). Additionally, two other SNPs (rs794185 and rs794187) showed a similar association to FEV1, with lower FEV1 if homozygous for the reference allele, but were not associated with having COPD. In conjunction with the association to COPD, lower FEV1/FVC and FEV1 were seen if subjects were homozygous for the variance allele of rs3864051 (table 3 and figure 2a and b).
In contrast, other SNPs were associated with FVC, showing a significantly lower FVC if the variance allele (whether heterozygous or homozygous) for rs4685744, rs2819562, rs2819561 or rs11915920 (table 3 and figure 3a) was present, and a tendency of association also observed for rs2633852. In contrast, subjects heterozygous for the variance allele of rs794185 and rs794187 had a higher FVC if heterozygous for the variance allele (table 3), which followed the pattern of FEV1 for these SNPs.
Besides being associated to FVC, the same SNPs (rs4685744, rs2819562, rs2819561, rs2633852 and rs11915920) were also significantly associated with total lung capacity (TLC), exhibiting a lower TLC if the variance allele was present, and most significant if homozygous for the variance allele (table 4 and figure 3b). These SNPs also showed a similar association to residual volume (RV), but this was not significant, with a lower RV if homozygous for the variance allele. In addition, these SNPs show a higher DLCO/alveolar volume (VA) ratio if homozygous for the variance allele, which is mostly due to a low VA (figure 3c and d), but also a higher DLCO. Consistent with the association between rs3864051 and COPD, a lower DLCO was observed if this SNP was homozygous for the variance allele (figure 2c).
The resistance and reactance results were consistent with the findings of the SNPs associated to COPD and FEV1 (table 5). Total resistance (R5) (possibly here reflecting the peripheral airways as it is in concordance with R5–R20, which is a variable that is commonly taken to represent resistance of peripheral airways), is lower if heterozygous for the variance allele of rs793391 (figure 1c), rs12634248, rs2819590, rs304092, rs794185 and rs794187. A similar pattern was seen in airway reactance (reactance at 5 Hz and reactance area), which was also less impaired if heterozygous for the variance allele of these SNPs (figure 1d). Consistent with its association to COPD and FEV1, rs3864051 exhibited impaired resistance and reactance if homozygous for the variance allele (figure 2d).
There were no associations between rs1356229 or rs308739 and lung function (data not shown).
Discussion
In the present study, several SNPs in the SUMF1 gene were shown to be associated with COPD and with impaired lung function. These SNPs appear in two blocks (supplementary figure S1A) where one block was associated with COPD and airway obstruction, and the other block was associated with impaired lung volumes. One block (block 2) included rs7933191, rs12634249, rs2819590 and rs304092, which were associated with having COPD, affecting FEV1 and FEV1/FVC, and airway resistance and reactance. This was also partly the case for rs794187 and rs794185. The other block (block 1) included rs4685744, rs2819562, rs2819561, rs2633852 and rs11915920, and was associated with impaired FVC, TLC, RV and VA. Furthermore, the most evident SNP associated with COPD was rs3864051, which also exhibited associations with impaired FEV1, FEV1/FVC, DLCO and airway resistance and reactance.
Moreover, we here confirm in a separate, larger number of subjects our previous finding that SNP rs793391 was associated with COPD [12]. Furthermore, we determined additional SNPs in SUMF1 (in block 2) associated with COPD: rs12634248, rs2819590 and rs304092. Similarly, these SNPs, together with rs794185 and rs794187, show associations to FEV1, FEV1/FVC and airway resistance and reactance, typical for an obstructive lung function profile. These SNPs showed weaker association to airway obstruction when the subject was heterozygous for the variance allele, suggesting a protective role for having one variance allele over two reference alleles.
In contrast, the SNPs in the other block (block 1) showed lower lung volumes (FVC, TLC, RV, VA) when heterozygous for the variance allele, and in most cases even more impairment if homozygous for the variance allele. Two of these SNPs (rs11915920 and rs2819562) were included in a previous investigation using an eQTL analysis in lung tissue [12] and showed a very strong association between gene expression of SUMF1 in lung tissue and the genetic polymorphisms. Lower levels of SUMF1 gene expression were seen in subjects having the variance allele, both if being heterozygous and even more prominent in those being homozygous for the variance allele. This might cause an impaired SUMF1 function, and thereby a dysfunctional downstream sulfatase imbalance in the lungs [29], thus resulting in potential extracellular matrix issues. Our findings of lower lung volumes associated to SUMF1 SNPs in this study are in conjunction with the finding of deficient alveolarisation in Sumf−/− mice [19], which might be an explanation for the impaired lung function in these subjects.
Another SNP located between the two defined linkage blocks and strongly associated (OR 14.22) with COPD when homozygous for the variance allele was rs3864051. It has been implicated in longevity [30], and the frequency of homozygosity for the variance allele is rare. In our study, it was found in only 13 out of 555 subjects, but 11 out of these subjects had COPD and only two were healthy. However, this could be a significant risk factor for getting COPD, but since the frequency of being homozygote for this allele is low, more studies are needed to verify our findings.
Variant allele frequencies are similar in the population of the present study and the Database of Genotypes and Phenotypes (supplementary table S1), and most SNPs in the present study follow Hardy–Weinberg equilibrium, except for rs3864051. Only 2.3% of the study population were homozygous for the variance allele (TT), instead of the expected 6% (based on variant allele frequency of 0.24 in our population compared to 0.34 in the database). This could either be due to random error in included genotypes, or a bias in sampling due to COPD patients being less prone to participate in the study. The latter case would suggest even more COPD patients in the group of homozygous for the variance allele, which would strengthen the results of rs3864051 being highly associated with having COPD.
Most of the SNPs are in intron positions (table 2), and only one SNP (rs2819590 in exon 1) is defined as giving rise to missense transcript and potentially affecting the SUMF1 protein directly. In addition, one SNP (2633852) is localised in exon 9 and suggested to be a synonymous, silent mutation in a coding sequence (cds-syn). The role of the two aforementioned SNPs is not known, but they could be in undefined regulatory zones or act to regulate other genes. The associations between lung function and the SNPs in block 2 could also be explained by the fact that they have a neighbouring localisation on the chromosome and are in a nonrandom association of alleles (as shown in the linkage disequilibrium plot in supplementary figure S1B) and thereby a in block with rs2819590.
We have shown previously that the SNP rs793391 was associated with COPD in a selected study population of healthy current smokers and ex-smokers [12]. These findings were confirmed in the present study, but in a broader population including also many subjects who were never-smokers. In a subanalysis, we found that if the associations between the SNP and COPD were not adjusted for smoking status, the results were similar (p=0.012 and the same confidence interval of the odds ratio 0.39–0.89). Hereby we showed that rs793391 was not only associated to COPD in smoking/ex-smoking subjects, but also in never-smokers.
The association between certain SNPs and COPD may also be related to disease severity and progression. Certain SNPs located on chromosome 6 or the HLA-DQB2 gene have been shown to be associated with susceptibility to early (mild/moderate) COPD [31], but only in comparison to never-smoking controls. The population investigated in the present study consists of mainly mild/moderate COPD, and our findings are therefore most representative for this group of patients.
Differences in lung function variables between the SNP genotypes could be read as clinically significant, which can easily be spotted when presenting the B-value, which, for example, for FEV1 was >120 mL for most SNPs (table 3). This is within the recommended range of minimally clinical important difference [32] and suggest SUMF1 genotypes to be a potential biomarker.
Over the past decade, the accessibility of whole-genome sequencing and the increased number of genome-wide association studies have led to a substantial increase in the understanding of genetic variants that play a role in COPD susceptibility and COPD-related phenotypes [2]. Several variants are shown to be associated to COPD, emphysema and/or spirometric values, and specifically some genetic loci have been identified as being associated to DLCO [33]. In the present study, two distinct blocks of SNPs were identified, and one block (block 2) was associated to an obstructive phenotype, while the other block (block 1) was associated with a phenotype with reduced lung volumes. We have shown previously that the SNPs in the latter block were highly significant in eQTL analysis of SUMF1 and were associated with lower levels of SUMF1 mRNA in sputum cells and lung fibroblasts. Therefore, we believe that specific SNPs give rise to certain pathological effects, which may contribute to the heterogeneous nature of COPD.
In conclusion, we have found several SNPs in the SUMF1 gene that are associated with COPD and with impaired lung function. These SNPs appear in two blocks: block 2 is associated to COPD and airway obstruction, and block 1 is associated to impaired lung volumes. We hereby confirm in a larger well-defined cohort, that which we have shown previously in a smaller cohort; thus, we are again proving that these SNPs may have a legitimate role in COPD pathogenesis. Additionally, we see again that the two linkage disequilibrium blocks appear to have different effects on aspects of the disease pathogenesis that we do not yet fully understand. Furthermore, a single SNP (rs3864051) is unconnected and strongly associated to COPD and impaired lung function. These genetic variants of SUMF1 may cause a deficient sulfation balance in the extracellular matrix of the lung tissue, thereby contributing to the COPD disease.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Supplementary material 00668-2021.SUPPLEMENT
Figure S1 00668-2021.FIGURES1
Acknowledgement
The authors would like to thank personnel at the Lung and Allergy Research Unit, Skåne University Hospital (Lund, Sweden) and Clinical Physiology, Skåne University Hospital (Malmö, Sweden) for clinical assistance.
Footnotes
Provenance: Submitted article, peer reviewed.
Conflict of interest: P. Wollmer reports receiving support for the present manuscript from the Swedish Heart and Lung Foundation; payment or honoraria for lectures, presentations, speaker bureaus, manuscript writing or educational events from Chiesi Pharma, outside the submitted work: and a patent issued for a device and method for pulmonary function measurement (no licence). The remaining authors have nothing to disclose.
Support statement: This work was supported by independent grants from the Swedish Heart and Lung Foundation and Region Skåne ALF grants.
- Received December 4, 2021.
- Accepted February 17, 2022.
- Copyright ©The authors 2022
This version is distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0. For commercial reproduction rights and permissions contact permissions{at}ersnet.org