Abstract
Background Age of pubertal onset is associated with lung function in adulthood. However, the underlying role of epigenetics as a mediator of this association remains unknown.
Methods DNA methylation (DNAm) in peripheral blood was measured at age 18 years in the Isle of Wight birth cohort (IOWBC) along with data on age of pubertal events, forced vital capacity (FVC) and forced expiratory volume in 1 s (FEV1) at 26 years. Structural equation models were applied to examine mediation effects of DNAm on the association of age at pubertal events with FVC and FEV1. Findings were further tested in the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort.
Results In the IOWBC, for females, 21 cytosine-phosphate-guanine sites (CpGs) were shown to mediate the association of age at puberty with FVC or FEV1 at 26 years (p<0.05). In males, DNAm at 20 CpGs was found to mediate the association of age at puberty with FVC (p<0.05). At almost all these CpGs, indirect effects (effects of age at pubertal events on FVC or FEV1 via DNAm) contributed a smaller portion to the total effects compared to direct effects (e.g. at cg08680129, ∼22% of the estimated total effect of age at menarche on FVC at age 26 was contributed by an indirect effect). Among the IOWBC-discovered CpGs available in ALSPAC, none of them was replicated in ALSPAC (p>0.05).
Conclusions Our findings suggest that post-adolescence DNAm in peripheral blood is likely not to mediate the association of age at pubertal onset with young adulthood FVC or FEV1.
Abstract
The association between age at pubertal onset and lung function parameters FVC or FEV1 in young adulthood is not likely to be mediated by DNA methylation in peripheral blood https://bit.ly/31G8hDi
Introduction
Lung function parameters, including forced vital capacity (FVC) and forced expiratory volume in 1 s (FEV1), are shown to be associated with the age of menarche [1–4]. Subjects with early menarche have lower FVC and FEV1 in adulthood [1, 2, 4, 5]. A recent Mendelian randomisation study suggests that pubertal timing in general rather than the time of a single pubertal event is associated with lung function [2], implying the necessity to examine the contribution of other pubertal events onset to lung function development in both males and females. Our recent study has shown that age at puberty was associated with FVC in young adulthood in both sexes [6].
A covalent addition of a methyl group to the DNA at a cytosine residue followed by a guanine (CpG site), i.e. DNA methylation (DNAm) at CpG sites, is an important epigenetic mechanism that regulates gene transcription. Multiple studies have demonstrated that DNAm, as an epigenetic mechanism, links ageing, environmental exposure and lifestyle factors to disease and health status [7–10]. Recent studies examined such impact of DNAm via mediation analyses, which offers an opportunity to better understand the role that DNA methylation plays between exposures and health outcomes [11–14]. Studies have shown that DNAm is associated with various traits including age of pubertal onset [15–19]. For females, one study suggested that DNA methylation level might be modulated by age at menarche with later puberty associated with lower DNAm [18]. An association between DNA methylation patterns and lung function measures has also been demonstrated [20, 21].
Given the association between age at pubertal events and lung function, and the association of DNAm with both age of pubertal events and with lung function, this study aimed to ascertain whether DNAm at post-adolescence or close to post-adolescence mediates the association between age of pubertal events and young adulthood lung function parameters FVC and FEV1 (figure 1). To test this, we applied mediation analyses using structural equation modelling [22], using data in the Isle of Wight birth cohort (IOWBC; discovery cohort) [23] and the Avon Longitudinal Study of Parents and Children (ALSPAC) birth cohort (replication cohort) [24–26]. Mediation analysis is a statistical model with a minimum of three variables, in which there is one hypothesised mediator variable (M) transmitting the effect of independent (X) to the dependent variable (Y) [22]. In our study, mediation analysis allows evaluation of indirect and direct effects of an exposure (e.g. age at puberty) on an outcome of interest (e.g. a lung function parameter) (figure 1), where the expected value of the mediator (e.g. DNAm) and the outcome can be expressed as a function of the exposure via a generalised linear model [27].
Methods
Study population in IOWBC
A population-based birth cohort study was initiated in 1989 on the Isle of Wight (IOWBC), UK, to prospectively study the natural history of allergic diseases and associated risk factors [23]. After exclusion of adoptions, perinatal deaths and refusal to follow-up, 1456 newborns were enrolled in the study. In total, 1261 children were followed up at 10, 18 or 26 years (supplementary figure S1). The study was approved by the Isle of Wight Local Research Ethics Committee (06/Q1701/34). The internal review board of the University of Memphis approved the project (FWA00006815). At each follow-up, demographic information was collected, allergic status was assessed, and height, weight, and FVC and FEV1 were measured.
Age at onset of pubertal events
The age at onset of pubertal events was obtained through questionnaires when subjects were interviewed at age 18 years. Questions relating to pubertal events came from the National Institute of Child and Human Development (NICHD) questionnaire from the Study of Early Child Care and Youth Development based on the Pubertal Development Scale (PDS) [28]. For females, pubertal events included body hair growth, breast growth, menarche, skin changes and growth spurt, and for males, pubertal events including body hair growth, facial hair growth, voice deepening, skin changes and growth spurt were the focus. The detailed measurements of pubertal events are described elsewhere [29].
Lung function assessment and potential confounders
Assessment of lung function at 26 years of age was conducted using the KoKo spirometry software package on portable desktop device (PDS instrumentation, Louisville, KY, USA) according to the guidelines of the American Thoracic Society and the European Respiratory Society [30]. The lung function measurements included in this study were FVC and FEV1.
Low birthweight (<2500 g), maternal smoking during pregnancy, asthma status at age 10 years, body mass index (BMI) at age 10 years, socioeconomic status, atopy at age 10 years and personal smoking at age 18 years were included in the analyses as potential confounders (supplementary material section S2).
DNAm and cell type composition
In the IOWBC, DNA was extracted from whole blood using a standard procedure, and DNAm was measured using Infinium HumanMethylation450 or MethylationEPIC BeadChips. Details of DNAm assessment and preprocessing are described in supplementary material section S2. CpGs on autosome chromosomes and M values, calculated as base 2 logit transformed DNAm β values at each CpG, were used in the analyses. In this study, DNAm at age 18 years was included in the analyses.
Heterogeneity in white blood cell composition may confound DNA methylation measurements [31]. Proportions of six cell types, CD4+ T-cells, natural killer cells, neutrophils, B-cells, monocytes and eosinophils, were inferred using the R package minfi [32, 33] and included in the analyses as confounders.
Statistical analyses
Given the differences in age of pubertal onset and in lung function growth between males and females [4, 6], all analyses were stratified by sex. To assess whether the subsample in the study was representative of the IOWBC, one sample t-tests were applied to the key variables, including ages of pubertal events onset, and lung function measures FVC and FEV1 at age 26 years.
The study aimed to examine whether DNAm at age 18 years mediates the association between age of puberty and age 26-year FVC or FEV1 in young adults (figure 1). A set of analyses was carried out to achieve this goal (supplementary figure S2). Firstly, the association of FVC and FEV1 with age of pubertal event onset was assessed using linear regressions. Only pubertal events that passed this screening were included in subsequent analyses. Next, we screened CpG sites from two directions in an effort to include as many CpGs as possible in the assessment of their mediation effects while avoiding significant power loss should we analyse at the genome-scale. One direction is to identify candidate CpGs potentially associated with age at onset of a pubertal event (forward screening), and the other direction is to screen CpGs associated with the two lung function parameters (backward screening). Screening was carried out using the R package ttScreening [34], and CpGs showing association with age at puberty or lung function at age 26 were included in subsequent path analyses. To assess the potential of mediation effects of DNAm, PROC CALIS (SAS 9.4) was applied to evaluate direct and indirect effects (via DNAm) of ages at pubertal events on FVC or FEV1 (figure 2); statistical significance level was set at 0.05.
Replication study in ALSPAC
Pubertal events showing statistically significant indirect effects through DNAm on FVC or FEV1 were further tested in the ALSPAC birth cohort [24–26] (supplementary material section S2). Complete data on demographics, ages of puberty onset (menarche and body hair growth for females, and body hair growth and voice deepening for males), DNAm at ages 15 or 17 years, along with FVC and FEV1 at age 24 years were analysed (n=353 females and 335 males) [35]. The same path analysis modelling with comparable covariates was applied. Statistical significance was set at p-value <0.05.
Results
Population characteristics
The IOWBC subsamples reasonably represented the complete cohort, indicated by statistically insignificant differences between the subsamples and the complete cohort on the variables of interest (all p-values ≥0.05, table 1).
Initial screening for pubertal events
Based on linear regressions, ages at menarche and age at body hair growth in females were associated with both FVC and FEV1, and for males, age of body hair growth, growth spurt, voice deepening and facial hair growth were shown to be associated with FVC only (supplementary table S1). Age of onset for these pubertal events was included in the subsequent mediation analyses.
Mediation analysis through DNAm at CpGs that passed forward screening
In total, 321 unique CpGs for females and 489 unique CpGs for males survived forward screening and were included in path analyses (supplementary material section S3 and figure 3). Results from path analyses in IOWBC indicated that, for females, at 6 and 7 CpGs, DNAm mediated the association of age at menarche with FVC and FEV1 at age 26 years, respectively (p<0.05; supplementary table S2A). For FEV1 at age 26 years, 4 CpGs showed a mediation effect on the association of age at body hair growth with FEV1 (p<0.05; supplementary table S2A), but we did not identify any mediating CpGs for FVC at age 26 years. In total, 14 unique CpGs were identified. At these CpGs, of the total effects (direct + indirect effects), direct effects accounted for a large fraction of the total effect, e.g. at cg08680129, ∼78% of the estimated total effect of age at menarche on FVC at age 26 was contributed by direct effect, and an indirect effect through DNAm only contributed a small portion (supplementary table S2A and figure S3). The small contributions from indirect effects were also reflected by the small ratios of indirect effect over direct effect, e.g. at cg08680128, the ratio is 0.04/0.14=2/7. It is worth noting that using ratios of indirect effect over direct effect is especially informative when the two effects were in opposite directions (e.g. a negative direct effect but positive indirect effect), in which case the ratio was calculated using absolute values of estimated coefficients.
In males, through path analyses, we found that DNAm at 7, 3, 2 and 4 candidate CpGs mediated the association of ages of body hair growth, growth spurt, voice deepening and facial hair growth, respectively, with FVC at age 26 years (p-values <0.05) (supplementary table S2A). In total, 16 unique CpGs demonstrated statistically significant mediation effects. At 10 of the 16 identified CpGs, ratios of indirect effect over direct effect were <0.5, indicating an overall small contribution of DNAm as a mediator on the association of age of pubertal events with FVC.
Mediation analysis through DNAm at CpGs that passed backward screening
Path analyses were applied to 321 unique CpGs that passed backward screening (259 in females; supplementary material section S3 and figure 3). In IOWBC, for females, DNAm at three of the 259 candidate CpGs was shown to mediate the association between age at menarche and FEV1 at age 26 years (p<0.05) (supplementary table S2B). Mediation effects were not identified at any of the candidate CpGs for FVC at age 26 years. For age at body hair growth, DNAm at three and four CpGs mediated its association with FVC and FEV1, respectively (p<0.05) (supplementary table S2B). Altogether, seven unique CpGs showed statistically significant mediation effects, but in most mediation analyses, magnitudes of indirect effects were lower than a half of that for direct effect (in eight of the 10 mediation analyses; supplementary table S2B).
In males, for ages of body hair growth, growth spurt, voice deepening and facial hair growth, mediation effects of DNAm on the association with FVC at age 26 years were found at one CpG site of the 62 candidate CpGs for each of the four pubertal events (p-values <0.05) (supplementary table S2B). At all these four CpGs, contributions of indirect effects to total effects were small, ranging from 25% to 38% (supplementary table S2B) and the ratios of indirect over direct effects were all <0.6.
Comparison between forward screening and backward screening
In females, no common CpGs were observed between the 321 CpGs that survived forward screening and the 259 CpGs that survived backward screening. Similarly, for males, no common CpGs were identified between 489 CpGs that passed forward screening and the 62 CpGs that passed backward screening. When comparing common genes to which these CpGs were mapped between forward screening and backward screening, seven and one common genes were observed in females and males, respectively.
Further assessment of the IOWBC-discovered CpGs in ALSPAC
In females, of the 21 discovered CpGs, indirect effects at 10 CpGs (tables 2 and 3) were consistent with those observed in IOWBC, but none of them were statistically significant. In males, ages of body hair growth and voice deepening were available in ALSPAC. Of the 11 IOWBC-discovered CpGs for these two pubertal events, consistent mediation effects were observed at six CpGs but were statistically insignificant (tables 2 and 3). Overall, indirect effects of these two pubertal events were negligible in ALSPAC, consistent with the findings in the IOWBC. Detailed results are given in the supplementary material section S4.
Discussion
Combining findings in both the discovery and replication cohorts, post-adolescence or close-to-post-adolescence DNA methylation showed negligible mediation effects on the association of age at pubertal events with young adulthood FVC or FEV1. Specifically, indirect effects only represented a small proportion of total effects of age at pubertal events on the two lung function measurements in young adults. Overall, the findings in both cohorts did not support a role for DNAm in mediating the association of pubertal events with FVC or FEV1.
Our findings were drawn based on a rigorous study design with the potential to maximise statistical power. We used a comprehensive screening process to screen pubertal events and CpG sites to maximise statistical power. The two ways of screening for CpG sites, age at puberty → DNAm (forward), DNAm → lung function (backward), allowed us to include as many potentially informative CpG sites (with respect to potential mediation effects) as possible. These carefully designed screening procedures provided us with an opportunity to identify mediation CpGs, if any, with high power. Even with this carefully designed screening process, our data in both cohorts did not identify CpGs showing strong potential as mediators. In addition, the findings between the discovery and replication cohort were consistent and not affected by cell type compositions. Cell type could introduce multicollinearity in path analyses [36], which could lead to an apparent reversal of the direction of indirect effects and changes in the size of indirect effects. In IOWBC, the direction of associations with or without cell type compositions was intact for the indirect effects. For the indirect effects estimated in the ALSPAC cohort, although the direction of associations changed at two CpGs when removing the cell types in the models, since the size of indirect effects was very trivial regardless of the involvement of cell types, we expect the change in direction of association was due to random error.
Although our data did not support the role of DNAm as a mediator between age at puberty and young adult lung function parameters, we cannot rule out the existence of CpGs (but we have not discovered) acting as mediators. One explanation of the lack of association might be that current methods for arrays (Illumina 450K and EPIC arrays) cover only 2–4% of CpG sites across the whole genome [37, 38]. In addition, we used DNAm at CpGs from peripheral blood. DNAm shows strong tissue-specific effects and DNAm in another tissue (e.g. lung) might mediate the association of pubertal event timing with lung function [39]. Our mediation analyses assumed a linear relationship between exposure and outcome variables. However, biological relationships can be complex and nonlinear with unknown forms. More advanced analytical approaches in this area are certainly needed.
Our study has multiple strengths. The first is the utilisation of discovery and replication cohorts, and the results from these two independent cohorts suggest that our identified insignificant mediation effects of DNAm were potentially conclusive. Second, to avoid reverse causation, the mediation analyses were built upon a longitudinal setting, offering an insight to future investigations on possible “causal paths” and helping open the “black box” from age of pubertal events onset to lung function later in life [40]. Third, the carefully designed screening process in the detection of candidate pubertal events and candidate CpGs offered a strong potential to improve statistical power and at the same time reduced the risk of excluding potentially important epigenetic mediators. Via currently available epigenetic assessment platforms of Illumina, results from our carefully designed study support the absence of epigenetic mediation on the association of age at puberty with lung function measurements FVC and FEV1, based on DNAm measured in whole blood. Further assessment is certainly deserved, accompanied by a much denser assessment of DNAm such as pyrosequencing, and by novel analytical approaches such as analysing a set of CpGs in one region as a unified entity. However, genome-scale pyrosequencing with all CpG sites covered is challenging. Novel analytical tools are greatly needed and will allow an in-depth assessment of the role of DNAm in the association of age at puberty with lung function measurements. Furthermore, instead of DNAm in peripheral blood, using DNAm measured in lung tissues or tissues in the respiratory system will enable us to assess whether our insignificant findings were due to tissue-specificity and in the meantime to better reveal underlying biological mechanisms from pubertal events to lung function in young adulthood. Also, in the screening process based on data in the IOWBC, some pubertal events were not associated with FEV1 or FVC. The lack of association between age at those pubertal events and lung function parameters was likely due to the variations caused by recall bias. In IOWBC, ages of onset for each pubertal event were obtained by retrospective assessment at age 18 through questionnaire. The timing order of different pubertal ages in IOWBC had been evaluated in our earlier study, and we showed that the timing order was consistent with other studies [29]. However, compared to puberal events such as menarche that have less recall bias on age of onset [41], ages at other pubertal events, e.g. skin changes, were likely to have more recall bias, potentially leading to insignificant associations with FEV1 or FVC.
Conclusion
Our study suggests that DNA methylation in peripheral blood in post- or late-adolescence might not play a role in mediating the association between age of pubertal events and lung function in young adults.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
Supplementary material 00476-2021.SUPPLEMENT
Acknowledgements
We are extremely grateful to all participants and their families who are followed up in the Isle of Wight 1989 birth cohort and the Avon Longitudinal Study of Parents and Children cohort, which include interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, midwives and nurses. We also appreciate the hard efforts of these two research teams in maintaining the cohorts and collecting data. Our special thanks also go to the High Performance Computing facility provided by the University of Memphis.
Footnotes
Provenance: Submitted article, peer reviewed.
This article has supplementary material available from openres.ersjournals.com
Author contributions: The original concept and design were initiated by Hongmei Zhang and Wilfried Karmaus. Data analyses were performed by Liang Li and Hongmei Zhang. Data interpretation and manuscript writing was led by Liang Li and Hongmei Zhang. Critical comments on the manuscript and final approval of the manuscript were given by all the authors: Liang Li, Hongmei Zhang, John W. Holloway, Susan Ewart, Caroline L. Relton, S. Hasan Arshad and Wilfried Karmaus.
Conflict of interest: L. Li has nothing to disclose.
Conflict of interest: H. Zhang has nothing to disclose.
Conflict of interest: J.W. Holloway has nothing to disclose.
Conflict of interest: S. Ewart has nothing to disclose.
Conflict of interest: C.L. Relton has nothing to disclose.
Conflict of interest: S.H. Arshad has nothing to disclose.
Conflict of interest: W. Karmaus has nothing to disclose.
Support statement: This work was supported by the National Institutes of Health research funds R01AI121226 (H. Zhang and J.W. Holloway) and R01 AI091905 (W. Karmaus). The 10-year follow-up of IOWBC was funded by National Asthma Campaign, UK (Grant No. 364) and the 18-year follow-up by R01HL082925 (S.H. Arshad). The UK Medical Research Council and Wellcome (grant ref. 217065/Z/19/Z) and the University of Bristol provide core support for ALSPAC. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). The sponsor had no role in study design, data collection and analysis, or the preparation of the manuscript. This publication is the work of the authors and H. Zhang will serve as guarantor for the contents of this paper. Funding information for this article has been deposited with the Crossref Funder Registry.
- Received July 25, 2021.
- Accepted December 30, 2021.
- Copyright ©The authors 2022
This version is distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0. For commercial reproduction rights and permissions contact permissions{at}ersnet.org