Kaikki aineistot
Lisää
Abstract Genome-wide association studies have facilitated the discovery of thousands of loci for hundreds of phenotypes. However, the issue of missing heritability remains unsolved for most complex traits. Locus discovery could be enhanced with both improved power through multi-phenotype analysis (MPA) and use of a wider allele frequency range, including rare variants (RVs). MPA methods for single-variant association have been proposed, but given their low power for RVs, more efficient approaches are required. We propose multi-phenotype analysis of rare variants (MARV), a burden test-based method for RVs extended to the joint analysis of multiple phenotypes through a powerful reverse regression technique. Specifically, MARV models the proportion of RVs at which minor alleles are carried by individuals within a genomic region as a linear combination of multiple phenotypes, which can be both binary and continuous, and the method accommodates directly the genotyped and imputed data. The full model, including all phenotypes, is tested for association for discovery, and a more thorough dissection of the phenotype combinations for any set of RVs is also enabled. We show, via simulations, that the type I error rate is well controlled under various correlations between two continuous phenotypes, and that the method outperforms a univariate burden test in all considered scenarios. Application of MARV to 4876 individuals from the Northern Finland Birth Cohort 1966 for triglycerides, high- and low-density lipoprotein cholesterols highlights known loci with stronger signals of association than those observed in univariate RV analyses and suggests novel RV effects for these lipid traits.
Abstract Background: Genome-wide association studies have enabled identification of thousands of loci for hundreds of traits. Yet, for most human traits a substantial part of the estimated heritability is unexplained. This and recent advances in technology to produce high-dimensional data cost-effectively have led to method development beyond standard common variant analysis, including single-phenotype rare variant and multi-phenotype common variant analysis, with the latter increasing power for locus discovery and providing suggestions of pleiotropic effects. However, there are currently no optimal methods and tools for the combined analysis of rare variants and multiple phenotypes. Results: We propose a user-friendly software tool MARV for Multi-phenotype Analysis of Rare Variants. The tool is based on a method that collapses rare variants within a genomic region and models the proportion of minor alleles in the rare variants on a linear combination of multiple phenotypes. MARV provides analyses of all phenotype combinations within one run and calculates the Bayesian Information Criterion to facilitate model selection. The running time increases with the size of the genetic data while the number of phenotypes to analyse has little effect both on running time and required memory. We illustrate the use of MARV with analysis of triglycerides (TG), fasting insulin (FI) and waist-to-hip ratio (WHR) in 4,721 individuals from the Northern Finland Birth Cohort 1966. The analysis suggests novel multi-phenotype effects for these metabolic traits at APOA5 and ZNF259, and at ZNF259 provides stronger support for association (PTG+FI = 1.8 × 10−9) than observed in single phenotype rare variant analyses (PTG = 6.5 × 10−8 and PFI = 0.27). Conclusions: MARV is a computationally efficient, flexible and user-friendly software tool allowing rapid identification of rare variant effects on multiple phenotypes, thus paving the way for novel discoveries and insights into biology of complex traits.
Abstract Background: The prevention of the risk of type 2 diabetes (T2D) is complicated by multidimensional interplays between biological and psychosocial factors acting at the individual level. To address the challenge we took a systematic approach, to explore the bio-psychosocial predictors of blood glucose in mid-age. Methods: Based on the 31-year and 46-year follow-ups (5,078 participants, 43% male) of Northern Finland Birth Cohort 1966, we used a systematic strategy to select bio-psychosocial variables at 31 years to enable a data-driven approach. As selection criteria, the variable must be (i) a component of the metabolic syndrome or an indicator of psychosocial health using WHO guidelines, (ii) easily obtainable in general health check-ups and (iii) associated with fasting blood glucose at 46 years (P < 0.10). Exploratory and confirmatory factor analysis were used to derive latent factors, and stepwise linear regression allowed exploration of relationships between factors and fasting glucose. Results: Of all 26 variables originally considered, 19 met the selection criteria and were included in an exploratory factor analysis. Two variables were further excluded due to low loading (<0.3). We derived four latent factors, which we named as socioeconomic, metabolic, psychosocial and blood pressure status. The combination of metabolic and psychosocial factors, adjusted for sex, provided best prediction of fasting glucose at 46 years (explaining 10.7% of variation in glucose; P < 0.001). Regarding different bio-psychosocial pathways and relationships, the importance of psychosocial factors in addition to established metabolic risk factors was highlighted. Conclusions: The present study supports evidence for the bio-psychosocial nature of adult glycemic health and exemplifies an evidence-based approach to model the bio-psychosocial relationships. The factorial model may help further research and public health practice in focusing also on psychosocial aspects in maintaining normoglycaemia in the prevention of cardio-metabolic diseases.
Abstract Background: There are various maternal prenatal biopsychosocial (BPS) predictors of birth weight, making it difficult to quantify their cumulative relationship. Methods: We studied two birth cohorts: Northern Finland Birth Cohort 1986 (NFBC1986) born in 1985–1986 and the Generation R Study (from the Netherlands) born in 2002–2006. In NFBC1986, we selected variables depicting BPS exposure in association with birth weight and performed factor analysis to derive latent constructs representing the relationship between these variables. In Generation R, the same factors were generated weighted by loadings of NFBC1986. Factor scores from each factor were then allocated into tertiles and added together to calculate a cumulative BPS score. In all cases, we used regression analyses to explore the relationship with birth weight corrected for sex and gestational age and additionally adjusted for other factors. Results: Factor analysis supported a four-factor structure, labelled closely to represent their characteristics as ‘Factor1-BMI’ (body mass index), ‘Factor2-DBP’ (diastolic blood pressure), ‘Factor3-Socioeconomic-Obstetric-Profile’ and ‘Factor4-Parental-Lifestyle’. In both cohorts, ‘Factor1-BMI’ was positively associated with birth weight, whereas other factors showed negative association. ‘Factor3-Socioeconomic-Obstetric-Profile’ and ‘Factor4-Parental-Lifestyle’ had the greatest effect size, explaining 30% of the variation in birth weight. Associations of the factors with birth weight were largely driven by ‘Factor1-BMI’. Graded decrease in birth weight was observed with increasing cumulative BPS score, jointly evaluating four factors in both cohorts. Conclusion: Our study is a proof of concept for maternal prenatal BPS hypothesis, highlighting the components snowball effect on birth weight in two different European birth cohorts.
Abstract Background: Maternal pre-pregnancy body mass index (BMI) is positively associated with offspring birth weight (BW) and BMI in childhood and adulthood. Each of these associations could be due to causal intrauterine effects, or confounding (genetic or environmental), or some combination of these. Here we estimate the extent to which the association between maternal BMI and offspring body size is explained by offspring genotype, as a first step towards establishing the importance of genetic confounding. Methods: We examined the associations of maternal pre-pregnancy BMI with offspring BW and BMI at 1, 5, 10 and 15 years, in three European birth cohorts (n ≤11 498). Bivariate Genomic-relatedness-based Restricted Maximum Likelihood implemented in the GCTA software (GCTA-GREML) was used to estimate the extent to which phenotypic covariance was explained by offspring genotype as captured by common imputed single nucleotide polymorphisms (SNPs). We merged individual participant data from all cohorts, enabling calculation of pooled estimates. Results: Phenotypic covariance (equivalent here to Pearson’s correlation coefficient) between maternal BMI and offspring phenotype was 0.15 [95% confidence interval (CI): 0.13, 0.17] for offspring BW, increasing to 0.29 (95% CI: 0.26, 0.31) for offspring 15 year BMI. Covariance explained by offspring genotype was negligible for BW [−0.04 (95% CI: −0.09, 0.01)], but increased to 0.12 (95% CI: 0.04, 0.21) at 15 years, which is equivalent to 43% (95% CI: 15%, 72%) of the phenotypic covariance. Sensitivity analyses using weight, BMI and ponderal index as the offspring phenotype at all ages showed similar results. Conclusions: Offspring genotype explains a substantial fraction of the covariance between maternal BMI and offspring adolescent BMI. This is consistent with a potentially important role for genetic confounding as a driver of the maternal BMI–offspring BMI association.
Abstract The current epidemics of cardiovascular and metabolic noncommunicable diseases have emerged alongside dramatic modifications in lifestyle and living environments. These correspond to changes in our ”modern” postwar societies globally characterized by rural-to-urban migration, modernization of agricultural practices, and transportation, climate change, and aging. Evidence suggests that these changes are related to each other, although the social and biological mechanisms as well as their interactions have yet to be uncovered. LongITools, as one of the 9 projects included in the European Human Exposome Network, will tackle this environmental health equation linking multidimensional environmental exposures to the occurrence of cardiovascular and metabolic noncommunicable diseases.
Abstract Early childhood growth patterns are associated with adult health, yet the genetic factors and the developmental stages involved are not fully understood. Here, we combine genome-wide association studies with modeling of longitudinal growth traits to study the genetics of infant and child growth, followed by functional, pathway, genetic correlation, risk score, and colocalization analyses to determine how developmental timings, molecular pathways, and genetic determinants of these traits overlap with those of adult health. We found a robust overlap between the genetics of child and adult body mass index (BMI), with variants associated with adult BMI acting as early as 4 to 6 years old. However, we demonstrated a completely distinct genetic makeup for peak BMI during infancy, influenced by variation at the LEPR/LEPROT locus. These findings suggest that different genetic factors control infant and child BMI. In light of the obesity epidemic, these findings are important to inform the timing and targets of prevention strategies.
Abstract Large consortia have revealed hundreds of genetic loci associated with anthropometric traits, one trait at a time. We examined whether genetic variants affect body shape as a composite phenotype that is represented by a combination of anthropometric traits. We developed an approach that calculates averaged PCs (AvPCs) representing body shape derived from six anthropometric traits (body mass index, height, weight, waist and hip circumference, waist-to-hip ratio). The first four AvPCs explain >99% of the variability, are heritable, and associate with cardiometabolic outcomes. We performed genome-wide association analyses for each body shape composite phenotype across 65 studies and meta-analysed summary statistics. We identify six novel loci: LEMD2 and CD47 for AvPC1, RPS6KA5/C14orf159 and GANAB for AvPC3, and ARL15 and ANP32 for AvPC4. Our findings highlight the value of using multiple traits to define complex phenotypes for discovery, which are not captured by single-trait analyses, and may shed light onto new pathways.
Abstract Differences between sexes contribute to variation in the levels of fasting glucose and insulin. Epidemiological studies established a higher prevalence of impaired fasting glucose in men and impaired glucose tolerance in women, however, the genetic component underlying this phenomenon is not established. We assess sex-dimorphic (73,089/50,404 women and 67,506/47,806 men) and sex-combined (151,188/105,056 individuals) fasting glucose/fasting insulin genetic effects via genome-wide association study meta-analyses in individuals of European descent without diabetes. Here we report sex dimorphism in allelic effects on fasting insulin at IRS1 and ZNF12 loci, the latter showing higher RNA expression in whole blood in women compared to men. We also observe sex-homogeneous effects on fasting glucose at seven novel loci. Fasting insulin in women shows stronger genetic correlations than in men with waist-to-hip ratio and anorexia nervosa. Furthermore, waist-to-hip ratio is causally related to insulin resistance in women, but not in men. These results position dissection of metabolic and glycemic health sex dimorphism as a steppingstone for understanding differences in genetic effects between women and men in related phenotypes.
Abstract Physical activity (PA) may modify the genetic effects that give rise to increased risk of obesity. To identify adiposity loci whose effects are modified by PA, we performed genome-wide interaction meta-analyses of BMI and BMI-adjusted waist circumference and waist-hip ratio from up to 200,452 adults of European (n = 180,423) or other ancestry (n = 20,029). We standardized PA by categorizing it into a dichotomous variable where, on average, 23% of participants were categorized as inactive and 77% as physically active. While we replicate the interaction with PA for the strongest known obesity-risk locus in the FTO gene, of which the effect is attenuated by ~30% in physically active individuals compared to inactive individuals, we do not identify additional loci that are sensitive to PA. In additional genome-wide meta-analyses adjusting for PA and interaction with PA, we identify 11 novel adiposity loci, suggesting that accounting for PA or other environmental factors that contribute to variation in adiposity may facilitate gene discovery.
Abstract Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 x 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.