Kaikki aineistot
Lisää
Abstract Background and aims: Population subgrouping has been suggested as means to improve coronary heart disease (CHD) risk assessment. We explored here how unsupervised data-driven metabolic subgrouping, based on comprehensive lipoprotein subclass data, would work in large-scale population cohorts. Methods: We applied a self-organizing map (SOM) artificial intelligence methodology to define subgroups based on detailed lipoprotein profiles in a population-based cohort (n = 5789) and utilised the trained SOM in an independent cohort (n = 7607). We identified four SOM-based subgroups of individuals with distinct lipoprotein profiles and CHD risk and compared those to univariate subgrouping by apolipoprotein B quartiles. Results: The SOM-based subgroup with highest concentrations for non-HDL measures had the highest, and the subgroup with lowest concentrations, the lowest risk for CHD. However, apolipoprotein B quartiles produced better resolution of risk than the SOM-based subgroups and also striking dose-response behaviour. Conclusions: These results suggest that the majority of lipoprotein-mediated CHD risk is explained by apolipoprotein B-containing lipoprotein particles. Therefore, even advanced multivariate subgrouping, with comprehensive data on lipoprotein metabolism, may not advance CHD risk assessment.
Abstract Context: Aging varies between individuals, with profound consequences for chronic diseases and longevity. One hypothesis to explain the diversity is a genetically regulated molecular clock that runs differently between individuals. Large human studies with long enough follow-up to test the hypothesis are rare due to practical challenges, but statistical models of aging are built as proxies for the molecular clock by comparing young and old individuals cross-sectionally. These models remain untested against longitudinal data. Objective: We applied novel methodology to test if cross-sectional modeling can distinguish slow vs accelerated aging in a human population. Methods: We trained a machine learning model to predict age from 153 clinical and cardiometabolic traits. The model was tested against longitudinal data from another cohort. The training data came from cross-sectional surveys of the Finnish population (n = 9708; ages 25–74 years). The validation data included 3 time points across 10 years in the Young Finns Study (YFS; n = 1009; ages 24–49 years). Predicted metabolic age in 2007 was compared against observed aging rate from the 2001 visit to the 2011 visit in the YFS dataset and correlation between predicted vs observed metabolic aging was determined. Results: The cross-sectional proxy failed to predict longitudinal observations (R2 = 0.018%, P = 0.67). Conclusion: The finding is unexpected under the clock hypothesis that would produce a positive correlation between predicted and observed aging. Our results are better explained by a stratified model where aging rates per se are similar in adulthood but differences in starting points explain diverging metabolic fates.
Abstract Background: Peripheral artery disease (PAD) and coronary artery disease (CAD) represent atherosclerosis in different vascular beds. We used detailed metabolic biomarker profiling to identify common and discordant biomarkers and clarify pathophysiological differences for these vascular diseases. Methods and results: We used 5 prospective cohorts from Finnish population (FINRISK 1997, 2002, 2007, and 2012, and Health 2000; n=31 657; median follow‐up time of 14 years) to estimate associations between >200 metabolic biomarkers and incident PAD and CAD. Metabolic biomarkers were measured with nuclear magnetic resonance, and disease events were obtained from nationwide hospital records. During the follow‐up, 498 incident PAD and 2073 incident CAD events occurred. In age‐ and sex‐adjusted Cox models, apolipoproteins and cholesterol measures were robustly associated with incident CAD (eg, hazard ratio [HR] per SD for higher apolipoprotein B/A‐1 ratio, 1.30; 95% CI, 1.25–1.36), but not with incident PAD (HR per SD for higher apolipoprotein B/A‐1 ratio, 1.04; 95% CI, 0.95–1.14; Pheterogeneity<0.001). In contrast, triglyceride levels in low‐density lipoprotein and high‐density lipoprotein were associated with both end points (Pheterogeneity<0.05). Lower proportion of polyunsaturated fatty acids relative to total fatty acids, and higher concentrations of monounsaturated fatty acids, glycolysis‐related metabolites, and inflammatory protein markers were strongly associated with incident PAD, and many of these associations were stronger for PAD than for CAD (Pheterogeneity<0.001). Most differences in metabolic profiles for PAD and CAD remained when adjusting for traditional risk factors. Conclusions: The metabolic biomarker profile for future PAD risk is distinct from that of CAD. This may represent pathophysiological differences.
Abstract Background/Objectives: This observational study dissects the complex temporal associations between body-mass index (BMI), waist-hip ratio (WHR) and circulating metabolomics using a combination of longitudinal and cross-sectional population-based datasets and new systems epidemiology tools. Subjects/Methods: Firstly, a data-driven subgrouping algorithm was employed to simplify high-dimensional metabolic profiling data into a single categorical variable: a self-organizing map (SOM) was created from 174 metabolic measures from cross-sectional surveys (FINRISK, n = 9708, ages 25–74) and a birth cohort (NFBC1966, n = 3117, age 31 at baseline, age 46 at follow-up) and an expert committee defined four subgroups of individuals based on visual inspection of the SOM. Secondly, the subgroups were compared regarding BMI and WHR trajectories in an independent longitudinal dataset: participants of the Young Finns Study (YFS, n = 1286, ages 24–39 at baseline, 10 years follow-up, three visits) were categorized into the four subgroups and subgroup-specific age-dependent trajectories of BMI, WHR and metabolic measures were modelled by linear regression. Results: The four subgroups were characterised at age 39 by high BMI, WHR and dyslipidemia (designated TG-rich); low BMI, WHR and favourable lipids (TG-poor); low lipids in general (Low lipid) and high low-density-lipoprotein cholesterol (High LDL-C). Trajectory modelling of the YFS dataset revealed a dynamic BMI divergence pattern: despite overlapping starting points at age 24, the subgroups diverged in BMI, fasting insulin (three-fold difference at age 49 between TG-rich and TG-poor) and insulin-associated measures such as triglyceride-cholesterol ratio. Trajectories also revealed a WHR progression pattern: despite different starting points at the age of 24 in WHR, LDL-C and cholesterol-associated measures, all subgroups exhibited similar rates of change in these measures, i.e. WHR progression was uniform regardless of the cross-sectional metabolic profile. Conclusions: Age-associated weight variation in adults between 24 and 49 manifests as temporal divergence in BMI and uniform progression of WHR across metabolic health strata.
Abstract Background: Endurance exercise training promotes the catabolism of branched-chain amino acids (BCAAs) in skeletal muscles. We have previously shown that mitochondrial DNA (mtDNA) haplogroups J and K are markers of low responders in endurance training. In this paper, we hypothesize that BCAA catabolism is a surrogate marker of lower respiratory chain activity attributed to these haplogroups. We evaluated whether exercise-induced changes in amino acid concentrations differ between subjects harbouring mtDNA haplogroups J or K and those with non-JK haplogroups. Methods: Finnish male conscripts (N = 633) undertook the 12-min Cooper running test at the beginning and end of their military service. The intervention during the service mainly included endurance aerobic exercise and sports-related muscle training. Concentrations of seven amino acids were analysed in the serum using a high-throughput 1H NMR metabolomics platform. Total DNA was extracted from whole blood, and restriction fragment analysis was used to determine mtDNA haplogroups J and K. Results: The concentrations of the seven amino acids were higher following the intervention, with the exception of phenylalanine; interestingly, the increase in the concentrations of three BCAAs was larger in subjects with haplogroup J or K than in subjects with non-JK haplogroups (p = 0.029). MtDNA haplogroups J and K share two common nonsynonymous variants. Structural analysis based on crystallographic data on bovine complexes I and III revealed that the Leu18 variant in cytochrome b encoded by m.14798T > C may interfere with ubiquinone binding at the Qi site in complex III. Conclusions: The increase in the concentrations of serum BCAAs following exercise intervention differs between subjects harbouring mtDNA haplogroup J or K and those harbouring non-JK haplogroups. Lower response in endurance training and difference in exercise-induced increase in the concentrations of serum BCAAs suggest decreased respiratory chain activity. Haplogroups J and K share m.14798T > C in MT-CYB, which may hamper the function of complex III.
Abstract Background: GlycA is a nuclear magnetic resonance (NMR) spectroscopy biomarker that predicts risk of disease from myriad causes. It is heterogeneous; arising from five circulating glycoproteins with dynamic concentrations: alpha-1 antitrypsin (AAT), alpha-1-acid glycoprotein (AGP), haptoglobin (HP), transferrin (TF), and alpha-1-antichymotrypsin (AACT). The contributions of each glycoprotein to the disease and mortality risks predicted by GlycA remain unknown. Methods: We trained imputation models for AAT, AGP, HP, and TF from NMR metabolite measurements in 626 adults from a population cohort with matched NMR and immunoassay data. Levels of AAT, AGP, and HP were estimated in 11,861 adults from two population cohorts with eight years of follow-up, then each biomarker was tested for association with all common endpoints. Whole blood gene expression data was used to identify cellular processes associated with elevated AAT. Results: Accurate imputation models were obtained for AAT, AGP, and HP but not for TF. While AGP had the strongest correlation with GlycA, our analysis revealed variation in imputed AAT levels was the most predictive of morbidity and mortality for the widest range of diseases over the eight year follow-up period, including heart failure (meta-analysis hazard ratio = 1.60 per standard deviation increase of AAT, P-value = 1×10−10), influenza and pneumonia (HR = 1.37, P = 6×10−10), and liver diseases (HR = 1.81, P = 1×10−6). Transcriptional analyses revealed association of elevated AAT with diverse inflammatory immune pathways. Conclusions: This study clarifies the molecular underpinnings of the GlycA biomarker’s associated disease risk, and indicates a previously unrecognised association between elevated AAT and severe disease onset and mortality.
Abstract Objective: Previous studies on the association between metabolic biomarkers and hypertension have been limited by small sample sizes, low number of studied biomarkers, and cross-sectional study design. In the largest study to date, we assess the cross-sectional and longitudinal associations between high-abundance serum biomarkers and blood pressure (BP). Methods: We studied cross-sectional (N = 36 985; age 50.5 ± 14.2; 53.1% women) and longitudinal (N = 4197; age 49.4 ± 11.8, 55.3% women) population samples of Finnish individuals. We included 53 serum biomarkers and other detailed lipoprotein subclass measures in our analyses. We studied the associations between serum biomarkers and BP using both conventional statistical methods and a machine learning algorithm (gradient boosting) while adjusting for clinical risk factors. Results: Fifty-one of 53 serum biomarkers were cross-sectionally related to BP (adjusted P < 0.05 for all). Conventional linear regression modeling demonstrated that LDL cholesterol, remnant cholesterol, apolipoprotein B, and acetate were positively, and HDL particle size was negatively, associated with SBP change over time (adjusted P < 0.05 for all). Adding serum biomarkers (cross-sectional root-mean-square error: 16.27 mmHg; longitudinal: 17.61 mmHg) in the model with clinical measures (cross-sectional: 16.70 mmHg; longitudinal 18.52 mmHg) improved the machine learning model fit. Glucose, albumin, triglycerides in LDL, glycerol, VLDL particle size, and acetoacetate had the highest importance scores in models related to current or future BP. Conclusions: Our results suggest that serum lipids, and particularly LDL-derived and VLDL-derived cholesterol measures, and glucose metabolism abnormalities are associated with hypertension onset. Use of serum metabolite determination could improve identification of individuals at high risk of developing hypertension.
Abstract Aims/hypothesis: Metabolomics technologies have identified numerous blood biomarkers for type 2 diabetes risk in case−control studies of middle-aged and older individuals. We aimed to validate existing and identify novel metabolic biomarkers predictive of future diabetes in large cohorts of young adults. Methods: NMR metabolomics was used to quantify 229 circulating metabolic measures in 11,896 individuals from four Finnish observational cohorts (baseline age 24–45 years). Associations between baseline metabolites and risk of developing diabetes during 8–15 years of follow-up (392 incident cases) were adjusted for sex, age, BMI and fasting glucose. Prospective metabolite associations were also tested with fasting glucose, 2 h glucose and HOMA-IR at follow-up. Results: Out of 229 metabolic measures, 113 were associated with incident type 2 diabetes in meta-analysis of the four cohorts (ORs per 1 SD: 0.59–1.50; p< 0.0009). Among the strongest biomarkers of diabetes risk were branched-chain and aromatic amino acids (OR 1.31–1.33) and triacylglycerol within VLDL particles (OR 1.33–1.50), as well as linoleic n-6 fatty acid (OR 0.75) and non-esterified cholesterol in large HDL particles (OR 0.59). The metabolic biomarkers were more strongly associated with deterioration in post-load glucose and insulin resistance than with future fasting hyperglycaemia. A multi-metabolite score comprised of phenylalanine, non-esterified cholesterol in large HDL and the ratio of cholesteryl ester to total lipid in large VLDL was associated with future diabetes risk (OR 10.1 comparing individuals in upper vs lower fifth of the multi-metabolite score) in one of the cohorts (mean age 31 years). Conclusions/interpretation: Metabolic biomarkers across multiple molecular pathways are already predictive of the long-term risk of diabetes in young adults. Comprehensive metabolic profiling may help to target preventive interventions for young asymptomatic individuals at increased risk.
Abstract The role of metabolic syndrome (MetS) as a preceding metabolic state for type 2 diabetes and cardiovascular disease is widely recognised. To accumulate knowledge of the pathological mechanisms behind the condition at the methylation level, we conducted an epigenome-wide association study (EWAS) of MetS and its components, testing 1187 individuals of European ancestry for approximately 470 000 methylation sites throughout the genome. Methylation site cg19693031 in gene TXNIP —previously associated with type 2 diabetes, glucose and lipid metabolism, associated with fasting glucose level (P = 1.80 × 10−8). Cg06500161 in gene ABCG1 associated both with serum triglycerides (P = 5.36 × 10−9) and waist circumference (P = 5.21 × 10−9). The previously identified type 2 diabetes–associated locus cg08309687 in chromosome 21 associated with waist circumference for the first time (P = 2.24 × 10−7). Furthermore, a novel HDL association with cg17901584 in chromosome 1 was identified (P = 7.81 × 10−8). Our study supports previous genetic studies of MetS, finding that lipid metabolism plays a key role in pathology of the syndrome. We provide evidence regarding a close interplay with glucose metabolism. Finally, we suggest that in attempts to identify methylation loci linking separate MetS components, cg19693031 appears to represent a strong candidate.
Abstract Aims: Angiopoietin-like protein 3 (ANGPTL3) and 4 (ANGPTL4) inhibit lipoprotein lipase (LPL) and represent emerging drug targets to lower circulating triglycerides and reduce cardiovascular risk. To investigate the molecular effects of genetic mimicry of ANGPTL3 and ANGPTL4 inhibition and compare them to the effects of genetic mimicry of LPL enhancement. Methods and results: Associations of genetic variants in ANGPTL3 (rs11207977-T), ANGPTL4 (rs116843064-A), and LPL (rs115849089-A) with an extensive serum lipid and metabolite profile (208 measures) were characterized in six cohorts of up to 61 240 participants. Genetic associations with anthropometric measures, glucose-insulin metabolism, blood pressure, markers of kidney function, and cardiometabolic endpoints via genome-wide summary data were also explored. ANGPTL4 rs116843064-A and LPL rs115849089-A displayed a strikingly similar pattern of associations across the lipoprotein and lipid measures. However, the corresponding associations with ANGPTL3 rs11207977-T differed, including those for low-density lipoprotein and high-density lipoprotein particle concentrations and compositions. All three genotypes associated with lower concentrations of an inflammatory biomarker glycoprotein acetyls and genetic mimicry of ANGPTL3 inhibition and LPL enhancement were also associated with lower C-reactive protein. Genetic mimicry of ANGPTL4 inhibition and LPL enhancement were associated with a lower waist-to-hip ratio, improved insulin-glucose metabolism, and lower risk of coronary heart disease and type 2 diabetes, whilst genetic mimicry of ANGPTL3 was associated with improved kidney function. Conclusions: Genetic mimicry of ANGPTL4 inhibition and LPL enhancement have very similar systemic metabolic effects, whereas genetic mimicry of ANGPTL3 inhibition showed differing metabolic effects, suggesting potential involvement of pathways independent of LPL. Genetic mimicry of ANGPTL4 inhibition and LPL enhancement were associated with a lower risk of coronary heart disease and type 2 diabetes. These findings reinforce evidence that enhancing LPL activity (either directly or via upstream effects) through pharmacological approaches is likely to yield benefits to human health.
Abstract Background and aims: Apolipoprotein A-I (apoA-I) infusions represent a potential novel therapeutic approach for the prevention of coronary artery disease (CAD). Although circulating apoA-I concentrations inversely associate with risk of CAD, the evidence base of this representing a causal relationship is lacking. The aim was to assess the causal role of apoA-I using human genetics. Methods: We identified a variant (rs12225230) in APOA1 locus that associated with circulating apoA-I concentrations (p < 5 × 10−8) in 20,370 Finnish participants, and meta-analyzed our data with a previous GWAS of apoA-I. We obtained genetic estimates of CAD from UK Biobank and CARDIoGRAMplusC4D (totaling 122,733 CAD cases) and conducted a two-sample Mendelian randomization analysis. We compared our genetic findings to observational associations of apoA-I with risk of CAD in 918 incident CAD cases among 11,535 individuals from population-based prospective cohorts. Results: ApoA-I was associated with a lower risk of CAD in observational analyses (HR 0.81; 95%CI: 0.75, 0.88; per 1-SD higher apoA-I), with the association showing a dose-response relationship. Rs12225230 associated with apoA-I concentrations (per-C allele beta 0.076 SD; SE: 0.013; p = 1.5 × 10−9) but not with confounders. In Mendelian randomization analyses, apoA-I was not related to risk of CAD (OR 1.13; 95%CI: 0.98,1.30 per 1-SD higher apoA-I), which was different from the observational association. Similar findings were observed using an independent ABCA1 variant in sensitivity analysis. Conclusions: Genetic evidence fails to support a cardioprotective role for apoA-I. This is in line with the cumulative evidence showing that HDL-related phenotypes are unlikely to have a protective role in CAD.
Abstract Background: Observational findings for high-density lipoprotein (HDL)-mediated cholesterol efflux capacity (HDL-CEC) and coronary heart disease (CHD) appear inconsistent, and knowledge of the genetic architecture of HDL-CEC is limited. Objectives: A large-scale observational study on the associations of HDL-CEC and other HDL-related measures with CHD and the largest genome-wide association study (GWAS) of HDL-CEC. Participants/Methods: Six independent cohorts were included with follow-up data for 14,438 participants to investigate the associations of HDL-related measures with incident CHD (1,570 events). The GWAS of HDL-CEC was carried out in 20,372 participants. Results: HDL-CEC did not associate with CHD when adjusted for traditional risk factors and HDL cholesterol (HDL-C). In contradiction, almost all HDL-related concentration measures associated consistently with CHD after corresponding adjustments. There were no genetic loci associated with HDL-CEC independent of HDL-C and triglycerides. Conclusions: HDL-CEC is not unequivocally associated with CHD in contrast to HDL-C, apolipoprotein A-I, and most of the HDL subclass particle concentrations.
Abstract Background: Cardiomyocytes secrete atrial natriuretic peptide (ANP) and B-type natriuretic peptide (BNP) in response to mechanical stretching, making them useful clinical biomarkers of cardiac stress. Both human and animal studies indicate a role for ANP as a regulator of blood pressure with conflicting results for BNP. Methods and Results: We used genome-wide association analysis (n=6296) to study the effects of genetic variants on circulating natriuretic peptide concentrations and compared the impact of natriuretic peptide–associated genetic variants on blood pressure (n=27 059). Eight independent genetic variants in 2 known (NPPA-NPPB and POC1B-GALNT4) and 1 novel locus (PPP3CC) associated with midregional proANP (MR-proANP), BNP, aminoterminal proBNP (NT-proBNP), or BNP:NT-proBNP ratio. The NPPA-NPPB locus containing the adjacent genes encoding ANP and BNP harbored 4 independent cis variants with effects specific to either midregional proANP or BNP and a rare missense single nucleotide polymorphism in NT-proBNP seriously altering its measurement. Variants near the calcineurin catalytic subunit gamma gene PPP3CC and the polypeptide N-acetylgalactosaminyltransferase 4 gene GALNT4 associated with BNP:NT-proBNP ratio but not with BNP or midregional proANP, suggesting effects on the post-translational regulation of proBNP. Out of the 8 individual variants, only those correlated with midregional proANP had a statistically significant albeit weak impact on blood pressure. The combined effect of these 3 single nucleotide polymorphisms also associated with hypertension risk (P=8.2×10−4). Conclusions: Common genetic differences affecting the circulating concentration of ANP associated with blood pressure, whereas those affecting BNP did not, highlighting the blood pressure–lowering effect of ANP in the general population.
Abstract Background: Inflammatory processes contribute to the pathophysiology of multiple chronic conditions. Genetic factors play a crucial role in modulating the inflammatory load, but the exact mechanisms are incompletely understood. Objective: To assess genetic determinants of 16 circulating cytokines and cell adhesion molecules (inflammatory phenotypes) in Finns. Methods: Genome-wide associations of the inflammatory phenotypes were studied in Northern Finland Birth Cohort 1966 (N=5284). A subsequent meta-analysis was completed for 10 phenotypes available in a previous genome-wide association study, adding up to 13 577 individuals in the study. Complementary association tests were performed to study the effect of the ABO blood types on soluble adhesion molecule levels. Results: We identified seven novel and six previously reported genetic associations (p<3.1×10−9). Three loci were associated with soluble vascular cell adhesion molecule-1 (sVCAM-1) level, one of which was the ABO locus that has been previously associated with soluble E-selectin (sE-selectin) and intercellular adhesion molecule-1 (sICAM-1) levels. Our findings suggest that the blood type B associates primarily with sVCAM-1 level, while the A1 subtype shows a robust effect on sE-selectin and sICAM-1 levels. The genotypes in the ABO locus associating with higher soluble adhesion molecule levels tend to associate with lower circulating cholesterol levels and lower cardiovascular disease risk. Conclusion: The present results extend the knowledge about genetic factors contributing to the inflammatory load. Our findings suggest that two distinct mechanisms contribute to the soluble adhesion molecule levels in the ABO locus and that elevated soluble adhesion molecule levels per se may not increase risk for cardiovascular disease.
Abstract Background: The pathophysiological basis of idiopathic normal pressure hydrocephalus (iNPH) is still unclear. Previous studies have shown a familial aggregation and a potential heritability when it comes to iNPH. Our aim was to conduct a novel case-controlled comparison between familial iNPH (fNPH) patients and their elderly relatives, involving multiple different families. Methods: Questionnaires and phone interviews were used for collecting the data and categorising the iNPH patients into the familial (fNPH) and the sporadic groups. Identical questionnaires were sent to the relatives of the potential fNPH patients. Venous blood samples were collected for genetic studies. The disease histories of the probable fNPH patients (n = 60) were compared with their ≥ 60-year-old relatives with no iNPH (n = 49). A modified Charlson Comorbidity Index (CCI) was used to measure the overall disease burden. Fisher’s exact test (two-tailed), the Mann–Whitney U test (two-tailed) and a multivariate binary logistic regression analysis were used to perform the statistical analyses. Results: Diabetes (32% vs. 14%, p = 0.043), arterial hypertension (65.0% vs. 43%, p = 0.033), cardiac insufficiency (16% vs. 2%, p = 0.020) and depressive symptoms (32% vs. 8%, p = 0.004) were overrepresented among the probable fNPH patients compared to their non-iNPH relatives. In the age-adjusted multivariate logistic regression analysis, diabetes remained independently associated with fNPH (OR = 3.8, 95% CI 1.1–12.9, p = 0.030). Conclusions: Diabetes is associated with fNPH and a possible risk factor for fNPH. Diabetes could contribute to the pathogenesis of iNPH/fNPH, which motivates to further prospective and gene-environmental studies to decipher the disease modelling of iNPH/fNPH.
Abstract Copy number variants (CNVs) are associated with syndromic and severe neurological and psychiatric disorders (SNPDs), such as intellectual disability, epilepsy, schizophrenia, and bipolar disorder. Although considered high-impact, CNVs are also observed in the general population. This presents a diagnostic challenge in evaluating their clinical significance. To estimate the phenotypic differences between CNV carriers and non-carriers regarding general health and well-being, we compared the impact of SNPD-associated CNVs on health, cognition, and socioeconomic phenotypes to the impact of three genome-wide polygenic risk score (PRS) in two Finnish cohorts (FINRISK, n = 23,053 and NFBC1966, n = 4895). The focus was on CNV carriers and PRS extremes who do not have an SNPD diagnosis. We identified high-risk CNVs (DECIPHER CNVs, risk gene deletions, or large [>1 Mb] CNVs) in 744 study participants (2.66%), 36 (4.8%) of whom had a diagnosed SNPD. In the remaining 708 unaffected carriers, we observed lower educational attainment (EA; OR = 0.77 [95% CI 0.66–0.89]) and lower household income (OR = 0.77 [0.66–0.89]). Income-associated CNVs also lowered household income (OR = 0.50 [0.38–0.66]), and CNVs with medical consequences lowered subjective health (OR = 0.48 [0.32–0.72]). The impact of PRSs was broader. At the lowest extreme of PRS for EA, we observed lower EA (OR = 0.31 [0.26–0.37]), lower-income (OR = 0.66 [0.57–0.77]), lower subjective health (OR = 0.72 [0.61–0.83]), and increased mortality (Cox’s HR = 1.55 [1.21–1.98]). PRS for intelligence had a similar impact, whereas PRS for schizophrenia did not affect these traits. We conclude that the majority of working-age individuals carrying high-risk CNVs without SNPD diagnosis have a modest impact on morbidity and mortality, as well as the limited impact on income and educational attainment, compared to individuals at the extreme end of common genetic variation. Our findings highlight that the contribution of traditional high-risk variants such as CNVs should be analyzed in a broader genetic context, rather than evaluated in isolation.
Abstract Background: Loss-of-function mutations in the SGLT1 (sodium/glucose co-transporter-1) gene result in a rare glucose/galactose malabsorption disorder and neonatal death if untreated. In the general population, variants related to intestinal glucose absorption remain uncharacterized. Objectives: The goal of this study was to identify functional SGLT1 gene variants and characterize their clinical consequences. Methods: Whole exome sequencing was performed in the ARIC (Atherosclerosis Risk in Communities) study participants enrolled from 4 U.S. communities. The association of functional, nonsynonymous substitutions in SGLT1 with 2-h oral glucose tolerance test results was determined. Variants related to impaired glucose tolerance were studied, and Mendelian randomization analysis of cardiometabolic outcomes was performed. Results: Among 5,687 European-American subjects (mean age 54 ± 6 years; 47% male), those who carried a haplotype of 3 missense mutations (frequency of 6.7%)—Asn51Ser, Ala411Thr, and His615Gln—had lower 2-h glucose and odds of impaired glucose tolerance than noncarriers (β-coefficient: −8.0; 95% confidence interval [CI]: −12.7 to −3.3; OR: 0.71; 95% CI: 0.59 to 0.86, respectively). The association of the haplotype with oral glucose tolerance test results was consistent in a replication sample of 2,791 African-American subjects (β = −16.3; 95% CI: −36.6 to 4.1; OR: 0.39; 95% CI: 0.17 to 0.91) and an external European-Finnish population sample of 6,784 subjects (β = −3.2; 95% CI: −6.4 to −0.02; OR: 0.81; 95% CI: 0.68 to 0.98). Using a Mendelian randomization approach in the index cohort, the estimated 25-year effect of a reduction of 20 mg/dl in 2-h glucose via SGLT1 inhibition would be reduced prevalent obesity (OR: 0.43; 95% CI: 0.23 to 0.63), incident diabetes (hazard ratio [HR]: 0.58; 95% CI: 0.35 to 0.81), heart failure (HR: 0.53; 95% CI: 0.24 to 0.83), and death (HR: 0.66; 95% CI: 0.42 to 0.90). Conclusions: Functionally damaging missense variants in SGLT1 protect from diet-induced hyperglycemia in multiple populations. Reduced intestinal glucose uptake may protect from long-term cardiometabolic outcomes, providing support for therapies that target SGLT1 function to prevent and treat metabolic conditions.
Abstract Background: Chronic obstructive pulmonary disease (COPD) is a common lung disorder characterized by persistent and progressive airflow limitation as well as systemic changes. Metabolic changes in blood may help detect COPD in an earlier stage and predict prognosis. Methods: We conducted a comprehensive study of circulating metabolites, measured by proton Nuclear Magnetic Resonance Spectroscopy, in relation with COPD and lung function. The discovery sample consisted of 5557 individuals from two large population-based studies in the Netherlands, the Rotterdam Study and the Erasmus Rucphen Family study. Significant findings were replicated in 12,205 individuals from the Lifelines-DEEP study, FINRISK and the Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS) studies. For replicated metabolites further investigation of causality was performed, utilizing genetics in the Mendelian randomization approach. Results: There were 602 cases of COPD and 4955 controls used in the discovery meta-analysis. Our logistic regression results showed that higher levels of plasma Glycoprotein acetyls (GlycA) are significantly associated with COPD (OR = 1.16, P = 5.6 × 10−4 in the discovery and OR = 1.30, P = 1.8 × 10−6 in the replication sample). A bi-directional two-sample Mendelian randomization analysis suggested that circulating blood GlycA is not causally related to COPD, but that COPD causally increases GlycA levels. Using the prospective data of the same sample of Rotterdam Study in Cox-regression, we show that the circulating GlycA level is a predictive biomarker of COPD incidence (HR = 1.99, 95%CI 1.52–2.60, comparing those in the highest and lowest quartile of GlycA) but is not significantly associated with mortality in COPD patients (HR = 1.07, 95%CI 0.94–1.20). Conclusions: Our study shows that circulating blood GlycA is a biomarker of early COPD pathology.
Abstract Background: Pregnancy triggers well-known alterations in maternal glucose and lipid balance but its overall effects on systemic metabolism remain incompletely understood. Methods: Detailed molecular profiles (87 metabolic measures and 37 cytokines) were measured for up to 4260 women (24–49 years, 322 pregnant) from three population-based cohorts in Finland. Circulating molecular concentrations in pregnant women were compared to those in non-pregnant women. Metabolic profiles were also reassessed for 583 women 6 years later to uncover the longitudinal metabolic changes in response to change in the pregnancy status. Results: Compared to non-pregnant women, all lipoprotein subclasses and lipids were markedly increased in pregnant women. The most pronounced differences were observed for the intermediate-density, low-density and high-density lipoprotein triglyceride concentrations. Large differences were also seen for many fatty acids and amino acids. Pregnant women also had higher concentrations of low-grade inflammatory marker glycoprotein acetyls, higher concentrations of interleukin-18 and lower concentrations of interleukin-12p70. The changes in metabolic concentrations for women who were not pregnant at baseline but pregnant 6 years later (or vice versa) matched (or were mirror-images of) the cross-sectional association pattern. Cross-sectional results were consistent across the three cohorts and similar longitudinal changes were seen for 653 women in 4-year and 497 women in 10-year follow-up. For multiple metabolic measures, the changes increased in magnitude across the three trimesters. Conclusions: Pregnancy initiates substantial metabolic and inflammatory changes in the mothers. Comprehensive characterisation of normal pregnancy is important for gaining understanding of the key nutrients for fetal growth and development. These findings also provide a valuable molecular reference in relation to studies of adverse pregnancy outcomes.
Abstract Objective: To conduct a comprehensive analysis of circulating metabolites and incident stroke in large prospective population-based settings. Methods: We investigated the association of metabolites with risk of stroke in 7 prospective cohort studies including 1,791 incident stroke events among 38,797 participants in whom circulating metabolites were measured by nuclear magnetic resonance technology. The relationship between metabolites and stroke was assessed with Cox proportional hazards regression models. The analyses were performed considering all incident stroke events and ischemic and hemorrhagic events separately. Results: The analyses revealed 10 significant metabolite associations. Amino acid histidine (hazard ratio [HR] per SD 0.90, 95% confidence interval [CI] 0.85, 0.94; p = 4.45 × 10−5), glycolysis-related metabolite pyruvate (HR per SD 1.09, 95% CI 1.04, 1.14; p = 7.45 × 10−4), acute-phase reaction marker glycoprotein acetyls (HR per SD 1.09, 95% CI 1.03, 1.15; p = 1.27 × 10−3), cholesterol in high-density lipoprotein (HDL) 2, and several other lipoprotein particles were associated with risk of stroke. When focused on incident ischemic stroke, a significant association was observed with phenylalanine (HR per SD 1.12, 95% CI 1.05, 1.19; p = 4.13 × 10−4) and total and free cholesterol in large HDL particles. Conclusions: We found association of amino acids, glycolysis-related metabolites, acute-phase reaction markers, and several lipoprotein subfractions with the risk of stroke. These findings support the potential of metabolomics to provide new insights into the metabolic changes preceding stroke.
Abstract Background: Lower birthweight is associated with increased susceptibility to cardiometabolic diseases in adulthood, but the underlying molecular pathways are incompletely understood. We examined associations of birthweight with a comprehensive metabolic profile measured in adolescents and adults. Methods: High-throughput nuclear magnetic resonance metabolomics and biochemical assays were used to quantify 87 circulating metabolic measures in seven cohorts from Finland and the UK, comprising altogether 18 288 individuals (mean age 26 years, range 15–75). Metabolic associations with birthweight were assessed by linear regression models adjusted for sex, gestational age and age at blood sampling. The metabolic associations with birthweight were compared with the corresponding associations with adult body mass index (BMI). Results: Lower birthweight adjusted for gestational age was adversely associated with cardiometabolic biomarkers, including lipoprotein subclasses, fatty acids, amino acids and markers of inflammation and impaired liver function (P < 0.0015 for 46 measures). Associations were consistent across cohorts with different ages at metabolic profiling, but the magnitudes were weak. The pattern of metabolic deviations associated with lower birthweight resembled the metabolic signature of higher adult BMI (R2 = 0.77) assessed at the same time as the metabolic profiling. The resemblance indicated that 1 kg lower birthweight is associated with similar metabolic aberrations as caused by 0.92 units higher BMI in adulthood. Conclusions: Lower birthweight adjusted for gestational age is associated with adverse biomarker aberrations across multiple metabolic pathways. Coherent metabolic signatures between lower birthweight and higher adult adiposity suggest that shared molecular pathways may potentially underpin the metabolic deviations. However, the magnitudes of metabolic associations with birthweight are modest in comparison to the effects of adiposity, implying that birthweight is only a weak indicator of the metabolic risk profile in adulthood.
Abstract Background: Immunometabolism plays a central role in many cardiometabolic diseases. However, a robust map of immune-related gene networks in circulating human cells, their interactions with metabolites, and their genetic control is still lacking. Here, we integrate blood transcriptomic, metabolomic, and genomic profiles from two population-based cohorts (total N = 2168), including a subset of individuals with matched multi-omic data at 7-year follow-up. Results: We identify topologically replicable gene networks enriched for diverse immune functions including cytotoxicity, viral response, B cell, platelet, neutrophil, and mast cell/basophil activity. These immune gene modules show complex patterns of association with 158 circulating metabolites, including lipoprotein subclasses, lipids, fatty acids, amino acids, small molecules, and CRP. Genome-wide scans for module expression quantitative trait loci (mQTLs) reveal five modules with mQTLs that have both cis and trans effects. The strongest mQTL is in ARHGEF3 (rs1354034) and affects a module enriched for platelet function, independent of platelet counts. Modules of mast cell/basophil and neutrophil function show temporally stable metabolite associations over 7-year follow-up, providing evidence that these modules and their constituent gene products may play central roles in metabolic inflammation. Furthermore, the strongest mQTL in ARHGEF3 also displays clear temporal stability, supporting widespread trans effects at this locus. Conclusions: This study provides a detailed map of natural variation at the blood immunometabolic interface and its genetic basis, and may facilitate subsequent studies to explain inter-individual variation in cardiometabolic disease.
Abstract Cholesteryl ester transfer protein (CETP) inhibition reduces vascular event risk, but confusion surrounds its effects on low-density lipoprotein (LDL) cholesterol. Here, we clarify associations of genetic inhibition of CETP on detailed lipoprotein measures and compare those to genetic inhibition of 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGCR). We used an allele associated with lower CETP expression (rs247617) to mimic CETP inhibition and an allele associated with lower HMGCR expression (rs12916) to mimic the well-known effects of statins for comparison. The study consists of 65,427 participants of European ancestries with detailed lipoprotein subclass profiling from nuclear magnetic resonance spectroscopy. Genetic associations were scaled to 10% reduction in relative risk of coronary heart disease (CHD). We also examined observational associations of the lipoprotein subclass measures with risk of incident CHD in 3 population-based cohorts totalling 616 incident cases and 13,564 controls during 8-year follow-up. Genetic inhibition of CETP and HMGCR resulted in near-identical associations with LDL cholesterol concentration estimated by the Friedewald equation. Inhibition of HMGCR had relatively consistent associations on lower cholesterol concentrations across all apolipoprotein B-containing lipoproteins. In contrast, the associations of the inhibition of CETP were stronger on lower remnant and very-low-density lipoprotein (VLDL) cholesterol, but there were no associations on cholesterol concentrations in LDL defined by particle size (diameter 18–26 nm) (−0.02 SD LDL defined by particle size; 95% CI: −0.10 to 0.05 for CETP versus −0.24 SD, 95% CI −0.30 to −0.18 for HMGCR). Inhibition of CETP was strongly associated with lower proportion of triglycerides in all high-density lipoprotein (HDL) particles. In observational analyses, a higher triglyceride composition within HDL subclasses was associated with higher risk of CHD, independently of total cholesterol and triglycerides (strongest hazard ratio per 1 SD higher triglyceride composition in very large HDL 1.35; 95% CI: 1.18–1.54). In conclusion, CETP inhibition does not appear to affect size-specific LDL cholesterol but is likely to lower CHD risk by lowering concentrations of other atherogenic, apolipoprotein B-containing lipoproteins (such as remnant and VLDLs). Inhibition of CETP also lowers triglyceride composition in HDL particles, a phenomenon reflecting combined effects of circulating HDL, triglycerides, and apolipoprotein B-containing particles and is associated with a lower CHD risk in observational analyses. Our results reveal that conventional composite lipid assays may mask heterogeneous effects of emerging lipid-altering therapies.
Abstract Background: Recent studies have revealed sexually dimorphic associations between the carbamoyl‐phosphate synthase 1 locus, intermediates of the metabolic pathway leading from choline to urea, and risk of coronary artery disease (CAD) in women. Based on evidence from the literature, the atheroprotective association with carbamoyl‐phosphate synthase 1 could be mediated by the strong genetic effect of this locus on increased circulating glycine levels. Methods and Results: We sought to identify additional genetic determinants of circulating glycine levels by carrying out a meta‐analysis of genome‐wide association study data in up to 30 118 subjects of European ancestry. Mendelian randomization and other analytical approaches were used to determine whether glycine‐associated variants were associated with CAD and traditional risk factors. Twelve loci were significantly associated with circulating glycine levels, 7 of which were not previously known to be involved in glycine metabolism (ACADM,PHGDH,COX18‐ADAMTS3,PSPH,TRIB1,PTPRD, and ABO). Glycine‐raising alleles at several loci individually exhibited directionally consistent associations with decreased risk of CAD. However, these effects could not be attributed directly to glycine because of associations with other CAD‐related traits. By comparison, genetic models that only included the 2 variants directly involved in glycine degradation and for which there were no other pleiotropic associations were not associated with risk of CAD or blood pressure, lipid levels, and obesity‐related traits. Conclusions: These results provide additional insight into the genetic architecture of glycine metabolism, but do not yield conclusive evidence for a causal relationship between circulating levels of this amino acid and risk of CAD in humans.
Abstract Objective: This study aimed to investigate the role of cytokines as intermediates in the pathway from increased adiposity to disease. Methods: BMI and circulating levels of up to 41 cytokines were measured in individuals from three Finnish cohort studies (n = 8,293). Mendelian randomization (MR) was used to assess the impact of BMI on circulating cytokines and the impact of BMI‐driven cytokines on risk of obesity‐related diseases. Results: Observationally, BMI was associated with 19 cytokines. For every SD increase in BMI, causal effect estimates were strongest for hepatocyte growth factor, monocyte chemotactic protein‐1 (MCP‐1), and tumor necrosis factor–related apoptosis‐inducing ligand (TRAIL) and were as ratios of geometric means 1.13 (95% CI: 1.08‐1.19), 1.08 (95% CI: 1.04‐1.14), and 1.13 (95% CI: 1.04‐1.21), respectively. TRAIL was associated with a small increase in the odds of coronary artery disease (odds ratio: 1.03; 95% CI: 1.00‐1.06). There was inconsistent evidence for a protective role of MCP‐1 against inflammatory bowel diseases. Conclusions: Observational and MR estimates of the effect of BMI on cytokine levels were generally concordant. There was little evidence for an effect of raised levels of BMI‐driven cytokines on disease. These findings illustrate the challenges of MR when applied in the context of molecular mediation.
Abstract Aims: Low-density lipoprotein (LDL) particles cause atherosclerotic cardiovascular disease (ASCVD) through their retention, modification, and accumulation within the arterial intima. High plasma concentrations of LDL drive this disease, but LDL quality may also contribute. Here, we focused on the intrinsic propensity of LDL to aggregate upon modification. We examined whether inter-individual differences in this quality are linked with LDL lipid composition and coronary artery disease (CAD) death, and basic mechanisms for plaque growth and destabilization. Methods and results: We developed a novel, reproducible method to assess the susceptibility of LDL particles to aggregate during lipolysis induced ex vivo by human recombinant secretory sphingomyelinase. Among patients with an established CAD, we found that the presence of aggregation-prone LDL was predictive of future cardiovascular deaths, independently of conventional risk factors. Aggregation-prone LDL contained more sphingolipids and less phosphatidylcholines than did aggregation-resistant LDL. Three interventions in animal models to rationally alter LDL composition lowered its susceptibility to aggregate and slowed atherosclerosis. Similar compositional changes induced in humans by PCSK9 inhibition or healthy diet also lowered LDL aggregation susceptibility. Aggregated LDL in vitro activated macrophages and T cells, two key cell types involved in plaque progression and rupture. Conclusion: Our results identify the susceptibility of LDL to aggregate as a novel measurable and modifiable factor in the progression of human ASCVD.
Abstract Predicting longer-term mortality risk requires collection of clinical data, which is often cumbersome. Therefore, we use a well-standardized metabolomics platform to identify metabolic predictors of long-term mortality in the circulation of 44,168 individuals (age at baseline 18–109), of whom 5512 died during follow-up. We apply a stepwise (forward-backward) procedure based on meta-analysis results and identify 14 circulating biomarkers independently associating with all-cause mortality. Overall, these associations are similar in men and women and across different age strata. We subsequently show that the prediction accuracy of 5- and 10-year mortality based on a model containing the identified biomarkers and sex (C-statistic = 0.837 and 0.830, respectively) is better than that of a model containing conventional risk factors for mortality (C-statistic = 0.772 and 0.790, respectively). The use of the identified metabolic profile as a predictor of mortality or surrogate endpoint in clinical studies needs further investigation.
Abstract Background: Cardiac hypertrophy increases the risk of developing heart failure and cardiovascular death. The neutrophil inflammatory protein, lipocalin‐2 (LCN2/NGAL), is elevated in certain forms of cardiac hypertrophy and acute heart failure. However, a specific role for LCN2 in predisposition and etiology of hypertrophy and the relevant genetic determinants are unclear. Here, we defined the role of LCN2 in concentric cardiac hypertrophy in terms of pathophysiology, inflammatory expression networks, and genomic determinants. Methods and Results: We used 3 experimental models: a polygenic model of cardiac hypertrophy and heart failure, a model of intrauterine growth restriction and Lcn2‐knockout mouse; cultured cardiomyocytes; and 2 human cohorts: 114 type 2 diabetes mellitus patients and 2064 healthy subjects of the YFS (Young Finns Study). In hypertrophic heart rats, cardiac and circulating Lcn2 was significantly overexpressed before, during, and after development of cardiac hypertrophy and heart failure. Lcn2 expression was increased in hypertrophic hearts in a model of intrauterine growth restriction, whereas Lcn2‐knockout mice had smaller hearts. In cultured cardiomyocytes, Lcn2 activated molecular hypertrophic pathways and increased cell size, but reduced proliferation and cell numbers. Increased LCN2 was associated with cardiac hypertrophy and diastolic dysfunction in diabetes mellitus. In the YFS,LCN2 expression was associated with body mass index and cardiac mass and with levels of inflammatory markers. The single‐nucleotide polymorphism, rs13297295, located near LCN2 defined a significant cis‐eQTL for LCN2 expression. Conclusions: Direct effects of LCN2 on cardiomyocyte size and number and the consistent associations in experimental and human analyses reveal a central role for LCN2 in the ontogeny of cardiac hypertrophy and heart failure.
Abstract Background: Both statins and proprotein convertase subtilisin/kexin type 9 (PCSK9) inhibitors lower blood low-density lipoprotein cholesterol levels to reduce risk of cardiovascular events. To assess potential differences between metabolic effects of these 2 lipid-lowering therapies, we performed detailed lipid and metabolite profiling of a large randomized statin trial and compared the results with the effects of genetic inhibition of PCSK9, acting as a naturally occurring trial. Methods: Two hundred twenty-eight circulating metabolic measures were quantified by nuclear magnetic resonance spectroscopy, including lipoprotein subclass concentrations and their lipid composition, fatty acids, and amino acids, for 5359 individuals (2659 on treatment) in the PROSPER (Prospective Study of Pravastatin in the Elderly at Risk) trial at 6 months postrandomization. The corresponding metabolic measures were analyzed in 8 population cohorts (N=72 185) using PCSK9 rs11591147 as an unconfounded proxy to mimic the therapeutic effects of PCSK9 inhibitors. Results: Scaled to an equivalent lowering of low-density lipoprotein cholesterol, the effects of genetic inhibition of PCSK9 on 228 metabolic markers were generally consistent with those of statin therapy (R2=0.88). Alterations in lipoprotein lipid composition and fatty acid distribution were similar. However, discrepancies were observed for very-low-density lipoprotein lipid measures. For instance, genetic inhibition of PCSK9 had weaker effects on lowering of very-low-density lipoprotein cholesterol compared with statin therapy (54% versus 77% reduction, relative to the lowering effect on low-density lipoprotein cholesterol; P=2×10−7 for heterogeneity). Genetic inhibition of PCSK9 showed no significant effects on amino acids, ketones, or a marker of inflammation (GlycA), whereas statin treatment weakly lowered GlycA levels. Conclusions: Genetic inhibition of PCSK9 had similar metabolic effects to statin therapy on detailed lipid and metabolite profiles. However, PCSK9 inhibitors are predicted to have weaker effects on very-low-density lipoprotein lipids compared with statins for an equivalent lowering of low-density lipoprotein cholesterol, which potentially translate into smaller reductions in cardiovascular disease risk.
Abstract Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene–trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2, known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits.
We report the first long-term follow-up of a randomized trial (NCT04978259) addressing the effects of remdesivir on recovery (primary outcome) and other patient-important outcomes one year after hospitalization resulting from COVID-19. Of the 208 patients recruited from 11 Finnish hospitals, 198 survived, of whom 181 (92%) completed follow-up. At one year, self-reported recovery occurred in 85% in remdesivir and 86% in standard of care (SoC) (RR 0.94, 95% CI 0.47-1.90). We infer no convincing difference between remdesivir and SoC in quality of life or symptom outcomes (p > 0.05). Of the 21 potential long-COVID symptoms, patients reported moderate/major bother from fatigue (26%), joint pain (22%), and problems with memory (19%) and attention/concentration (18%). In conclusion, after a one-year follow-up of hospitalized patients, one in six reported they had not recovered well from COVID-19. Our results provide no convincing evidence of remdesivir benefit, but wide confidence intervals included possible benefit and harm.
OBJECTIVES: To recontact biobank participants and collect cognitive, behavioural and lifestyle information via a secure online platform. DESIGN: Biobank-based recontacting pilot study. SETTING: Three Finnish biobanks (Helsinki, Auria, Tampere) recruiting participants from February 2021 to July 2021. PARTICIPANTS: All eligible invitees were enrolled in FinnGen by their biobanks (Helsinki, Auria, Tampere), had available genetic data and were >18 years old. Individuals with severe neuropsychiatric disease or cognitive or physical disabilities were excluded. Lastly, 5995 participants were selected based on their polygenic score for cognitive abilities and invited to the study. Among invitees, 1115 had successfully participated and completed the study questionnaire(s). OUTCOME MEASURES: The primary outcome was the participation rate among study invitees. Secondary outcomes included questionnaire completion rate, quality of data collected and comparison of participation rate boosting strategies. RESULTS: The overall participation rate was 18.6% among all invitees and 23.1% among individuals aged 18-69. A second reminder letter yielded an additional 9.7% participation rate in those who did not respond to the first invitation. Recontacting participants via an online healthcare portal yielded lower participation than recontacting via physical letter. The completion rate of the questionnaire and cognitive tests was high (92% and 85%, respectively), and measurements were overall reliable among participants. For example, the correlation (r) between self-reported body mass index and that collected by the biobanks was 0.92. CONCLUSION: In summary, this pilot suggests that recontacting FinnGen participants with the goal to collect a wide range of cognitive, behavioural and lifestyle information without additional engagement results in a low participation rate, but with reliable data. We suggest that such information be collected at enrolment, if possible, rather than via post hoc recontacting.
Abstract Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.