Abstract Background: There is increasing evidence that elevated body mass index (BMI) is associated with reduced survival for women with breast cancer. However, the underlying reasons remain unclear. We conducted a Mendelian randomization analysis to investigate a possible causal role of BMI in survival from breast cancer. Methods: We used individual-level data from six large breast cancer case-cohorts including a total of 36 210 individuals (2475 events) of European ancestry. We created a BMI genetic risk score (GRS) based on genotypes at 94 known BMI-associated genetic variants. Association between the BMI genetic score and breast cancer survival was analysed by Cox regression for each study separately. Study-specific hazard ratios were pooled using fixed-effect meta-analysis. Results: BMI genetic score was found to be associated with reduced breast cancer-specific survival for estrogen receptor (ER)-positive cases [hazard ratio (HR) = 1.11, per one-unit increment of GRS, 95% confidence interval (CI) 1.01–1.22, P = 0.03]. We observed no association for ER-negative cases (HR = 1.00, per one-unit increment of GRS, 95% CI 0.89–1.13, P = 0.95). Conclusions: Our findings suggest a causal effect of increased BMI on reduced breast cancer survival for ER-positive breast cancer. There is no evidence of a causal effect of higher BMI on survival for ER-negative breast cancer cases.
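The pooling step named in the abstract above (study-specific hazard ratios combined by fixed-effect meta-analysis) can be illustrated with a minimal inverse-variance sketch. This is not the authors' code: the function name `pool_fixed_effect`, the recovery of standard errors from 95% confidence limits, and the three study-level estimates are all assumptions made for illustration.

```python
import math

Z_95 = 1.96  # normal quantile for a two-sided 95% interval


def pool_fixed_effect(studies):
    """Pool (hr, ci_low, ci_high) tuples by inverse-variance weighting.

    Hypothetical helper: works on the log-HR scale and returns
    (pooled_hr, pooled_ci_low, pooled_ci_high).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for hr, ci_low, ci_high in studies:
        log_hr = math.log(hr)
        # Recover the standard error of log(HR) from the width of the 95% CI.
        se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
        weight = 1.0 / se ** 2
        weighted_sum += weight * log_hr
        weight_total += weight
    pooled_log_hr = weighted_sum / weight_total
    pooled_se = math.sqrt(1.0 / weight_total)
    return (
        math.exp(pooled_log_hr),
        math.exp(pooled_log_hr - Z_95 * pooled_se),
        math.exp(pooled_log_hr + Z_95 * pooled_se),
    )


# Hypothetical study-level estimates, invented for illustration only.
example_studies = [(1.15, 0.98, 1.35), (1.08, 0.95, 1.23), (1.12, 0.90, 1.39)]
pooled_hr, pooled_low, pooled_high = pool_fixed_effect(example_studies)
print(f"pooled HR = {pooled_hr:.2f} (95% CI {pooled_low:.2f}-{pooled_high:.2f})")
```

Each study is weighted by the inverse of its variance, so more precise studies dominate the pooled estimate and the pooled interval is narrower than any single study's interval.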
Abstract Background: The rarity of mutations in PALB2, CHEK2 and ATM makes it difficult to estimate the associated cancer risks precisely. Population-based family studies have provided evidence that at least some of these mutations are associated with breast cancer risk as high as those associated with rare BRCA2 mutations. We aimed to estimate the relative risks associated with specific rare variants in PALB2, CHEK2 and ATM via a multicentre case-control study. Methods: We genotyped 10 rare mutations using the custom iCOGS array: PALB2 c.1592delT, c.2816T>G and c.3113G>A, CHEK2 c.349A>G, c.538C>T, c.715G>A, c.1036C>T, c.1312G>T, and c.1343T>G and ATM c.7271T>G. We assessed associations with breast cancer risk (42 671 cases and 42 164 controls), as well as prostate (22 301 cases and 22 320 controls) and ovarian (14 542 cases and 23 491 controls) cancer risk, for each variant. Results: For European women, strong evidence of association with breast cancer risk was observed for PALB2 c.1592delT OR 3.44 (95% CI 1.39 to 8.52, p = 7.1 × 10−5), PALB2 c.3113G>A OR 4.21 (95% CI 1.84 to 9.60, p = 6.9 × 10−8) and ATM c.7271T>G OR 11.0 (95% CI 1.42 to 85.7, p = 0.0012). We also found evidence of association with breast cancer risk for three variants in CHEK2, c.349A>G OR 2.26 (95% CI 1.29 to 3.95), c.1036C>T OR 5.06 (95% CI 1.09 to 23.5) and c.538C>T OR 1.33 (95% CI 1.05 to 1.67) (p ≤ 0.017). Evidence for prostate cancer risk was observed for CHEK2 c.1343T>G OR 3.03 (95% CI 1.53 to 6.03, p = 0.0006) for African men and CHEK2 c.1312G>T OR 2.21 (95% CI 1.06 to 4.63, p = 0.030) for European men. No evidence of association with ovarian cancer was found for any of these variants. Conclusions: This report adds to accumulating evidence that at least some variants in these genes are associated with an increased risk of breast cancer that is clinically important.
Abstract Background: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry. Methods: Meta-analyses included summary estimates based on Cox models of twelve datasets using ~10.4 million variants for 96,661 women with breast cancer and 7697 events (breast cancer-specific deaths). Oestrogen receptor (ER)-specific analyses were based on 64,171 ER-positive (4116) and 16,172 ER-negative (2125) patients. We evaluated the probability of a signal to be a true positive using the Bayesian false discovery probability (BFDP). Results: We did not find any variant associated with breast cancer-specific mortality at P < 5 × 10−8. For ER-positive disease, the most significantly associated variant was chr7:rs4717568 (BFDP = 7%, P = 1.28 × 10−7, hazard ratio [HR] = 0.88, 95% confidence interval [CI] = 0.84–0.92); the closest gene is AUTS2. For ER-negative disease, the most significant variant was chr7:rs67918676 (BFDP = 11%, P = 1.38 × 10−7, HR = 1.27, 95% CI = 1.16–1.39), located within a long intergenic non-coding RNA gene (AC004009.3), close to the HOXA gene cluster. Conclusions: We uncovered germline variants on chromosome 7 at BFDP < 15% close to genes for which there is biological evidence related to breast cancer outcome. However, the paucity of variants associated with mortality at genome-wide significance underpins the challenge in providing genetic-based individualised prognostic information for breast cancer patients.
Abstract The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10−6, including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.
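The Bonferroni-corrected threshold quoted in the abstract above can be checked with a short calculation. The sketch below assumes a family-wise error rate of 0.05, which the abstract does not state explicitly; the function and constant names are illustrative.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


N_GENES_EVALUATED = 8597  # number of genes evaluated, from the abstract
ASSUMED_ALPHA = 0.05      # assumption: conventional family-wise error rate

threshold = bonferroni_threshold(ASSUMED_ALPHA, N_GENES_EVALUATED)
print(f"{threshold:.2e}")  # prints 5.82e-06
```

Under that assumption, dividing 0.05 by the 8,597 genes tested reproduces the reported threshold of P < 5.82 × 10−6.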
Abstract Quantifying the genetic correlation between cancers can provide important insights into the mechanisms driving cancer etiology. Using genome-wide association study summary statistics across six cancer types based on a total of 296,215 cases and 301,319 controls of European ancestry, here we estimate the pair-wise genetic correlations between breast, colorectal, head/neck, lung, ovary and prostate cancer, and between cancers and 38 other diseases. We observed statistically significant genetic correlations between lung and head/neck cancer (rg = 0.57, p = 4.6 × 10−8), breast and ovarian cancer (rg = 0.24, p = 7 × 10−5), breast and lung cancer (rg = 0.18, p = 1.5 × 10−6) and breast and colorectal cancer (rg = 0.15, p = 1.1 × 10−4). We also found that multiple cancers are genetically correlated with non-cancer traits including smoking, psychiatric diseases and metabolic characteristics. Functional enrichment analysis revealed a significant excess contribution of conserved and regulatory regions to cancer heritability. Our comprehensive analysis of cross-cancer heritability suggests that solid tumors arising across tissues share in part a common germline genetic basis.
Abstract Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to ~1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 × 10−8), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were ~8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets.
Abstract Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
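As a rough illustration of how a polygenic risk score of the kind described above is typically computed, the sketch below sums per-SNP effect weights multiplied by risk-allele dosages. It is not the published 313-SNP model: the function names, the three example weights and the log-linear scaling of the odds ratio are assumptions. Only the odds ratio of 1.61 per standard deviation is taken from the abstract.

```python
def polygenic_risk_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1 or 2 copies per SNP)."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


def relative_odds(z_score, or_per_sd=1.61):
    """Odds relative to the population mean for a standardised PRS.

    Assumes the log-odds are linear in the PRS, so the per-SD odds
    ratio (1.61 in the abstract) compounds multiplicatively.
    """
    return or_per_sd ** z_score


# Hypothetical per-SNP log odds ratios and one woman's genotype dosages.
example_weights = [0.10, 0.20, -0.05]
example_dosages = [0, 1, 2]
print(polygenic_risk_score(example_dosages, example_weights))
print(relative_odds(2.0))  # odds two SDs above the mean, relative to the mean
```

In practice the raw score is standardised against a reference population before a per-SD odds ratio is applied; that step is omitted here for brevity.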
Abstract Large consortia have revealed hundreds of genetic loci associated with anthropometric traits, one trait at a time. We examined whether genetic variants affect body shape as a composite phenotype that is represented by a combination of anthropometric traits. We developed an approach that calculates averaged PCs (AvPCs) representing body shape derived from six anthropometric traits (body mass index, height, weight, waist and hip circumference, waist-to-hip ratio). The first four AvPCs explain >99% of the variability, are heritable, and associate with cardiometabolic outcomes. We performed genome-wide association analyses for each body shape composite phenotype across 65 studies and meta-analysed summary statistics. We identify six novel loci: LEMD2 and CD47 for AvPC1, RPS6KA5/C14orf159 and GANAB for AvPC3, and ARL15 and ANP32 for AvPC4. Our findings highlight the value of using multiple traits to define complex phenotypes for discovery, which are not captured by single-trait analyses, and may shed light onto new pathways.
Abstract Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10−8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.
Abstract Genome-wide association studies (GWAS) have identified more than 170 breast cancer susceptibility loci. Here we hypothesize that some risk-associated variants might act in non-breast tissues, specifically adipose tissue and immune cells from blood and spleen. Using expression quantitative trait loci (eQTL) reported in these tissues, we identify 26 previously unreported, likely target genes of overall breast cancer risk variants, and 17 for estrogen receptor (ER)-negative breast cancer, several with a known immune function. We determine the directional effect of gene expression on disease risk measured based on single and multiple eQTL. In addition, using a gene-based test of association that considers eQTL from multiple tissues, we identify seven (and four) regions with variants associated with overall (and ER-negative) breast cancer risk, which were not reported in previous GWAS. Further investigation of the function of the implicated genes in breast and immune cells may provide insights into the etiology of breast cancer.
Abstract Nomenclatural type definitions are one of the most important concepts in biological nomenclature. Being physical objects that can be re-studied by other researchers, types permanently link taxonomy (an artificial agreement to classify biological diversity) with nomenclature (an artificial agreement to name biological diversity). Two proposals to amend the International Code of Nomenclature for algae, fungi, and plants (ICN), allowing DNA sequences alone (of any region and extent) to serve as types of taxon names for voucherless fungi (mainly putative taxa from environmental DNA sequences), have been submitted to be voted on at the 11th International Mycological Congress (Puerto Rico, July 2018). We consider various genetic processes affecting the distribution of alleles among taxa and find that alleles may not consistently and uniquely represent the species within which they are contained. Should the proposals be accepted, the meaning of nomenclatural types would change in a fundamental way from physical objects as sources of data to the data themselves. Such changes are conducive to irreproducible science, the potential typification on artefactual data, and massive creation of names with low information content, ultimately causing nomenclatural instability and unnecessary work for future researchers that would stall future explorations of fungal diversity. We conclude that the acceptance of DNA sequences alone as types of names of taxa, under the terms used in the current proposals, is unnecessary and would not solve the problem of naming putative taxa known only from DNA sequences in a scientifically defensible way. As an alternative, we highlight the use of formulas for naming putative taxa (candidate taxa) that do not require any modification of the ICN.