Authors |
Rebecca Keener, Margaret A. Taub, Matthew Conomos, Joshua Weinstock, John Lane, Kruthika Iyer, Lisa Yanek, Nathan Pankratz, Alexander Reiner, Rasika Mathias, Alexis Battle
|
Abstract Text |
Telomere length (TL) regulation is critical for human health. Individuals with very short TL exhibit Short Telomere Syndromes (STS) leading to organ failure, while individuals with very long TL may be predisposed to cancer. Genome-wide association studies (GWAS) have yielded several TL-associated loci, but the mechanisms underlying most of these signals remains poorly characterized. TL is reported to differ between ancestry groups, yet the majority of previous TL GWAS were limited to European individuals. We recently estimated TL from whole genome sequencing (WGS) bioinformatically using TelSeq in a pooled trans-ethnic study (N=109,122) from the Trans-Omics for Precision Medicine (TOPMed) Program. GWAS using TOPMed data yielded 59 independent, genome-wide significant signals at 36 loci, 21 of which were novel signals. To extend these discoveries beyond TOPMed, we performed a meta-analysis including three additional GWAS from European (1), Singaporean Chinese (2), and South Asian (3) ancestries and the TOPMed data stratified by descent (European, African, Hispanic/Latino, Asian, Brazilian and Samoan) and identified an additional 18 novel loci for a total of 54 GWAS loci. The genes closest to the GWAS and meta-analysis signals were enriched for gene ontology terms involving telomere biology, DNA repair, and DNA metabolism. To understand the mechanisms underlying these signals we performed credible set analysis, colocalization analysis, and functional characterization of all signals. These fine-mapping approaches identified several zinc-finger nucleases and transcription factors (TF) as likely causal genes. ZCCHC8 is a zinc-finger protein and rare variants in ZCCHC8 are reported to cause STS; BANP is a TF which contributes to T cell development and STS patients are reported to have T-cell immunodeficiency. The diversity in our dataset made it possible to identify population-specific signals driven by differences in allele frequency, which included signals near OBFC1, a telomere binding protein. Colocalization analysis with expression quantitative trait loci (eQTLs) from GTEx tissues demonstrated that all signals near OBFC1 colocalize with OBFC1 eQTLs. Notably, the primary OBFC1 signal was driven by European, African, and Hispanic/Latino individuals while an independent, secondary signal was driven exclusively by Asian and Hispanic/Latino individuals. Our results demonstrate the value of including diverse ancestries in GWAS and of WGS in the identification of ancestry-specific and low frequency signals to better understand genetic variation underlying TL regulation.
1. PMC7058826
2. PMC6554354
3. PMC5749304
|