Abstract Text |
BACKGROUND: Plasma lipid levels, which include low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides (TG), and total cholesterol (TC), are heritable risk factors and therapeutic targets for coronary heart disease. More than 350 loci have been associated with plasma lipid levels in genome-wide association studies. We now extend prior efforts to examine the full allelic spectrum with plasma lipids using whole genome sequencing.
METHODOLOGY: Whole genome sequenced (WGS) data (>30X coverage) and plasma lipids from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program was used for the analysis. Single variants with MAF > 0.1% from 22 autosomes were associated with the four lipid traits individually, while adjusted for covariates (PCs, age, sex, age^2, cohort-race and kinship). Single variant GWAS was carried out using a fast linear mixed model with kinship adjustment (SAIGE-QT). Significant common variants were identified with p- value <5e-9 and compared with previously reported GWAS summary
data.
RESULTS: We analyzed WGS data and plasma lipids from 66,329 samples in 21 studies (Amish, ARIC, BioMe, CARDIA, CFS, CHS, DHS, FHS, GeneSTAR,
GENOA, GenSalt, GOLDN, HCHS_SOL, HyperGEN, JHS, MESA, MGH_AF, SAFS, SAS, THRV and WHI). The dataset was multi-ethnic and included samples from Asian (4,719-7%), Black (16,983-26%), Hispanic (13,943-21%), Samoan (1,182-2%) and White (29,502-44%) ethnicities. Novel low frequency variants included, intronic variant 15q22.2 (MAF-0.1%) to gene RNF111 associated with HDL-C, intergenic variants 12q23.1 (MAF-0.3%), 4q34.2 (MAF-0.1%) associated with LDL-C, and intergenic variant 11q13.3 (MAF-0.01%), intronic variant 15q21.1 (MAF-0.3%) to gene SPG11 associated with TG. The low frequency variants were identified to be specific to certain ethnic groups, for example 15q22.2 variant specific to White population with MAF-0.1%, 12q23.1 to Hispanic population with MAF-1% and 4q34.2 to African ethnic group with MAF-0.6%. Additionally, we found common lead variants including 11p15.4 (MAF-6%) with LDL-C, and 11q12.2 (MAF-49%), 13q34 (MAF-28%), and 20q13.12 (MAF-2%) with TC; in prior analyses of the Million Veteran Program, these variants showed consistent associations.
CONCLUSIONS: Whole genome sequence analysis of plasma lipids in diverse ethnicities provides a platform to identify novel genetic associations.
|