Skip to main content

Blood Pressure

StocSum: a reference-panel-free summary statistics framework for diverse populations

Authors
Han Chen, Nannan Wang, Bing Yu, Goo Jun, Qibin Qi, Ramon A. Durazo-Arvizu, Sara Lindstrom, Alanna C. Morrison, Robert C. Kaplan, Eric Boerwinkle
Name and Date of Professional Meeting
IGES 33rd Annual Meeting (November 3-4, 2024)
Associated paper proposal(s)
Working Group(s)
Abstract Text
Genomic summary statistics have been widely used to address various scientific questions in genetic and genomic research. Applications that involve multiple genetic variants, such as conditional analysis, variant set and gene-based tests, heritability and genetic correlation estimation, also require correlation or linkage disequilibrium (LD) information between genetic variants, often obtained from an external reference panel. While these methods usually have good performance for common variants in populations of only European ancestry, in practice, it is usually difficult to find external reference panels that accurately represent the LD structure for isolated, underrepresented or admixed populations, as well as rare genetic variants from whole genome sequencing (WGS) studies, limiting their applications to European populations. To maximize the applicability of summary statistics-based methods and make them equally beneficial to all human populations, we have developed StocSum, a novel reference-panel-free statistical framework for generating, managing, and analyzing stochastic summary statistics using random matrix algorithms. Using two cohorts from the Trans-Omics for Precision Medicine Program, we demonstrate the accuracy of StocSum-based LD measures as compared to those directly computed from individual-level genotype data, in European-, African-, and Hispanic/Latino-Americans. We also show that for admixed populations such as African- and Hispanic/Latino-Americans, LD measures computed from external reference panels perform much worse, even if all ancestry populations are included in those reference panels. As a reference-panel-free framework, StocSum will facilitate sharing and utilization of genomic summary statistics from WGS studies, especially for isolated, underrepresented and admixed populations.

Genetic and phenotypic association analyses of cardiometabolic traits in diverse African samples with whole-genome sequencing data

Authors
Daniel Hui*, Matt Hansen*, Daniel Harris, Michael McQuillan, Dan Ju, Alexander Platt, William Beggs, Sunungouko Wata Mpoloka, Gaonyadiwe George Mokone, Gurja Belay, Thomas Nyambo, Stephen Chanock, Meredith Yeager, TOPMed Consortium, Giorgio Sirugo, Marylyn D. Ritchie, Scott Williams, Sarah A. Tishkoff
Name and Date of Professional Meeting
American Society of Human Genetics, November 2023
Associated paper proposal(s)
Working Group(s)
Abstract Text
African populations demonstrate exceptional genetic and phenotypic diversity, due in part to their varied environments, lifestyles, and demographic history. We conducted genetic and phenotypic association analyses in 6,965 geographically and ethnically diverse Sub-Saharan African individuals (6,280 with whole-genome sequences from the NIH TOPMed consortium and 685 with genotypes from Illumina arrays), using 15 cardiometabolic phenotypes (range 686-6,854 individuals/trait). Each phenotype had at least one ethnicity with significantly differing mean values compared to the remaining cohort, such as short stature in the Baka rainforest hunter-gatherers of Cameroon, and high adiposity in the Herero pastoralists of Botswana. An analysis of ethnicity-sex interactions revealed several ethnic groups with significant sexual dimorphism for at least one cardiometabolic phenotype, such as Herero women having markedly higher body mass index than men. Comparison between the African cohort and African ancestry UK Biobank (UKBB) individuals showed the latter have higher mean values than any of the 53 African ethnic groups for multiple cardiometabolic measurements, including low density lipoprotein cholesterol (LDL), body fat percentage (BFP), and systolic blood pressure. We also found that phenotype-phenotype correlations differ between the UKBB and African cohort, as well as between African ethnicities. For example, BFP and LDL had low correlation in the UKBB (R=0.04) but showed a range of correlation among African groups, from R = 0.00 in the Maasai pastoralists of eastern Africa to R = 0.43 in the Agaw agriculturalists of Ethiopia. Genome-wide association analyses identified 76 significantly associated loci (p<5.0x10-8), with 14 passing a more stringent empirical threshold (p<3.0x10-9), including APOE and APOC1 loci for various blood lipids, PCSK9 for LDL, and CETP for high density lipoprotein cholesterol (HDL), as well as novel loci. Set-based rare variant analyses for loss-of-function variants found 12 gene-phenotype associations replicating known associations with PCSK9 and APOE for LDL and total cholesterol and uncovering several novel gene-trait associations for adiposity traits and HDL. Ongoing analyses include phenotype associations with subsistence and genetically inferred ancestry, replication of genetic associations, and gene-set enrichment. In total, these results offer insights into the genetic and phenotypic landscape of cardiometabolic traits in African populations. This work was supported by grant numbers: ADA 1–19-VSN-02, NIH grants 1R35GM134957, R01DK104339, and R01AR076241, and 1X01HL139409-01.

Genome-wide association studies of blood pressure phenotypes in 4,819 participants from Samoa and American Samoa

Authors
T-A. Yapp, J. Z. Zhang, M. Krishnan, S. Liu, B. M. Blobner, H. Cheng, T. Naseri, M. S. Reupena, S. Viali, J. Tuitele, R. Deka, N. L. Hawley, S. T. McGarvey, D. E. Weeks, R. L. Minster, J. C. Carlson
Name and Date of Professional Meeting
ASHG (Nov 1-5, 2023)
Associated paper proposal(s)
Working Group(s)
Abstract Text
Exploring the genetic determinants of blood pressure (BP) phenotypes in Polynesian individuals could help address inequity in research with marginalized populations and has the potential to provide additional insight to what is known from other populations around the globe about the biological foundations of such traits. In the current study, we performed genome-wide associations studies of five traits—systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial pressure (MAP), pulse pressure (PP), and hypertension (HTN)—in 4,819 research participants across five subsamples from Samoa and American Samoa. Genotypes were measured via two methods: 895,103 variants were genotyped in a subsample of 3,053 individuals from Samoa with the Affymetrix 6.0 array and 669,569 variants were genotyped in a subsample of 1,766 individuals from Samoa and American Samoa with the Illumina Global Screening Array. We then imputed up to 7,408,897 variants using a haplotype reference panel based on 1,285 participants from Samoa that were whole-genome sequenced by the TOPMed Program. We performed association testing of inverse-normally transformed traits in each subsample separately using linear or logistic mixed models, adjusting for fixed effects of age, sex, polity, principal components of ancestry and a kinship random effect. The subsamples were meta-analyzed using the p values and effect direction with weighting by the subsample size. No variants were observed to be genome-wide significant; variants at fourteen unique loci associated with BP phenotypes were observed at p < 1 × 10−7. Several of these loci have been associated with BP phenotypes before: FGF5, associated here with SBP and MAP, has been associated with HTN previously; KIF15, associated with SBP here, has been associated with PP; KCNK3, here with SBP, with SBP, DBP, and MAP; DOT1L, here with DBP and MAP, with SBP and PP previously; and PDILT, here with HTN, has been associated with SBP, DBP, and HTN. PLPPR2, associated in this study with SBP and MAP, however, has not been, to our knowledge, been associated with BP phenotypes before. Little is known about the function of PLPPR2; it is predicted to be involved in phospholipid processing and signal transduction. In this analysis, both known and novel genetic loci associated with BP phenotypes were observed, suggesting that the genetic architecture of blood pressure and hypertension in individuals from Samoa and American Samoa is affected by both universal and unique genetic determinants. Further investigation and validation of these results will be necessary to determine the biological mechanisms of these associations in cardiovascular health.

StocSum: stochastic summary statistics for whole genome sequencing studies

Authors
Nannan Wang, Bing Yu, Goo Jun, Qibin Qi, Ramon A. Durazo-Arvizu, Sara Lindstrom, Alanna C. Morrison, Robert C. Kaplan, Eric Boerwinkle, Han Chen
Name and Date of Professional Meeting
ASHG Meeting (November 1-5, 2023)
Associated paper proposal(s)
Working Group(s)
Abstract Text
Genomic summary statistics, usually defined as single-variant test results from genome-wide association studies, have been widely used to address different scientific questions in genetic and genomic research, such as meta-analysis, heritability estimation, conditional analysis, variant set and gene-based tests, multiple phenotype analysis, genetic correlation or co-heritability estimation. Applications that involve multiple genetic variants also require their correlations or linkage disequilibrium (LD) information, often obtained from an external reference panel. While these methods usually have good performance for common variants in populations of European ancestry, in practice, it is usually difficult to find suitable external reference panels that represent the LD structure for isolated, underrepresented and admixed populations, or rare genetic variants from whole genome sequencing (WGS) studies, limiting the scope of applications for genomic summary statistics. We have developed StocSum, a novel reference-panel-free statistical framework for generating, managing, and analyzing stochastic summary statistics using random vector algorithms. Regardless of the complex sample correlation structure, StocSum always scales linearly with both the sample size and the number of genetic variants in computing stochastic summary statistics from individual-level data. We develop various downstream applications using StocSum including single-variant tests, conditional association tests, gene-environment interaction tests, variant set tests, as well as meta-analysis and LD score regression tools. The complexity of all these downstream applications does not depend on the sample size. We demonstrate the accuracy and computational efficiency of StocSum using two cohorts from the Trans-Omics for Precision Medicine Program. Specifically, we show that StocSum can be used to perform long-range variant set tests, expanding the aggregation units beyond genes or genomic regions in close proximity. We also show that for admixed populations, LD scores estimated by StocSum are much more accurate compared to those from external reference panels, even if all ancestry populations are included in those reference panels. In summary, as a reference-panel-free framework, StocSum will facilitate sharing and utilization of genomic summary statistics from WGS studies, especially for isolated, underrepresented and admixed populations.

StocSum: stochastic summary statistics for whole genome sequencing studies

Authors
Nannan Wang, Bing Yu, Goo Jun, Qibin Qi, Ramon A. Durazo-Arvizu, Sara Lindstrom, Alanna C. Morrison, Robert C. Kaplan, Eric Boerwinkle, Han Chen
Name and Date of Professional Meeting
Joint Statistical Meetings (August 5-10, 2023)
Associated paper proposal(s)
Working Group(s)
Abstract Text
Genomic summary statistics, usually defined as single-variant test results from genome-wide association studies, have been widely used to advance the genetics field in a wide range of applications. Applications that involve multiple genetic variants also require their correlations or linkage disequilibrium (LD) information, often obtained from an external reference panel. In practice, it is usually difficult to find suitable external reference panels that represent the LD structure for underrepresented and admixed populations, or rare variants from whole genome sequencing (WGS) studies, limiting the scope of applications for genomic summary statistics. We develop StocSum, a novel reference-panel-free statistical framework for generating, managing, and analyzing stochastic summary statistics using random vector algorithms. Regardless of the complex sample correlation structure, StocSum always scales linearly with both the sample size and the number of genetic variants in computing stochastic summary statistics from individual-level data. We demonstrate the accuracy and computational efficiency of StocSum using two cohorts from the Trans-Omics for Precision Medicine WGS studies.
Back to top