Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans.

The Journal of clinical investigation
Authors
Keywords
Abstract

BACKGROUNDMost GWAS of plasma proteomics have focused on White individuals of European ancestry, limiting biological insight from other ancestry-enriched protein quantitative loci (pQTL).METHODSWe conducted a discovery GWAS of approximately 3,000 plasma proteins measured by the antibody-based Olink platform in 1,054 Black adults from the Jackson Heart Study (JHS) and validated our findings in the Multi-Ethnic Study of Atherosclerosis (MESA). The genetic architecture of identified pQTLs was further explored through fine mapping and admixture association analysis. Finally, using our pQTL findings, we performed a phenome-wide association study (PheWAS) across 2 large multiethnic electronic health record (EHR) systems in All of Us and BioMe.RESULTSWe identified 1,002 pQTLs for 925 protein assays. Fine mapping and admixture analyses suggested allelic heterogeneity of the plasma proteome across diverse populations. We identified associations for variants enriched in African ancestry, many in diseases that lack precise biomarkers, including cis-pQTLs for cathepsin L (CTSL) and Siglec-9, which were linked with sarcoidosis and non-Hodgkin's lymphoma, respectively. We found concordant associations across clinical diagnoses and laboratory measurements, elucidating disease pathways, including a cis-pQTL associated with circulating CD58, WBC count, and multiple sclerosis.CONCLUSIONSOur findings emphasize the value of leveraging diverse populations to enhance biological insights from proteomics GWAS, and we have made this resource readily available as an interactive web portal.FUNDINGNIH K08 HL161445-01A1; 5T32HL160522-03; HHSN268201600034I; HL133870.

Year of Publication
2024
Journal
The Journal of clinical investigation
Volume
134
Issue
21
Date Published
09/2024
ISSN
1558-8238
DOI
10.1172/JCI181802
PubMed ID
39316441
Links