Skip to main content
Editorial
Originally Published 13 March 2018
Free Access

Integrative Omics: Harnessing the Proteome to Maximize the Potential of the Genome

Article, see p 1158
Great strides have been made in dissecting the genetics of cardiovascular disease (CVD) through genome-wide association studies and whole exome/genome sequencing, although the full heritability remains to be elucidated. Furthermore, biological insights into the associations of common variants with complex traits are often lacking. Although static DNA variation is at the core of variability between humans, viewed in isolation it can provide only a snapshot of the complex landscape of the dynamic, temporal evolution of CVD.
One powerful approach to attempt to harness this dynamic variability is to study intermediate phenotypes that are more proximal to genomic variation than complex clinical end points. Although they can be difficult to measure accurately given the temporal nature of CVD, when analyzed on the foundation of the static genome, integrated studies can identify genomic loci that influence disease through these intermediate biomarkers. This principle underlies quantitative trait loci (QTL) studies, where genetic variation is integrated with other omics data (eg, transcriptomics, metabolomics, proteomics). Once these associations are established, the genetic variants and biomarkers can then be studied in a focused manner to determine what, if any, causal relation they have to the disease of interest. In addition, if done at the genome-wide level, these studies provide a valuable resource to identify possible functional effects of disease-associated genetic variants. The CVD community is just beginning to embark on such integrative studies that capitalize on the large amount of molecular data now available in CVD cohorts.
In this issue of Circulation, Benson et al1 do just this, elegantly harnessing the power of the proteome to further our understanding of CVD genetics. Integrating genome-wide and exome array genetic data with proteomic data profiled by using a novel aptameter-based multiplex assay, they identify protein QTL (pQTL) in the Framingham Heart Study (FHS). Specifically, they performed pQTL analysis in the FHS Offspring cohort (n=759) on 156 proteins that they had previously identified as being associated with the Framingham CVD 10-year risk score.2 They then performed validation studies in the MDCS (Malmö Diet and Cancer Study) (n=1421). Their results highlight the multifaceted output that can be extracted from population-based studies. First, taking advantage of the family-based structure of FHS, they show that a large amount of variance (>20%) in the majority protein levels (≈92%) can be explained by heritable factors, whereas the clinical components of the Framingham CVD risk score explain the same degree of variance in only 16% of the proteins. Genome-wide association studies of the 156 proteins identified 120 pQTLs that reached genome-wide significance in FHS, with 75 validating in MDCS. Four of the pQTLs had been identified as CVD risk loci in the large CARDIoGRAMplusC4D (Coronary Artery Disease Genome-wide Replication and Meta-analysis) consortium, and several additional pQTLs had been identified as risk loci in other studies. They identified another 41 pQTLs in the exome array analysis, which was restricted to variants that were likely to have a functional effect (eg, nonsynonymous, stop-altering or splice-altering substitutions), with 27 validating in MDCS. As a means to determine potential causality, they performed Mendelian randomization of the data, demonstrating 8 examples where proteins were likely causally associated with clinical risk factors. Last, they looked for genetic variants that were associated with multiple different protein levels, suggesting that these loci might report on central pathways mediating CVD risk. They found 3 of these pleotropic variants near the PPM1G, FOXP1, and FAM20A genes. The variant near PPM1G, which was associated with apolipoprotein E levels in this study, had previously been associated with circulating total cholesterol levels.3 Experiments in Hep G2 human liver hepatocellular carcinoma cell lines showed that knockdown of PPM1G resulted in decreased ApoE gene expression and decreased protein accumulation in the culture media. Although further experiments are needed to dissect the full molecular genetics and biology of these relationships, they do provide initial proof-of-principle mechanistic studies to confirm the statistical associations seen from this discovery study.
Many of the identified genetic variants were in cis (defined in this study as <1 mega base pairs) with the coding gene for the proteins and were in linkage disequilibrium with mutations in the coding genes that could plausibly directly impact circulating levels. Thus, these newly identified variants provide insight into the genetic regulation of protein abundance. In addition, several new trans variants (distant from the coding gene of the associated protein) were identified. These pQTLs provide important insight into previously unappreciated regulatory networks mediating CVD risk. For example, they found a pQTL for galectin-3–binding protein, a protein that has previously been associated with several CVD phenotypes, that is a nonsense mutation residing within the macrophage scavenger receptor type I protein (MSR1) gene, thus suggesting that variation in the MSR1 gene may modulate the macrophage lectin-binding activity of galectin-3–binding protein. Further experimental validation is needed to test this hypothesis.
The proteomics platform used in this study uses an emerging DNA-aptamer immunoaffinity technology. The power of this medium-scale platform allows scale evaluation of a larger number of proteins than afforded by traditional assays, while providing less complex analytics for protein identification and annotation required by unbiased mass spectrometry–based proteomics. This and other such medium-scale proteomics platforms are now available and, as demonstrated by this study, will likely have a significant impact on the discovery of novel biomarkers and molecular pathways in CVD and other diseases. It is important to note that these platforms provide a snapshot of the global proteome (1129 proteins), and that this study only examined a subset of those that were associated with the Framingham CVD risk score. Newer generations of the platform provide coverage of >4700 proteins4; pQTL studies using this expanded data set would describe even more of the genetic architecture underlying the cardiovascular risk proteome. Aptamer-based and -related proteomic technologies also have unique caveats that must be considered, because there is variability in the binding affinity and specificity for different proteins. For example, in this study, proteins represented by aptamers with lower binding affinity/specificity could result in pQTLs that account for less of the variance in protein levels. As this group has previously done, aptamer specificity for top protein hits can be validated using mass spectrometry.2
Overall, this important integrated omics study provides a valuable pQTL atlas as a publicly available resource to further study the genetic architecture that mediates intermediate markers of CVD risk and to identify potential new therapeutic targets. As with many functional genomics studies, these statistical associations serve as only the beginning of our understanding of their significance. Mendelian randomization approaches may point toward causality of variants on clinical or intermediate risk factors; however, as the authors did for their PPM1G variant, functional validation will be necessary to confirm the proposed impact of other variants on protein levels and function. This is particularly important because identified genetic variants may not directly mediate protein levels but rather be in linkage disequilibrium with true functional variants. It is important to note that the proteomic and genotyping primary data have been uploaded to the publicly available dbGAP. Such resource sharing is critical to allow other investigators to continue to explore the associations described here and to perform the necessary functional validation; the authors should be lauded for their commitment to data sharing. Moreover, the data can be combined with similar data sets through meta-analyses to increase the power to detect new pQTLs, and be used for integration of other molecular data in FHS, such as metabolomics,5 to approach a true systems biology perspective of the CVD risk landscape.

References

1.
Benson MD, Yang Q, Ngo D, Zhu Y, Shen D, Farrell LA, Sinha S, Keyes MJ, Vasan RS, Larson MG, Smith JG, Wang TJ, Gerszten RE. Genetic architecture of the cardiovascular risk proteome. Circulation. 2018;137:1158–1172. doi: 10.1161/CIRCULATIONAHA.117.029536.
2.
Ngo D, Sinha S, Shen D, Kuhn EW, Keyes MJ, Shi X, Benson MD, O’Sullivan JF, Keshishian H, Farrell LA, Fifer MA, Vasan RS, Sabatine MS, Larson MG, Carr SA, Wang TJ, Gerszten RE. Aptamer-based proteomic profiling reveals novel candidate biomarkers and pathways in cardiovascular disease. Circulation. 2016;134:270–285. doi: 10.1161/CIRCULATIONAHA.116.021803.
3.
Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Ripatti S, Chasman DI, Willer CJ, Johansen CT, Fouchier SW, Isaacs A, Peloso GM, Barbalic M, Ricketts SL, Bis JC, Aulchenko YS, Thorleifsson G, Feitosa MF, Chambers J, Orho-Melander M, Melander O, Johnson T, Li X, Guo X, Li M, Shin Cho Y, Jin Go M, Jin Kim Y, Lee JY, Park T, Kim K, Sim X, Twee-Hee Ong R, Croteau-Chonka DC, Lange LA, Smith JD, Song K, Hua Zhao J, Yuan X, Luan J, Lamina C, Ziegler A, Zhang W, Zee RY, Wright AF, Witteman JC, Wilson JF, Willemsen G, Wichmann HE, Whitfield JB, Waterworth DM, Wareham NJ, Waeber G, Vollenweider P, Voight BF, Vitart V, Uitterlinden AG, Uda M, Tuomilehto J, Thompson JR, Tanaka T, Surakka I, Stringham HM, Spector TD, Soranzo N, Smit JH, Sinisalo J, Silander K, Sijbrands EJ, Scuteri A, Scott J, Schlessinger D, Sanna S, Salomaa V, Saharinen J, Sabatti C, Ruokonen A, Rudan I, Rose LM, Roberts R, Rieder M, Psaty BM, Pramstaller PP, Pichler I, Perola M, Penninx BW, Pedersen NL, Pattaro C, Parker AN, Pare G, Oostra BA, O’Donnell CJ, Nieminen MS, Nickerson DA, Montgomery GW, Meitinger T, McPherson R, McCarthy MI, McArdle W, Masson D, Martin NG, Marroni F, Mangino M, Magnusson PK, Lucas G, Luben R, Loos RJ, Lokki ML, Lettre G, Langenberg C, Launer LJ, Lakatta EG, Laaksonen R, Kyvik KO, Kronenberg F, König IR, Khaw KT, Kaprio J, Kaplan LM, Johansson A, Jarvelin MR, Janssens AC, Ingelsson E, Igl W, Kees Hovingh G, Hottenga JJ, Hofman A, Hicks AA, Hengstenberg C, Heid IM, Hayward C, Havulinna AS, Hastie ND, Harris TB, Haritunians T, Hall AS, Gyllensten U, Guiducci C, Groop LC, Gonzalez E, Gieger C, Freimer NB, Ferrucci L, Erdmann J, Elliott P, Ejebe KG, Döring A, Dominiczak AF, Demissie S, Deloukas P, de Geus EJ, de Faire U, Crawford G, Collins FS, Chen YD, Caulfield MJ, Campbell H, Burtt NP, Bonnycastle LL, Boomsma DI, Boekholdt SM, Bergman RN, Barroso I, Bandinelli S, Ballantyne CM, Assimes TL, Quertermous T, Altshuler D, Seielstad M, Wong TY, Tai ES, Feranil AB, Kuzawa CW, Adair LS, Taylor HA, Borecki IB, Gabriel SB, Wilson JG, Holm H, Thorsteinsdottir U, Gudnason V, Krauss RM, Mohlke KL, Ordovas JM, Munroe PB, Kooner JS, Tall AR, Hegele RA, Kastelein JJ, Schadt EE, Rotter JI, Boerwinkle E, Strachan DP, Mooser V, Stefansson K, Reilly MP, Samani NJ, Schunkert H, Cupples LA, Sandhu MS, Ridker PM, Rader DJ, van Duijn CM, Peltonen L, Abecasis GR, Boehnke M, Kathiresan S. Biological, clinical and population relevance of 95 loci for blood lipids. Nature. 2010;466:707–713. doi: 10.1038/nature09270.
4.
Jacob J, Ngo D, Finkel N, Pitts R, Gleim S, Benson MD, Keyes MJ, Farrell LA. Application of large scale aptamer-based proteomic profiling to “planned” myocardial infarctions [published online ahead of print December 8, 2017]. Circulation. doi: 10.1161/CIRCULATIONAHA.117.029443.
5.
Rhee EP, Ho JE, Chen MH, Shen D, Cheng S, Larson MG, Ghorbani A, Shi X, Helenius IT, O’Donnell CJ, Souza AL, Deik A, Pierce KA, Bullock K, Walford GA, Vasan RS, Florez JC, Clish C, Yeh JR, Wang TJ, Gerszten RE. A genome-wide association study of the human metabolome in a community-based cohort. Cell Metab. 2013;18:130–143. doi: 10.1016/j.cmet.2013.06.013.

eLetters(0)

eLetters should relate to an article recently published in the journal and are not a forum for providing unpublished data. Comments are reviewed for appropriate use of tone and language. Comments are not peer-reviewed. Acceptable comments are posted to the journal website only. Comments are not published in an issue and are not indexed in PubMed. Comments should be no longer than 500 words and will only be posted online. References are limited to 10. Authors of the article cited in the comment will be invited to reply, as appropriate.

Comments and feedback on AHA/ASA Scientific Statements and Guidelines should be directed to the AHA/ASA Manuscript Oversight Committee via its Correspondence page.

Information & Authors

Information

Published In

Go to Circulation
Go to Circulation
Circulation
Pages: 1173 - 1175
PubMed: 29530892

History

Published online: 13 March 2018
Published in print: 13 March 2018

Permissions

Request permissions for this article.

Keywords

  1. Editorials
  2. biomarkers
  3. cardiovascular diseases
  4. genetics
  5. genome-wide association study
  6. proteomics

Subjects

Authors

Affiliations

Robert W. McGarrah, MD
Division of Cardiology, Department of Medicine, Duke University School of Medicine, Durham, NC (R.W.M., S.H.S.).
Svati H. Shah, MD, MS, MHS
Division of Cardiology, Department of Medicine, Duke University School of Medicine, Durham, NC (R.W.M., S.H.S.).
Duke Molecular Physiology Institute, Durham, NC (R.W.M., S.H.S.). Duke Clinical Research Institute, Durham, NC (S.H.S.).

Notes

The opinions expressed in this article are not necessarily those of the editors or of the American Heart Association.
Svati H. Shah, MD, MS, MHS, 300 North Duke Street, Duke University, Durham NC 27701. E-mail [email protected]

Disclosures

None.

Metrics & Citations

Metrics

Citations

Download Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Select your manager software from the list below and click Download.

  1. Towards Allograft Longevity: Leveraging Omics Technologies to Improve Heart Transplant Outcomes, Current Heart Failure Reports, 20, 6, (493-503), (2023).https://doi.org/10.1007/s11897-023-00631-z
    Crossref
Loading...

View Options

View options

PDF and All Supplements

Download PDF and All Supplements

PDF/EPUB

View PDF/EPUB
Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Personal login Institutional Login
Purchase Options

Purchase this article to access the full text.

Purchase access to this article for 24 hours

Integrative Omics
Circulation
  • Vol. 137
  • No. 11

Purchase access to this journal for 24 hours

Circulation
  • Vol. 137
  • No. 11
Restore your content access

Enter your email address to restore your content access:

Note: This functionality works only for purchases done as a guest. If you already have an account, log in to access the content to which you are entitled.

Media

Figures

Other

Tables

Share

Share

Share article link

Share

Comment Response