Evaluating the Contribution of Cell Type–Specific Alternative Splicing to Variation in Lipid Levels
Circulation: Genomic and Precision Medicine
Abstract
Background:
Genome-wide association studies have identified hundreds of loci associated with lipid levels. However, the genetic mechanisms underlying most of these loci are not well-understood. Recent work indicates that changes in the abundance of alternatively spliced transcripts contribute to complex trait variation. Consequently, identifying genetic loci that associate with alternative splicing in disease-relevant cell types and determining the degree to which these loci are informative for lipid biology is of broad interest.
Methods:
We analyze gene splicing in 83 sample-matched induced pluripotent stem cell (iPSC) and hepatocyte-like cell lines (n=166), as well as in an independent collection of primary liver tissues (n=96) to perform discovery of splicing quantitative trait loci (sQTLs).
Results:
We observe that transcript splicing is highly cell type specific, and the genes that are differentially spliced between iPSCs and hepatocyte-like cells are enriched for metabolism pathway annotations. We identify 1384 hepatocyte-like cell sQTLs and 1455 iPSC sQTLs at a false discovery rate of <5% and find that sQTLs are often shared across cell types. To evaluate the contribution of sQTLs to variation in lipid levels, we conduct colocalization analysis using lipid genome-wide association data. We identify 19 lipid-associated loci that colocalize either with an hepatocyte-like cell expression quantitative trait locus or sQTL. Only 2 loci colocalize with both a sQTL and expression quantitative trait locus, indicating that sQTLs contribute information about genome-wide association studies loci that cannot be obtained by analysis of steady-state gene expression alone.
Conclusions:
These results provide an important foundation for future efforts that use iPSC and iPSC-derived cells to evaluate genetic mechanisms influencing both cardiovascular disease risk and complex traits in general.
Genome-wide association studies have identified hundreds of loci associated with plasma lipid levels, an important set of predictive and causal risk factors for cardiovascular disease.1 The majority of variants associated with these traits are found in the noncoding genome and are hypothesized to mechanistically influence complex traits through changes in gene expression. For example, at SORT1, an established locus associated with both LDL (low-density lipoprotein) cholesterol levels and coronary heart disease, a functional noncoding variant creates a novel CCAAT/enhancer binding protein (C/EBP) binding site, leading to changes in hepatic SORT1 expression and in turn, changes in circulating LDL cholesterol levels.2 Motivated by this and other examples, significant effort has been dedicated to identifying variants associated with changes in gene expression (expression quantitative trait loci; eQTLs) that are also associated with changes in plasma lipid levels.
Recent research indicates that variants associated with changes in the proportion of alternatively spliced transcript isoforms, or splicing quantitative trait loci (sQTLs) can provide another contributing mechanism underlying complex traits and may be as informative as total gene expression quantitative trait loci in some cases.3,4 For example, sQTLs discovered in lymphoblast cell lines are enriched in multiple sclerosis genome-wide association studies (GWAS) disease loci, and other studies report similar findings for sQTL enrichment in GWAS for schizophrenia and type-2 diabetes.4–6 Variants affecting splicing have also been linked to systemic lupus erythematosus and fatty acid metabolism.7–9 Splicing is an attractive mechanism for study because it can be targeted with antisense oligonucleotides, several of which are currently undergoing clinical trials to treat diseases resulting from aberrant splicing such as Duchenne Muscular Dystrophy and Spinal Muscular Atrophy.10 Finally, splicing provides an even finer level of detail on biological mechanism that cannot always be inferred through bulk expression analysis alone. For example, while the Alzheimer disease-associated gene ABCA7 has both an eQTL and sQTL association, the sQTL is hypothesized to be the causal variant associated with a nonfunctional transcript.11,12
Functional studies of QTLs have been hindered by the time and energy required to identify and then mechanistically characterize the QTLs in model systems. QTL discovery efforts in primary tissues have been highly productive but have several important drawbacks, such as the reliance on heterogeneous postmortem tissue collection and the difficulty of interrogating phenotypes in tissues. However, it is now possible to generate individual-specific, renewable, induced pluripotent stem cell (iPSC) lines, which can be used to both identify and characterize QTLs in specific cell types. iPSCs can be differentiated into a variety of cell types, including hepatocyte-like cells (HLCs).13 Given the liver’s importance in the synthesis and uptake of lipids,14 using HLCs as a model to understand the mechanisms underlying genetic associations with lipid levels is of particular interest. Previous research has demonstrated the utility of these cell models by identifying and characterizing eQTLs in these HLCs.15,16
Therefore, this report has two goals, namely (1) to evaluate the feasibility of using HLC/iPSC lines for sQTL discovery, and (2) to determine whether the sQTLs discovered are informative for our understanding of lipid biology. Therefore, we mapped sQTLs in 83 individual-matched iPSC lines and iPSC-derived HLCs, on which we previously conducted genotyping and paired-end RNA-sequencing. To evaluate the degree to which the genetic control of splicing information compares to total gene expression, we also perform a variety of analyses evaluating differential splicing, differential expression, and colocalization analysis with both HLC sQTLs and HLC eQTLs. We then highlight two loci where the colocalization of GWAS and sQTL results inform the mechanism underlying the variant-trait association.
Methods
The data that support the findings of this study are available from the corresponding author upon reasonable request. Summary results generated for this report are anonymized and deidentified and are available at: doi.org/10.5281/zenodo.7864256. Data utilized in this study were derived from a previous report.15 Briefly, the University of Pennsylvania Human Subjects Research Institutional Review Board approved the previous study protocol used to generate the lines, and all subjects gave written informed consent for study participation. Full methods for the presented are provided in Supplemental Material.
Results
iPSC and HLC Samples Have Distinct Profiles of Gene Expression and Alternative Splicing
We first sought to compare the patterns of alternative splicing and gene expression between iPSCs and HLCs. After normalization and standardization procedures (Supplemental Material), we applied Leafcutter to identify genes with differences in proportions of alternatively spliced transcripts between cell types. Five thousand nine hundred eighty-six genes out of 11 732 genes tested are differentially spliced between iPSC and HLC samples (false discovery rate, FDR <5%). The first two principal components of the quantile normalized and standardized splicing proportions clearly separate the samples by cell type (Figure 1A). Gene ontology analysis also demonstrated that differentially spliced genes that had at least 10% splicing difference across cell types were enriched for metabolism-relevant Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways such as glycosphingolipid biosynthesis (Figure 1B).
We next compared patterns of gene expression between the iPSC and HLC lines (Supplemental Material). Virtually all (22 134 out of 23 857) tested genes were differentially expressed between iPSCs and HLCs (FDR<5%), with 6168 having an absolute log2 fold change of ≥2. Differentially expressed genes that were more highly expressed in HLCs (relative to iPSCs) were enriched for KEGG pathways pertaining to hepatocyte function such as cholesterol and drug metabolism (Figure S1C). The first two principal components obtained from the normalized expression profiles for the iPSCs and HLCs clearly separated samples by cell type (Figure S1A). In sum, we found substantial differences in alternative splicing and gene expression between iPSC and HLCs, and the top differentially expressed or alternatively spliced genes were enriched for liver, lipid, and metabolism-relevant pathways. These results indicate that the successful differentiation of the iPSCs into HLCs results in broad changes in steady-state transcriptional and post-transcriptional regulation.
Thousands of sQTLs Identified Across iPSC and HLC Samples
To identify genetic variants associated with differences in the proportion of alternatively spliced transcripts (ie, sQTLs), we conducted sQTL discovery scans in both cell types using QTLtools’ permutation-based scan to identify the most highly associated differentially spliced intron per gene (Supplemental Material). After filtering for intron clusters that map to annotated genes, we obtained 13 077 genes with testable introns in HLCs and 12 543 genes with testable introns in iPSCs. We identify 1384 sQTLs in HLCs and 1455 sQTLs in iPSCs (FDR<5%). iPSC sQTLs correspond to 1436 unique sentinel single nucleotide polymorphisms (SNPs) and 1455 unique genes, whereas HLC sQTLs correspond to 1368 unique sentinel SNPs and 1384 unique genes.
To evaluate the extent to which iPSC and HLC sQTLs are discoverable in primary liver tissue, we performed replication analysis in the Gene-Tissue Expression project v6 primary liver samples (n=96).17 Compared to iPSC sQTLs, HLC sQTLs were slightly more likely to be replicated at the gene level in primary liver tissue as expected (π1=0.78 for HLC sQTLs versus π1=0.66 for iPSC sQTLs). These results demonstrate that sQTL discovery in iPSC and HLC samples is feasible and that both iPSC and HLC sQTLs replicate in primary liver tissue, with HLC sQTLs replicating at a higher degree than iPSCs.
sQTLs Are Found Near Their Associated Splice Event and Are Enriched for Splicing Relevant Annotations
Because alternative splicing involves protein regulatory machinery interacting with genomic elements near intron/exon boundaries, we expect true sQTL variants to be found predominantly in the regions near these boundaries. Thus, we assessed the genomic context of each sQTL variant and measured the distance between each variant and its associated splice event. The majority of sentinel sQTL variants (75%) are within 25 kb of their associated splice event, although the sentinel variant may not be the ultimate causal variant. We observe similar patterns for sQTLs that fall within the intronic region of their splice event (in between the two exons that delineate the excised intron). After binning to normalize for intron length, we observe that the first and 10th deciles that are closest to canonical splice sites contain the highest numbers of sQTLs (Figure 2A and 2B). Finally, both sets of HLC and iPSC sQTLs are significantly enriched in exonic/intronic regions encompassing canonical splice sites and in several other genomic annotations (Figure 2C). These observations demonstrate that our identified sQTLs are often found close enough to canonical splicing regulatory elements to facilitate fine-mapping of causal variants and genes.
We also conducted credible set fine-mapping analysis of the colocalized sQTLs to identify 95% credible set SNPs; 75% of the colocalized sQTLs have a credible set SNP located within the sQTL-associated gene. Median credible set sizes for iPSC and HLC sQTLs were comparable to that of iPSC and HLC eQTLs—median credible set size for sQTLs is 10 and 9 SNPs for iPSC and HLC samples, respectively; median credible set size for eQTLs is 12 and 10 SNPs for iPSC and HLC samples. This suggests that sQTLs may be no more difficult to fine-map and functionally characterize than eQTLs.
Majority of sQTLs Are Shared Between iPSCs and HLCs
Previous research has indicated that sQTL effects are often shared across tissues.18,19 To evaluate the extent of sQTL sharing in our cell types, we examined the top one thousand sQTLs from the FDR<5% sQTL set for both the iPSC and HLC samples and used METASOFT to determine the extent to which sQTLs discovered in one cell type are found in the other. We find that if an alternatively spliced isoform is expressed in both cell types, the sQTL effect is observed in both cell types more than 90% of the time (Figure 3A and 3B). In contrast, 60% to 74% of eQTL effects are shared, appearing to be more cell type restricted (Figure 3C and 3D). However, this difference may simply be due to statistical power for discovery. The power to detect sQTLs is lower than that of eQTLs, due to a higher burden of multiple testing correction (Supplemental Material). In the Gene-Tissue Expression project, tissues with larger sample sizes report higher percentages of tissue-specific eQTLs compared to tissues with smaller sample sizes, indicating that power is correlated with tissue-specific eQTL discovery. The high percentage of shared sQTLs identified in this study may indicate similar power limitations.
HLC sQTLs Identify Trait-Relevant Genes and Mechanisms at GWAS Lipid Loci
To determine if our discovered sQTLs map to previously established associations for lipid traits in humans, we evaluated the colocalization probability that sQTL and GWAS signals share underlying causal architecture using coloc R package.20 Colocalization analysis of HLC sQTLs (FDR<5%) with genome-wide significant (P<5×10-8) loci from the Global Lipids Genetics Consortium identified 5, 3, 4, and 5 HLC sQTL genes colocalizing with HDL (high-density lipoprotein), LDL, triglycerides, and total cholesterol loci respectively (using a posterior probability [PP] cutoff of PP4/PP3+PP4≥0.9 and PP3+PP4≥0.8, Table 1).1 Colocalization analysis of FDR <5% HLC eQTLs identified 3, 5, 2, and 5 eQTL genes colocalized loci for HDL, LDL, triglycerides, and total cholesterol respectively (Table 2). Encouragingly, we observed colocalization of sQTL and eQTLs in genes with known roles in lipid biology (APOC1, ANGPTL3).15,21 In general, the colocalized sQTLs and eQTLs are associated with different genes, a result that is in line with previous studies.19 Only 2 genes colocalized for both an eQTL and sQTL, RPAP2 and DR1.
GLGC trait | sQTL–genes |
---|---|
HDL | CCDC92-ZNF664, DR1, GSDMB, PGS1 |
LDL | HCP5-MICA, RPAP2 |
TG | CCDC92-ZNF664, APOC1, PDXDC1 |
TC | HCP5-MICA, M6PR, RPAP2 |
GLGC indicates Global Lipids Genetics Consortium; HDL, high-denisty lipoprotein; HLC, hepatocyte-like cell; LDL, low-density lipoprotein; sQTL, splicing quantitative trait loci; TC, total cholesterol; and TG, triglyceride.
GLGC trait | eQTL–genes |
---|---|
HDL | RBM6, PGAP3, DR1 |
LDL | SPTY2D1, SYPL2, FRK, TBKBP1, RPAP2 |
TG | ANGPTL3, AL138847.1 |
TC | SPTY2D1, ANGPTL3, FRK, TBKBP1, AL138847.1 |
eQTL indicates expression quantitative trait locus; GLGC, Global Lipids Genetics Consortium; HDL, high-density lipoprotein; HLC, hepatocyte-like cell; LDL, low-density lipoprotein; TC, total cholesterol; and TG, triglyceride.
Of particular interest are the colocalized sQTLs mapping to CCDC92 and PGS1 (Figure 4). PGS1 encodes an enzyme known as phosphatidylglycerophosphate synthase 1, which is involved in the synthesis of the anionic phospholipids phosphatidylglycerol and cardiolipin. SNPs in PGS1 are associated with changes in triglyceride levels during diabetes treatment.22 Interestingly, the second SNP in the credible set for this sQTL falls within the canonical splice site of the splicing event (as demonstrated by the red arrow in Figure 4D).23 Furthermore, the differentially spliced intron for PGS1 is associated with a transcript that is predicted to undergo nonsense-mediated decay (PGS1-002), suggesting that the splicing event will have significant impact on levels of functional protein. The CCDC92-ZNF664 locus has also been implicated in many different metabolic disorders, including type-2 diabetes and coronary heart disease (Figure 5).24 The sentinel sQTL SNP at this colocalized locus has been associated with variation in total cholesterol, metabolic syndrome, and waist circumference in previous studies.
Discussion
In sum, we demonstrate the utility of using iPSC-derived HLCs for the identification of lipid-relevant sQTLs. Our sQTL scans identify thousands of sQTLs in both iPSC and HLC lines, and the sentinel sQTL variants are enriched near canonical splice sites and are located close to their associated splice event, indicating that fine-mapping the causal genes and variants for these sQTLs is feasible. Regarding the degree to which sQTLs are shared across cell types, we find that if a spliced isoform is expressed across both cell types, the sQTL effect is also often shared. Although this finding is in line with previous sQTL analyses, it may be due to the fact that low-expressed splice events are often removed during more stringent filtering processes.
Our colocalization analysis of the HLC sQTLs with GWAS loci from the Global Lipids Genetics Consortium identifies several interesting genes and variants that may underlie blood lipid level variation. Furthermore, the fact that (1) our HLC sQTLs generally colocalize with different GWAS loci than the HLC eQTLs, and (2) the colocalized sQTLs nominate different genes than the eQTLs, indicates that sQTLs likely provide information pertaining to the causal genes and molecular mechanisms underlying complex traits that may not otherwise be captured by eQTL analysis alone.
One limitation of this study is the fact that Leafcutter cannot distinguish between types of sQTL events and may miss variants associated with alternative transcription start sites and alternative polyadenylation. As more differential splicing and sQTL detection methods are published, comparing the results of these various methods may provide a more nuanced assessment of the role of sQTLs in complex disease. Another limitation is that the Gene-Tissue Expression project v6 liver RNA-sequencing data was processed differently than the iPSC and HLC data, which may have resulted in the inability to detect the same sQTLs across cell types at the intron level. The use of primary liver tissue samples to interpret and replicate specific sQTLs we have cataloged here would be an important direction for future work.
In summary, sQTL discovery in iPSC-derived cells may serve as an approach complementary to primary tissue QTL discovery efforts and provide a model system in which to identify and functionally characterize sQTLs relevant for complex trait variation.
Article Information
Supplemental Material
Supplemental Methods
Figures S1–S3
References 25–45
Footnote
Nonstandard Abbreviations and Acronyms
- eQTL
- expression quantitative trait locus
- FDR
- false discovery rate
- GLGC
- Global Lipids Genetics Consortium
- HDL
- high-density lipoprotein
- HLC
- hepatocyte-like cell
- iPSC
- induced pluripotent stem cell
- LDL
- low-density lipoprotein
- PBMC
- peripheral blood mononuclear cells
- SNP
- single nucleotide polymorphism
- sQTL
- splicing quantitative trait locus
Supplemental Material
File (circgenetics_circcvg-2020-003249_supp1.pdf)
- Download
- 600.84 KB
References
1.
Global Lipids Genetics Consortium. Discovery and refinement of loci associated with lipid levels. Nat Genet. 2013;45:1274–1283. doi: 10.1038/ng.2797
2.
Musunuru K, Strong A, Frank-Kamenetsky M, Lee NE, Ahfeldt T, Sachs K, Li X, Li H, Kuperwasser N, Ruda VM, et al. From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature. 2010;466:714–719. doi: 10.1038/nature09266
3.
Lee Y, Gamazon ER, Rebman E, Lee Y, Lee S, Dolan ME, Cox NJ, Lussier YA. Variants affecting exon skipping contribute to complex traits. PLoS Genet. 2012;8:e1002998. doi: 10.1371/journal.pgen.1002998
4.
Li YI, van de Geijn B, Raj A, Knowles DA, Petti AA, Golan D, Gilad Y, Pritchard JK. RNA splicing is a primary link between genetic variation and disease. Science. 2016;352:600–604. doi: 10.1126/science.aad9417
5.
Takata A, Matsumoto N, Kato T. Genome-wide identification of splicing QTLs in the human brain and their enrichment among schizophrenia-associated loci. Nat Commun. 2017;8:14519. doi: 10.1038/ncomms14519
6.
Fadista J, Vikman P, Laakso EO, Mollet IG, Esguerra JL, Taneera J, Storm P, Osmark P, Ladenvall C, Prasad RB, et al. Global genomic and transcriptomic analysis of human pancreatic islets reveals novel genes influencing glucose metabolism. Proc Natl Acad Sci U S A. 2014;111:13924–13929. doi: 10.1073/pnas.1402665111
7.
Odhams CA, Cortini A, Chen L, Roberts AL, Viñuela A, Buil A, Small KS, Dermitzakis ET, Morris DL, Vyse TJ, et al. Mapping eQTLs with RNA-seq reveals novel susceptibility genes, non-coding RNAs and alternative-splicing events in systemic lupus erythematosus. Hum Mol Genet. 2017;26:1003–1017. doi: 10.1093/hmg/ddw417
8.
Kim HI, Raffler J, Lu W, Lee JJ, Abbey D, Saleheen D, Rabinowitz JD, Bennett MJ, Hand NJ, Brown C, et al. Fine mapping and functional analysis reveal a role of SLC22A1 in acylcarnitine transport. Am J Hum Genet. 2017;101:489–502. doi: 10.1016/j.ajhg.2017.08.008
9.
Li M, Jaffe AE, Straub RE, Tao R, Shin JH, Wang Y, Chen Q, Li C, Jia Y, Ohi K, et al. A human-specific AS3MT isoform and BORCS7 are molecular risk factors in the 10q24.32 schizophrenia-associated locus. Nat Med. 2016;22:649–656. doi: 10.1038/nm.4096
10.
Havens MA, Hastings ML. Splice-switching antisense oligonucleotides as therapeutic drugs. Nucleic Acids Res. 2016;44:6549–6563. doi: 10.1093/nar/gkw533
11.
de Roeck A, van Broeckhoven C, Sleegers K. The role of ABCA7 in Alzheimer’s disease: evidence from genomics, transcriptomics and methylomics. Acta Neuropathol. 2019;138:201–220. doi: 10.1007/s00401-019-01994-1
12.
de Roeck A, Duchateau L, van Dongen J, Cacace R, Bjerke M, van den Bossche T, Cras P, Vandenberghe R, de Deyn PP, Engelborghs S, et al; BELNEU Consortium. An intronic VNTR affects splicing of ABCA7 and increases risk of Alzheimer’s disease. Acta Neuropathol. 2018;135:827–837. doi: 10.1007/s00401-018-1841-z
13.
Cai, J, DeLaForest, A, Fisher, J, Urick, A, Wagner, T, Twaroski, K, Cayo, M, Nagaoka, M, Duncan, SA. Protocol for directed differentiation of human pluripotent stem cells toward a hepatocyte fate. Stembook. 2008.
14.
Nguyen P, Leray V, Diez M, Serisier S, le Bloc’H J, Siliart B, Dumon H. Liver lipid metabolism. J Anim Physiol Anim Nutr (Berl). 2008;92:272–283. doi: 10.1111/j.1439-0396.2007.00752.x
15.
Pashos EE, Park Y, Wang X, Raghavan A, Yang W, Abbey D, Peters DT, Arbelaez J, Hernandez M, Kuperwasser N, et al. Large, diverse population cohorts of hiPSCs and derived hepatocyte-like cells reveal functional genetic variation at blood lipid-associated loci. Cell Stem Cell. 2017;20:558–570.e10. doi: 10.1016/j.stem.2017.03.017
16.
Warren CR, O’Sullivan JF, Friesen M, Becker CE, Zhang X, Liu P, Wakabayashi Y, Morningstar JE, Shi X, Choi J, et al. Induced pluripotent stem cell differentiation enables functional validation of GWAS variants in metabolic disease. Cell Stem Cell. 2017;20:547–557.e7. doi: 10.1016/j.stem.2017.01.010
17.
Storey JD. A direct approach to false discovery rates. J R Stat Soc Series B Stat Methodol. 2002;64:479–498. https://doi.org/10.1111/1467-9868.00346
18.
Li YI, Knowles DA, Humphrey J, Barbeira AN, Dickinson SP, Im HK, Pritchard JK. Annotation-free quantification of RNA splicing using LeafCutter. Nat Genet. 2018;50:151–158. doi: 10.1038/s41588-017-0004-9
19.
Aguet F, Barbeira AN, Bonazzola R, Brown A, Castel SE, Jo B, Kasela S, Kim-Hellmuth S, Liang Y, Oliva M, et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776
20.
Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383
21.
Westerterp M, Berbée JFP, Delsing DJM, Jong MC, Gijbels MJJ, Dahlmans VEH, Offerman EH, Romijn JA, Havekes LM, Rensen PCN. Apolipoprotein C-I binds free fatty acids and reduces their intracellular esterification. J Lipid Res. 2007;48:1353–1361. doi: 10.1194/jlr.M700024-JLR200
22.
Huggins GS, Papandonatos GD, Erar B, Belalcazar LM, Brautbar A, Ballantyne C, Kitabchi AE, Wagenknecht LE, Knowler WC, Pownall HJ, et al; Genetics Subgroup of the Action for Health in Diabetes (Look AHEAD) Study. Do genetic modifiers of high-density lipoprotein cholesterol and triglyceride levels also modify their response to a lifestyle intervention in the setting of obesity and type-2 diabetes mellitus?: the Action for Health in Diabetes (Look AHEAD) study. Circ Cardiovasc Genet. 2013;6:391–399. doi: 10.1161/CIRCGENETICS.113.000042
23.
Abril JF, Castelo R, Guigó R. Comparison of splice sites in mammals and chicken. Genome Res. 2005;15:111–119. doi: 10.1101/gr.3108805
24.
Kraja AT, Vaidya D, Pankow JS, Goodarzi MO, Assimes TL, Kullo IJ, Sovio U, Mathias RA, Sun Y, Franceschini N, et al. A bivariate genome-wide approach to metabolic syndrome: STAMPEED Consortium. Diabetes. 2011;60:1329–1339. doi: 10.2337/db10-1011
25.
Carcamo-Orive I, Hoffman GE, Cundiff P, Beckmann ND, D’Souza SL, Knowles JW, Patel A, Papatsenko D, Abbasi F, Reaven GM, et al. Analysis of transcriptional variability in a large human iPSC library reveals genetic and non-genetic determinants of heterogeneity. Cell Stem Cell. 2017;20:518–532.e9. doi: 10.1016/j.stem.2016.11.005
26.
Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2011;9:179–181. doi: 10.1038/nmeth.1785
27.
Andrews, S. FastQC: A Quality Control tool for High Throughput Sequence Data [Online]. 2010. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
28.
Krueger, F, James, F, Ewels, P, Afyounian, E, Weinstein, M, Schuster-Boeckler, B, Hulselmans, G and sclamons. FelixKrueger/TrimGalore: v0.6.10 - add default decompression path (0.6.10). Zenodo 2023. doi: https://doi.org/10.5281/zenodo.7598955
29.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635
30.
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–883. doi: 10.1093/bioinformatics/bts034
31.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118
32.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8
33.
Delaneau O, Ongen H, Brown AA, Fort A, Panousis NI, Dermitzakis ET. A complete tool set for molecular QTL discovery and analysis. Nat Commun. 2017;8:15452. doi: 10.1038/ncomms15452
34.
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38:904–909. doi: 10.1038/ng1847
35.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795
36.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol. 1995;57:289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
37.
Ongen H, Buil A, Brown AA, Dermitzakis ET, Delaneau O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics. 2016;32:1479–1485. doi: 10.1093/bioinformatics/btv722
38.
Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ. Software for computing and annotating genomic ranges. PLoS Comput Biol. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118
39.
van Nostrand EL, Pratt GA, Shishkin AA, Gelboin-Burkhart C, Fang MY, Sundararaman B, Blue SM, Nguyen TB, Surka C, Elkins K, et al. Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP). Nat Methods. 2016;13:508–514. doi: 10.1038/nmeth.3810
40.
Han B, Eskin E. Interpreting meta-analyses of genome-wide association studies. PLoS Genet. 2012;8:e1002555. doi: 10.1371/journal.pgen.1002555
41.
Aguet F, Brown AA, Castel SE, Davis JR, He Y, Jo B, Mohammadi P, Park YS, Parsana P, Segrè A, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. doi: 10.1038/nature24277
42.
Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656
43.
Stegle O, Parts L, Piipari M, Winn J, Durbin R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat Protoc. 2012;7:500–507. doi: 10.1038/nprot.2011.457
44.
Maller JB, McVean G, Byrnes J, Vukcevic D, Palin K, Su Z, Howson JMM, Auton A, Myers S, Morris A, et al; Wellcome Trust Case Control Consortium. Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat Genet. 2012;44:1294–1301. doi: 10.1038/ng.2435
45.
Trynka G, Westra HJ, Slowikowski K, Hu X, Xu H, Stranger BE, Klein RJ, Han B, Raychaudhuri S. Disentangling the effects of colocalizing genomic annotations to functionally prioritize non-coding variants within complex-trait loci. Am J Hum Genet. 2015;97:139–152. doi: 10.1016/j.ajhg.2015.05.016
Information & Authors
Information
Published In
Copyright
© 2023 The Authors. Circulation: Genomic and Precision Medicine is published on behalf of the American Heart Association, Inc., by Wolters Kluwer Health, Inc. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution, and reproduction in any medium, provided that the original work is properly cited.
Versions
You are viewing the most recent version of this article.
History
Received: 9 November 2020
Accepted: 23 February 2023
Published online: 11 May 2023
Published in print: June 2023
Keywords
Subjects
Authors
Disclosures
Disclosures Dr Voight is an associate editor for Circulation: Genomics and Precision Medicine. Dr Pashos was solely affiliated with the University of Pennsylvania at the time of their contribution to this article; their current affiliation is now with Pfizer, Inc. The contents and viewpoints expressed herein and the materials therein reflect the personal opinion of the authors, and cannot be construed as representation of any position or opinion of Pfizer Inc. or any of its subsidiaries. All materials were created at the University of Pennsylvania. The other authors report no conflicts.
Sources of Funding
Dr Brown acknowledges support from the National Institutes of Health (NIH)–National Heart, Lung, and Blood Institute (HL133218) for the work. Dr Voight is grateful for support from the work from the NIH–National Institute of Diabetes and Digestive Kidney Diseases (DK101478 and DK126194). Dr Gawronski was supported in part from the NIH–National Institute of General Medical Sciences (GM008216) and the American Heart Association (19PRE34450081).
Metrics & Citations
Metrics
Citations
Download Citations
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Select your manager software from the list below and click Download.
- A scoping review of statistical methods to investigate colocalization between genetic associations and microRNA expression in osteoarthritis, Osteoarthritis and Cartilage Open, 6, 4, (100540), (2024).https://doi.org/10.1016/j.ocarto.2024.100540
Loading...
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Personal login Institutional LoginPurchase Options
Purchase this article to access the full text.
eLetters(0)
eLetters should relate to an article recently published in the journal and are not a forum for providing unpublished data. Comments are reviewed for appropriate use of tone and language. Comments are not peer-reviewed. Acceptable comments are posted to the journal website only. Comments are not published in an issue and are not indexed in PubMed. Comments should be no longer than 500 words and will only be posted online. References are limited to 10. Authors of the article cited in the comment will be invited to reply, as appropriate.
Comments and feedback on AHA/ASA Scientific Statements and Guidelines should be directed to the AHA/ASA Manuscript Oversight Committee via its Correspondence page.