In silico identification of common and specific signatures in coronary heart diseases
- Authors:
- Published online on: August 13, 2020 https://doi.org/10.3892/etm.2020.9121
- Pages: 3595-3614
-
Copyright: © Yang et al. This is an open access article distributed under the terms of Creative Commons Attribution License.
Abstract
Introduction
Coronary heart disease (CHD), also known as coronary artery disease (CAD) is one of a group of diseases of the heart blood vessels affecting millions of individuals worldwide. According to the center for disease control (CDC) reports, each death out of four is related to heart diseases, leading to approximately 610,000 mortalities annually worldwide (1). Among the heart diseases, CAD is the most common, responsible for the death of 370,000 individuals annually worldwide (1). CAD occurs when the elasticity of arteries, as well as vein and vessel smoothing, become plaque in the inner wall, making them rigid and narrowed. This condition restricts the blood flow to the heart muscle, leading to oxygen starvation. The condition of plaque rupture leads to the heart failure or cardiac death (2).
Recently, there has been an increase in the incidence of CHD (also known as ischemic heart disease) in China (3). In addition, CHD has become the most common reason for death in middle and high-income countries (4). According to the data report by NHANES, CHD prevalence was higher in males than females across all ages (7.4 v/s 5.3%, respectively) (3). The American Heart Association explains ‘The important difference between sex and pathology’, clinical presentation and outcomes in CHD patients (5). Thus it is crucial to pay attention to sex disparities and subsequently to personalize treatment (6). Patients with CHD are also susceptible to more complicated clinical problems. Currently, the diagnosis and therapy of CHD is rare and costly as compared to coronary angiography, which is the most popular clinical management option (7). CHD is one of the leading causes of death, and markedly affects the immunity of the body, making it an economic burden worldwide (8,9). This is a complex disease involving multiple mechanisms and influenced by many risk factors, including physical activity, genetics, diet, and smoking (10,11). Recently, a genome-wide association study (GWAS) identified many candidate loci associated with CHD and myocardial infection (MI) (12-14). Although genetics play an important role, accounting for approximately 50% of CHD heritability, the exact mechanism and causative agent of CHD are not yet revealed clearly (15-17). In this regard, it is important to understand and address the candidate genome association in developing CHD.
MicroRNAs (miRNA) are small noncoding RNAs with a length of 22-25 nucleotide and which play a key role in the regulation of gene expression and have implications in many human disorders (18), including many biological processes such as cell differentiation, proliferation and apoptosis (19-21). To the best of the knowledge of the authors, the association pattern of miRNAs to CHD is lacking, leading to demand for specific CHD patients. Although relevant research has been undertaken to address DEGs associated with CHD, DEGs have only been used to check the expression pattern in case of CHD. In this study, we addressed the possible association of genes with CHD, which may be useful for the diagnosis and treatment of this disease in the near future. Additionally, analysis of gene expression data and network analysis were performed to gain a better understanding of CHD for the identification of differentially expressed genes (DEGs), biomarkers and therapeutic target options.
Materials and methods
Data availability
To identify key genes for the development of CHD biomarkers, we used gene expression datasets of 4 angiographically proven patients who were being treated for more than 3 months or from group-1 (n=100) compared to healthy control (n=50). This dataset was downloaded from the GEO module of National Center for Biotechnology Information (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE56885). Microarray gene expression profiles were downloaded and further analyzed for the identification of DEGs. In this dataset, GSM1370681 and GSM1370682 represent the replicate samples of healthy individuals and GSM1370683 to GSM1370686 of four patients as baseline associated with CAD.
Differential Expression Analysis (DEG)
Using the default parameters, WEGO 2.0, and GEO2R (https://www.ncbi.nlm.nih.gov/geo/geo2r/) were used to analyze the GEO series (22). The Benjamini and Hochberg false discovery rate method was utilized to adjust the P-values. NCBI-generated annotations were employed to display the DEG list by comparing the overall common gene expression pattern as compared to the control. On the basis of this analysis, possible associations related to CHD were reported. Although inappropriate to consider the data for analysis on inter-datasets, the average value of LogFC for all four datasets was assessed to represent the expression level.
Gene ontology (GO) analysis
The major bioinformatics tool GO was used as an initiative to understand the function of genes and gene products of Homo sapiens. The PANTHER (Protein ANalysis THrough Evolutionary Relationships) classification database (23) was used to perform the GO analysis, and the pathway analysis was performed using Reactome (24).
Protein-Protein Interaction (PPI) network construction analysis
An online freely available software package, STRING, was utilized to establish the PPI network (25), and all the cut-off points were combined to analyze the topology property of networks. Gene edges of >15 degrees were defined as hub genes.
Results and Discussion
Screening of differentially expressed common genes from microarray data sets
Atherosclerosis is one of the leading causes of cardiovascular diseases such as CHD (26). Understanding of the key players in expression, regulation and function, of GWAS CHD genes will provide the options to treat this disease, leading to further developments of novel therapeutic interventions (27). In this study, the first compressive investigation was conducted to identify the expression profile of collected microarray data sets of CHD. The dataset of two controls in replicate and four baseline test samples were used. We report an overall expression and function of genes associated with different biological processes, which may lead to CHD during pathological conditions. The overall study design is shown in Fig. 1, which presents CHD data of Homo sapiens from the GEO database, with four series of test samples and two control study sets. First, we used WEGO to visualize the GO annotations and the percentage of genetic association of different functions in cells to address the possible association with CHD.
A total of 52,998 genes sharing different functions such as cellular (18,476), biological (17,307) and molecular (17,215) functions were identified
Out of those, the highest gene association to cell, cell part, organelle, organelle part, membrane, binding, cellular process, biological regulation, and metabolic were topmost in the metadata of the CHD-associated data set (Fig. 2). The different GO representing the 0-90% range of gene expression as compared to control data set is shown in Fig. 3. The principal findings of this study confirm the association of immune system, inflammation, and apoptosis as mediators in the development of CHD. The impact of the immune system plays a key role in the development of heart failure. A transcriptomic study reported the sustained activation of the adoptive immune system which may be a contributing factor in the progression of CHD (28). Another report suggests that the imbalance in inflammatory and anti-inflammatory cytokines may lead to the onset of extensive fibrosis (29).
From the GEO database, accession GSE56885 of CSD patients, who were being treated for more than 3 months was selected
From the included patients, two controls in replicate and four overall test samples were used to consolidate data refining. The GEO2R was used to analyze the control and test data series by normalizing the microarray data for high quality. DEGs with different fold change confirm their crucial role in CHD (Table I).
In our study, many immune response processes were significantly changed and DEGs are associated with the metabolic process, which is associated with CHD
The CHDs of the innate immune system were largely mediated through neutrophils and monocyte, and macrophages (30), to contribute to the process of the chronic inflammation process.
Functional enrichment and unified DEG analysis
To precisely understand the gene changes during CHD, the DEGs GO was performed using the online PANTHER database for high-throughput analysis to classify the proteins and their genes into family and subfamily, molecular function, biological process, and pathway (31). In the dataset analyzed, the two significant changes in molecular function were protein binding (75%) and catalytic activity (56%), followed by molecular regulator, molecular transducer activity, structural activity, transcription regulator activity, and transporter activity (Fig. 4A). In terms of the biological process, the three most significant classes of CHD were cellular process (83%), biological regulation (57%), and metabolic process (44%) (Fig. 4B). Additionally, in terms of cellular components, another two more significant components are cell (65%) and organelle (58%), which were found to be associated with CHD (Fig. 4C). Many other target-associated DEGs were involved in the biological process, molecular function and cellular components.
Analyzed potential DEGs of the CHD data set shows protein classes distributed among transcription factor (24%), enzyme modulator (20%), nucleic acid binding (18%), and signaling molecules (18%) (Fig. 5A)
The DEGs mainly associated with CHD key pathways showed the significance are inflammation mediated by chemokine and cytokine signaling pathway (11%), CCKR signaling map (11%), gonadotropin-releasing hormone receptor pathway (8%), apoptosis signaling pathway (6%), and p53 pathway (5%) (Fig. 5B). This result was consistent with GO analysis, confirming the classes of proteins associated with CHD. Many genes associated with inflammatory roles, and a previous study showed a conserved signature of dilated cardiomyopathy (DCM) plays an important role in cell survival promotion during end-stage of heart failure (32). In the present study, we also revealed the expression pattern of apoptotic or inflammatory genes (Fig. 4) (33,34).
Pathway analysis
To address the overview of data insight into the pathways, which are associated and connected for CHD development (35), we analyzed 164 DEGs involved in different functional pathways compared to reference and expected genes for those pathways. A total of 13 pathways were found to be associated with signaling-, immune-, and transcription-related pathways (36). Genes were confirmed in the uploaded list over the expected one (number in the list divided by the expected number). If >1, it indicated that the category is over-represented in the experiment. Conversely, the category is under-represented if <1. In the future, overexpressed genes are likely to serve as the marker selected in the development of CHD interventions. The P-value indicates the Fisher's exact test (37) or Binomial statistic in which the probability is the number of genes observed in this category occurred by chance (randomly), as determined by the reference list (Table II).
PPI analysis
To address the PPI of the CHD dataset in this study, STRING online suits was used to address the possible interaction of protein of CHD associated DEGs. A total of 112 nodes, 257 edges, 4.59 average node edge, 0.387 average clustering coefficient, 77 expected edge number, and <1.0e-16 PPI enrichment value were observed, and shown the network was significantly interacted than expected. Previous studies investigated the rare variants through targeted expression profiling across CHD relevant tissues from appropriate cases and controls (38,39). The PPI indicates the interaction of genes associated with multiple genes for outcome. In the present study, we identified 422 GO for biological process, 31 GO for molecular function, 12 GO for cellular component, 33 pathways, 30 reactome pathways, 13 UniProt keywords, 11 PFAM protein domains, 29 INTERO protein domains, and 3 SMART protein domains in the analysis of CHD microarray data set. In those findings, associated edges shows physically binding protein and some of them were associated with but did not have physical binding. Of these, only the top-ranking ones have been presented (Fig. 6 and Table III).
Understanding and ruling the mechanism
There are several challenges to identifying the genetic basis of CHD that are also the determinants of this complex disease, including phenotypic and genetic heterogeneity, gene-environment, and etiological spectrum range and their effect. Considering research efforts involved in determining the genetic basis of this CHD, there is a need to understand the fine complexity of genetic association leading to mortality in developing countries. There is a need to focus on clinical manifestation rather than factors which influence or are heritable by genetic factors. There are many challenges in determining the genetic association of CHDs, such as phenotypic heterogeneity, genetic heterogeneity, small gene effects, gene-gene and gene-environment interactions and rare variants causing complex diseases. Some of the key points to be undertaken such as mortality, challenge in identifying the genetic determinants, studying linkage mapping through conventional approaches, and cataloguing of human diseases variation at single-nucleotide polymorphism (SNP), as well as genotyping will increase the likelihood of success.
In conclusion, we studied a comprehensive gene expression profile of microarray data of CHD. During the progression of CHD, there was a significant change in the expression of genes involved in the immune system, inflammation, and cell signaling through protein binding. This analysis provides valuable information for future research and in understanding the mechanism of CHD as well as identification of novel interventions for therapeutic application.
Acknowledgements
Not applicable
Funding
This research received no specific grants from any funding agencies.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Authors' contributions
ZY conceived and designed the study. WL provided study materials. ZY, HM and WL were responsible for the collection and assembly of data, data analysis and interpretation. ZY was involved in writing the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not applicable
Patient consent for publication
Not applicable
Competing interests
The authors declare that they have no competing interests.
References
Underlying Cause of Death 1999-2017. Centers for Disease Control and Prevention 24/7. | |
Haukilahti MAE, Holmström L, Vähätalo J, Kenttä T, Tikkanen J, Pakanen L, Kortelainen ML, Perkiömäki J, Huikuri H, Myerburg RJ, et al: Sudden cardiac death in women. Circulation. 139:1012–1021. 2019.PubMed/NCBI View Article : Google Scholar | |
Ding D, Wang M, Su D, Hong C, Li X, Yang Y, Zhang Y, Hu G and Ling W: Body mass index, high-sensitivity C-reactive protein and mortality in Chinese with coronary artery disease. PLoS One. 10(e0135713)2015.PubMed/NCBI View Article : Google Scholar | |
Yusuf S, Rangarajan S, Teo K, Islam S, Li W, Liu L, Bo J, Lou Q, Lu F, Liu T, et al: Cardiovascular risk and events in 17 low-, middle-, and high-income countries. N Engl J Med. 371:818–827. 2014.PubMed/NCBI View Article : Google Scholar | |
Benjamin EJ, Blaha MJ, Chiuve SE, Cushman M, Das SR, Deo R, de Ferranti SD, Floyd J, Fornage M, Gillespie C, et al: American Heart Association Statistics Committee and Stroke Statistics Subcommittee: Heart disease and stroke statistics-2017 Update: A report from the American Heart Association. Circulation. 135:e146–e603. 2017.PubMed/NCBI View Article : Google Scholar | |
Mehta LS, Beckie TM, DeVon HA, Grines CL, Krumholz HM, Johnson MN, Lindley KJ, Vaccarino V, Wang TY, Watson KE, et al: American Heart Association Cardiovascular Disease in Women and Special Populations Committee of the Council on Clinical Cardiology, Council on Epidemiology and Prevention, Council on Cardiovascular and Stroke Nursing, and Council on Quality of Care and Outcomes Research: Acute myocardial infarction in women: A scientific statement from the American Heart Association. Circulation. 133:916–947. 2016.PubMed/NCBI View Article : Google Scholar | |
Fyyaz S, Rauf A, Hudson J and Olabintan O: 107 Nice 2016 stable chest pain guidelines: Improved yield of severe coronary artery disease at invasive coronary angiography. Heart. 105(88)2019. | |
Timmis A, Townsend N, Gale C, Grobbee R, Maniadakis N, Flather M, Wilkins E, Wright L, Vos R, Bax J, et al: ESC Scientific Document Group: European Society of Cardiology: Cardiovascular Disease Statistics 2017. Eur Heart J. 39:508–579. 2018.PubMed/NCBI View Article : Google Scholar | |
Erbel R, Aboyans V, Boileau C, Bossone E, Bartolomeo RD, Eggebrecht H, Evangelista A, Falk V, Frank H, Gaemperli O, et al: ESC Committee for Practice Guidelines: 2014 ESC Guidelines on the diagnosis and treatment of aortic diseases: Document covering acute and chronic aortic diseases of the thoracic and abdominal aorta of the adult. Eur Heart J. 35:2873–2926. 2014.PubMed/NCBI View Article : Google Scholar | |
Piepoli MF, Hoes AW, Agewall S, Albus C, Brotons C, Catapano AL, Cooney MT, Corrà U, Cosyns B, Deaton C, et al: ESC Scientific Document Group: 2016 European guidelines on cardiovascular disease prevention in clinical practice: The Sixth Joint Task Force of the European Society of Cardiology and Other Societies on Cardiovascular Disease Prevention in Clinical Practice (constituted by representatives of 10 societies and by invited experts) Developed with the special contribution of the European Association for Cardiovascular Prevention & Rehabilitation (EACPR). Eur Heart J. 37:2315–2381. 2016.PubMed/NCBI View Article : Google Scholar | |
Kwak BR, Bäck M, Bochaton-Piallat ML, Caligiuri G, Daemen MJAP, Davies PF, Hoefer IE, Holvoet P, Jo H and Krams R: Biomechanical factors in atherosclerosis: Mechanisms and clinical implications. Eur Heart J. 35:3013–3020. 2014.PubMed/NCBI View Article : Google Scholar | |
CARDIoGRAMplusC4D Consortium. Deloukas P, Kanoni S, Willenborg C, Farrall M, Assimes TL, Thompson JR, Ingelsson E, Saleheen D and Erdmann J: Large-scale association analysis identifies new risk loci for coronary artery disease. Nat Genet. 45:25–33. 2013.PubMed/NCBI View Article : Google Scholar | |
Nikpay M, Goel A, Won H-H, Hall LM, Willenborg C, Kanoni S, Saleheen D, Kyriakou T, Nelson CP, Hopewell JC, et al: A comprehensive 1,000 genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 47:1121–1130. 2015.PubMed/NCBI View Article : Google Scholar | |
Kessler T, Vilne B and Schunkert H: The impact of genome-wide association studies on the pathophysiology and therapy of cardiovascular disease. EMBO Mol Med. 8:688–701. 2016.PubMed/NCBI View Article : Google Scholar | |
Roberts R: Genetics of coronary artery disease. Circ Res. 114:1890–1903. 2014.PubMed/NCBI View Article : Google Scholar | |
Zhao Y, Chen J, Freudenberg JM, Meng Q, Rajpal DK and Yang X: Network-based identification and prioritization of key regulators of coronary artery disease loci. Arterioscler Thromb Vasc Biol. 36:928–941. 2016.PubMed/NCBI View Article : Google Scholar | |
Miller CL, Pjanic M and Quertermous T: From locus association to mechanism of gene causality: The devil is in the details. Arterioscler Thromb Vasc Biol. 35:2079–2080. 2015.PubMed/NCBI View Article : Google Scholar | |
Cheng Y and Zhang C: MicroRNA-21 in cardiovascular disease. J Cardiovasc Transl Res. 3:251–255. 2010.PubMed/NCBI View Article : Google Scholar | |
Chen CZ, Li L, Lodish HF and Bartel DP: MicroRNAs modulate hematopoietic lineage differentiation. Science. 303:83–86. 2003.PubMed/NCBI View Article : Google Scholar | |
Johnson CD, Esquela-Kerscher A, Stefani G, Byrom M, Kelnar K, Ovcharenko D, Wilson M, Wang X, Shelton J, Shingara J, et al: The let-7 microRNA represses cell proliferation pathways in human cells. Cancer Res. 67:7713–7722. 2007.PubMed/NCBI View Article : Google Scholar | |
Cheng AM, Byrom MW, Shelton J and Ford LP: Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis. Nucleic Acids Res. 33:1290–1297. 2005.PubMed/NCBI View Article : Google Scholar | |
Smyth GK, Michaud J and Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics. 21:2067–2075. 2005.PubMed/NCBI View Article : Google Scholar | |
Mi H, Muruganujan A, Casagrande JT and Thomas PD: Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 8:1551–1566. 2013.PubMed/NCBI View Article : Google Scholar | |
Jupe S, Fabregat A and Hermjakob H: Expression data analysis with reactome. Curr Protoc Bioinformatics 49: 8.20.1-8.20.9, 2015. https://doi.org/10.1002/0471250953.bi0820s49. | |
Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P, et al: The STRING database in 2017: Quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 45:D362–D368. 2017.PubMed/NCBI View Article : Google Scholar | |
Rana JS, Nieuwdorp M, Jukema JW and Kastelein JJ: Cardiovascular metabolic syndrome - an interplay of, obesity, inflammation, diabetes and coronary heart disease. Diabetes Obes Metab. 9:218–232. 2007.PubMed/NCBI View Article : Google Scholar | |
Kessler T, Vilne B and Schunkert H: The impact of genome-wide association studies on the pathophysiology and therapy of cardiovascular disease. EMBO Mol Med. 8:688–701. 2016.PubMed/NCBI View Article : Google Scholar | |
Torre-Amione G: Immune activation in chronic heart failure. Am J Cardiol. 95 (11A):3C–8C; discussion 38C-40C. 2005.PubMed/NCBI View Article : Google Scholar | |
Brenes-Castro D, Castillo EC, Vázquez-Garza E, Torre-Amione G and García-Rivas G: Temporal frame of immune cell infiltration during heart failure establishment: Lessons from animal models. Int J Mol Sci. 19(19)2018.PubMed/NCBI View Article : Google Scholar | |
Flores-Arredondo JH, García-Rivas G and Torre-Amione G: Immune modulation in heart failure: Past challenges and future hopes. Curr Heart Fail Rep. 8:28–37. 2011.PubMed/NCBI View Article : Google Scholar | |
Gu X, Li B, Jiang M, Fang M, Ji J, Wang A, Wang M, Jiang X and Gao C: RNA sequencing reveals differentially expressed genes as potential diagnostic and prognostic indicators of gallbladder carcinoma. Oncotarget. 6:20661–20671. 2015.PubMed/NCBI View Article : Google Scholar | |
Barth AS, Kuner R, Buness A, Ruschhaupt M, Merk S, Zwermann L, Kääb S, Kreuzer E, Steinbeck G, Mansmann U, et al: Identification of a common gene expression signature in dilated cardiomyopathy across independent microarray studies. J Am Coll Cardiol. 48:1610–1617. 2006.PubMed/NCBI View Article : Google Scholar | |
Satterthwaite G, Francis SE, Suvarna K, Blakemore S, Ward C, Wallace D, Braddock M and Crossman D: Differential gene expression in coronary arteries from patients presenting with ischemic heart disease: Further evidence for the inflammatory basis of atherosclerosis. Am Heart J. 150:488–499. 2005.PubMed/NCBI View Article : Google Scholar | |
Sinnaeve PR, Donahue MP, Grass P, Seo D, Vonderscher J, Chibout SD, Kraus WE, Sketch M Jr, Nelson C, Ginsburg GS, et al: Gene expression patterns in peripheral blood correlate with the extent of coronary artery disease. PLoS One. 4(e7037)2009.PubMed/NCBI View Article : Google Scholar | |
Gotoh T, Endo M and Oike Y: Endoplasmic reticulum stress-related inflammation and cardiovascular diseases. Int J Inflam. 2011:1–9. 2011.PubMed/NCBI View Article : Google Scholar | |
Haas MJ and Mooradian AD: Regulation of high-density lipoprotein by inflammatory cytokines: Establishing links between immune dysfunction and cardiovascular disease. Diabetes Metab Res Rev. 26:90–99. 2010.PubMed/NCBI View Article : Google Scholar | |
Costa THR, de Figueiredo Neto JA, de Oliveira AEF, Lopes e Maia MF and de Almeida AL: Association between chronic apical periodontitis and coronary artery disease. J Endod. 40:164–167. 2014.PubMed/NCBI View Article : Google Scholar | |
Jensen MK, Pers TH, Dworzynski P, Girman CJ, Brunak S and Rimm EB: Protein interaction-based genome-wide analysis of incident coronary heart disease. Circ Cardiovasc Genet. 4:549–556. 2011.PubMed/NCBI View Article : Google Scholar | |
Duan S, Luo X and Dong C: Identification of susceptibility modules for coronary artery disease using a genome wide integraded network analysis. Gene:. 531:347–357. 2013.PubMed/NCBI View Article : Google Scholar |