Long non‑coding RNA‑based risk scoring system predicts prognosis of alcohol‑related hepatocellular carcinoma
- Authors:
- Published online on: May 22, 2020 https://doi.org/10.3892/mmr.2020.11179
- Pages: 997-1007
Copyright: © Luo et al. This is an open access article distributed under the terms of Creative Commons Attribution License.
Abstract
Introduction
Hepatocellular carcinoma (HCC) is the sixth most common malignancy and the fourth leading cause of cancer-related death globally, and worldwide incidence increases by 3–4% per year (1,2). Hepatitis B virus (HBV) or hepatitis C virus (HCV) infection, alcohol consumption, diabetes, nonalcoholic fatty liver disease and smoking are known to be major risk factors for HCC (3,4). HCC is highly heterogeneous, and the pathogenesis is extremely complex. The progression of HCC involves multiple processes such as mutation and signaling pathway maladjustment, reflecting the interaction among multiple genes (5,6). Despite the development of various drugs and breakthroughs in diagnosis, the prognosis of HCC remains poor, with a 5-year survival rate of only 5% for patients with advanced HCC (7). Timely and effective assessment of prognosis is of great significance to guide the treatment. At present, there are no biomarkers that effectively predict the survival of patients with HCC, and thus, finding effective prognostic biomarkers for patients with HCC is crucial.
Long non-coding RNAs (lncRNAs), non-coding transcripts >200 nucleotides, serve important cellular functions such as in chromatin modification as well as transcriptional and post-transcriptional regulation (8,9). Increasing evidence demonstrates that aberrant expression of lncRNAs is associated with the occurrence and development of various human diseases, especially cancer (10–12). For example, the overexpression of lncRNA HOX Transcript Antisense Intergenic RNA (HOTAIR) was demonstrated to predict tumor recurrence after liver transplantation (13). There was also a significant association between HOTAIR expression and tumor progression in patients with HCC (14–16). Increased biallelic expression of H19 and IGF2 may participate in an epigenetic mechanism of HCC development and progression (17). The lncRNA GPC3-AS1 promotes HCC progression by epigenetic GPC3 activation (18). However, the role of lncRNAs in alcohol-related HCC remains unclear.
Alcohol is a dose-related risk factor known to be associated with more than 200 diseases, including HCC (19,20). Heavy drinkers are at 3- to 10-fold higher risk of HCC than non-drinkers (2). In addition, the overall survival rate is lower for patients with alcohol-related HCC than for those with non-alcohol-related HCC, suggesting that there may be a link between alcohol and prognosis (21). Thus, the present study aimed to examine whether lncRNAs that are differentially expressed in the presence of alcohol consumption may be used as prognostic markers in HCC, and whether these differences might influence the risk of HCC recurrence or death. Using data from The Cancer Genome Atlas (TCGA), the present study developed a risk-scoring system based on lncRNA levels that may be valuable for predicting the prognosis of patients with alcohol-related HCC.
Materials and methods
Patient selection and data collection
Profiles of lncRNA and mRNA expression in patients with HCC were downloaded from the University of California Santa Cruz (UCSC) Xena server (https://xenabrowser.net/datapages/). Corresponding clinical information was obtained from TCGA (version 09–14–2017 for HCC). The patients in this dataset had histologically confirmed HCC. Patient data included a complete lncRNA expression profile, alcohol consumption status and survival data for determining overall survival (OS) and recurrence-free survival (RFS). A total of 113 patients with HCC and alcohol consumption, and 224 patients with HCC and no alcohol consumption, were selected (Table I). This study complied with TCGA publication guidelines and policies (http://cancergenome.nih.gov/publications/publicationguidelines). No ethics approval was required for this study since data were obtained from TCGA.
Table I. Clinicopathological characteristics of 337 patients with alcohol- or non-alcohol-related hepatocellular carcinoma.
Identification of lncRNAs differentially expressed between alcohol-related and non-alcohol-related HCC
After eliminating lncRNAs showing zero expression in >50% of all patients, the edgeR package in R (https://www.r-project.org/) was used to identify lncRNAs differentially expressed between patients with and without alcohol consumption (22). Differential expression was defined as an absolute log2 fold change (|log2FC|) >1 and a false discovery rate (FDR) <0.05. Differentially expressed lncRNAs were then presented in cluster heat maps and volcano plots generated using the packages gplots and heatmap in R.
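The two filtering steps described above can be sketched in a few lines. This is a minimal Python illustration, not the edgeR workflow used in the study: the statistical test itself is omitted, and the names `DEResult`, `drop_sparse` and `classify` are hypothetical. It assumes that the fold-change cut-off is applied to the absolute value of log2FC, which the reported downregulated lncRNAs imply.

```python
from typing import Dict, List, NamedTuple


class DEResult(NamedTuple):
    """Per-lncRNA output of a differential expression test (hypothetical container)."""
    log2fc: float
    fdr: float


def drop_sparse(counts: Dict[str, List[float]],
                max_zero_frac: float = 0.5) -> Dict[str, List[float]]:
    """Discard lncRNAs with zero expression in more than max_zero_frac of patients."""
    kept = {}
    for name, values in counts.items():
        zero_frac = sum(1 for v in values if v == 0) / len(values)
        if zero_frac <= max_zero_frac:
            kept[name] = values
    return kept


def classify(results: Dict[str, DEResult],
             lfc_cut: float = 1.0,
             fdr_cut: float = 0.05) -> Dict[str, str]:
    """Label lncRNAs passing |log2FC| > lfc_cut and FDR < fdr_cut as 'up' or 'down'."""
    labels = {}
    for name, res in results.items():
        if res.fdr >= fdr_cut:
            continue  # not significant after multiple-testing correction
        if res.log2fc > lfc_cut:
            labels[name] = "up"
        elif res.log2fc < -lfc_cut:
            labels[name] = "down"
    return labels
```

lncRNAs that fail either threshold are simply absent from the returned dictionary, so the up- and downregulated counts can be read directly from the labels.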
Construction of lncRNA-based risk scoring systems
Standardized expression values of lncRNAs in multiple tissues of the same patient were averaged. Univariate Cox analysis was then performed to screen the differentially expressed lncRNAs for a significant relationship with OS or RFS, with the threshold set at P<0.05. Selected lncRNAs were included in subsequent multivariate Cox regression by the backward stepwise method in order to identify the best model. The expression level of each lncRNA was multiplied by the corresponding regression coefficient β and linearly combined to generate a risk scoring system:
Risk score=(β1 × expression level of lncRNA1) + (β2 × expression level of lncRNA2) + (β3 × expression level of lncRNA3) + … + (βn × expression level of lncRNAn).
This formula was used to calculate a risk score for each patient. The prognosis prediction performance of this risk score was assessed using time-dependent receiver operating characteristic (ROC) curves within three years (23). Patients with HCC were divided into a high- or a low-risk group according to the cut-off value of the median risk score, as demonstrated in non-cluster heat maps. Kaplan-Meier survival curves were generated and compared between high- and low-risk groups. All these analyses were conducted using R/Bioconductor (version 3.4.4, http://www.r-project.org/).
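The risk score and the median split can be sketched as follows. This is an illustrative Python version rather than the R code used in the study; the function names are hypothetical, and the assignment of a score exactly equal to the median to the low-risk group is an assumption, since the tie rule is not stated.

```python
from statistics import median
from typing import Dict


def risk_score(coefficients: Dict[str, float],
               expression: Dict[str, float]) -> float:
    """Linear combination of expression levels weighted by Cox coefficients (beta)."""
    return sum(beta * expression[name] for name, beta in coefficients.items())


def split_by_median(scores: Dict[str, float]) -> Dict[str, str]:
    """Assign each patient to 'high' or 'low' risk using the median score as cut-off.

    Assumption: scores equal to the median are placed in the low-risk group.
    """
    cutoff = median(scores.values())
    return {pid: ("high" if score > cutoff else "low")
            for pid, score in scores.items()}
```

With an even number of patients and no tied scores, this rule yields two groups of equal size, which are then compared with Kaplan-Meier curves.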
Prognostic performance of the risk scoring systems
To validate the prognostic performance of the risk scoring systems, univariate and multivariate Cox regression analyses were performed to determine whether the risk score was an independent factor for survival. This regression was performed in SPSS 16.0 (SPSS, Inc.), with a two-sided significance threshold of P<0.05.
Co-expression and functional enrichment analysis of related mRNAs
Pearson correlation analysis was performed to screen for relationships between lncRNAs in the risk scoring systems and mRNAs based on data from 337 patients with HCC. A relationship was considered significant if the absolute value of the Pearson correlation coefficient was >0.30 and the two-sided z-test P-value was <0.01. To obtain a deeper understanding of these mRNAs, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed using the clusterProfiler package in R (24). P<0.05 was considered to indicate a statistically significant difference.
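The co-expression filter can be illustrated with a short Python sketch. The exact form of the z-test is not specified in the text; the version below assumes the Fisher z-transformation, which is one common choice, and all function names are hypothetical.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long samples."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    # Clamp to [-1, 1] to guard against floating-point overshoot.
    return max(-1.0, min(1.0, sxy / math.sqrt(sxx * syy)))


def fisher_z_pvalue(r: float, n: int) -> float:
    """Two-sided P-value for H0: rho = 0 using the Fisher z-transformation (assumed test)."""
    if n <= 3:
        raise ValueError("the Fisher z-test needs more than 3 observations")
    if abs(r) >= 1.0:
        return 0.0
    z = math.atanh(r) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))


def is_coexpressed(lncrna: Sequence[float], mrna: Sequence[float],
                   r_cut: float = 0.30, p_cut: float = 0.01) -> bool:
    """Apply the |r| > r_cut and P < p_cut thresholds to one lncRNA-mRNA pair."""
    r = pearson_r(lncrna, mrna)
    return abs(r) > r_cut and fisher_z_pvalue(r, len(lncrna)) < p_cut
```

With 337 patients, a correlation at the 0.30 threshold is already far below P=0.01 under this test, so in practice the correlation cut-off is the binding criterion.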
Results
lncRNAs are differentially expressed between alcohol-related and non-alcohol-related HCC
A total of 102 differentially expressed lncRNAs were identified, 47 (46.08%) of which were upregulated and 55 (53.92%) downregulated (Figs. 1 and 2); the top 20 up- and downregulated lncRNAs, together with the corresponding values for log2FC, P and FDR, are shown in Table II.
Table II. Differentially expressed lncRNAs in patients with alcohol-related or non-alcohol-related hepatocellular carcinoma.
Risk-scoring system based on lncRNA expression and OS
Univariate Cox analysis identified six lncRNAs that were significantly associated with OS: AC012640.1, AC013451.2, AC062004.1, LINC02334, AC090921.1 and LINC01605. The first four were independent prognostic indicators of OS based on multivariate Cox regression (Table III). The resulting risk scoring system was: Risk score=(0.186 × AC012640.1) + (0.363 × AC013451.2) + (−0.243 × AC062004.1) + (−0.275 × LINC02334).
In this scoring system, increased expression of AC012640.1 and AC013451.2 predicted worse OS (β>0), whereas increased expression of AC062004.1 and LINC02334 predicted better OS (β<0).
Based on their risk scores, patients were classified as at low or high risk of poor OS using the median risk score as cutoff (Fig. 3A). Kaplan-Meier curves demonstrated that patients with high risk had significantly lower OS at 3 and 5 years compared with that of patients with low risk (Fig. 4A). The area under the ROC curve (AUC) for the risk scoring system was 0.721 (Fig. 5A).
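The Kaplan-Meier comparison relies on the product-limit estimator, which can be sketched in plain Python as below. This is an illustration only, not the R routine used in the study; the function name is hypothetical, and events are coded as 1 (death or recurrence observed) or 0 (censored).

```python
from typing import List, Sequence, Tuple


def kaplan_meier(times: Sequence[float],
                 events: Sequence[int]) -> List[Tuple[float, float]]:
    """Return (time, survival probability) pairs at each distinct event time.

    events[i] is 1 if the event was observed at times[i] and 0 if censored.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        observed = 0  # events observed at time t
        removed = 0   # patients leaving the risk set at time t
        while i < len(data) and data[i][0] == t:
            observed += data[i][1]
            removed += 1
            i += 1
        if observed > 0:
            survival *= 1 - observed / n_at_risk
            curve.append((t, survival))
        n_at_risk -= removed
    return curve
```

Computing one curve for the high-risk group and one for the low-risk group gives the two step functions that are compared in a Kaplan-Meier plot; censored patients shrink the risk set without lowering the curve.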
The risk scoring system was used to predict OS of patients with different clinicodemographic characteristics. This is an important test of the scoring system because of the heterogeneity of HCC and the large number of factors that influence prognosis.
Univariate analysis identified risk score, family cancer history and vascular invasion as significantly associated with OS, but not age, body mass index (BMI), ethnicity, sex, hepatitis, cirrhosis, histological grade of cancer, new tumor event, pathology stage, cancer status or residual tumor. Multivariate Cox regression identified the following as independent predictors of poor OS: Risk score [hazard ratio (HR) 3.393, 95% confidence interval (CI) 1.597–7.210] and vascular invasion (HR 2.146, 95% CI 0.903–5.104, Table V).
Risk-scoring system based on lncRNA expression and RFS
Univariate analysis identified 11 lncRNAs that were significantly correlated with RFS: ERVH48-1, LINC02043, LINC01605, AC062004.1, AL139385, AC007938.3, AC090921.1, AC025580.1, AC012640.1, C10orf91, and LINC01589. Multivariate analysis demonstrated the first five to be independent prognostic indicators of RFS (Table IV). The resulting risk scoring system was: Risk score=(0.3529 × ERVH48-1) + (0.3499 × LINC02043) + (0.1701 × LINC01605) + (−0.3531 × AC062004.1) + (−0.1924 × AL139385).
In this scoring system, increased expression of ERVH48-1, LINC02043, and LINC01605 predicted worse RFS (β>0), whereas increased expression of AC062004.1 and AL139385 predicted better RFS (β<0).
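As a worked illustration, the two published scoring systems can be written out with their reported coefficients. The coefficients below are taken from the text; the helper names and any expression values passed in are hypothetical, and the snippet is a sketch rather than a validated calculator.

```python
from typing import Dict

# Regression coefficients as reported for the OS and RFS scoring systems.
OS_COEFFICIENTS: Dict[str, float] = {
    "AC012640.1": 0.186,
    "AC013451.2": 0.363,
    "AC062004.1": -0.243,
    "LINC02334": -0.275,
}
RFS_COEFFICIENTS: Dict[str, float] = {
    "ERVH48-1": 0.3529,
    "LINC02043": 0.3499,
    "LINC01605": 0.1701,
    "AC062004.1": -0.3531,
    "AL139385": -0.1924,
}


def score(coefficients: Dict[str, float], expression: Dict[str, float]) -> float:
    """Weighted sum of standardized expression levels for one patient."""
    return sum(beta * expression[name] for name, beta in coefficients.items())


def direction(coefficients: Dict[str, float]) -> Dict[str, str]:
    """Map each lncRNA to the prognosis implied by higher expression."""
    return {name: ("worse" if beta > 0 else "better")
            for name, beta in coefficients.items()}


# lncRNAs that appear in both scoring systems.
SHARED = set(OS_COEFFICIENTS) & set(RFS_COEFFICIENTS)
```

AC062004.1 is the only lncRNA shared by the two systems, with a negative coefficient in both, so the two models together involve eight distinct lncRNAs.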
Patients were classified as at low or high risk of poor RFS (Fig. 3B). Kaplan-Meier curves demonstrated that patients with high risk had significantly lower RFS at 3 and 5 years compared with that of patients with low risk (Fig. 4B). AUC for the risk scoring system was 0.777 (Fig. 5B).
Univariate analysis identified that risk score and vascular invasion were significantly correlated with RFS, but not age, BMI, ethnicity, sex, hepatitis, cirrhosis, histology grade, new tumor event, pathology stage, cancer status, family cancer history or residual tumor. Multivariate analysis identified the independent predictors to be risk score (HR 2.895, 95% CI 1.491–5.621) and vascular invasion (HR 2.398, 95% CI 1.104–5.210, Table VI).
Functional analysis of co-expressed lncRNAs and mRNAs
KEGG pathway analysis revealed that the mRNAs co-expressed with the lncRNAs in the OS scoring system were involved mainly in chemical carcinogenesis, cytochrome P450-mediated drug metabolism and retinol metabolism (Fig. 6A). The mRNAs co-expressed with the lncRNAs in the RFS scoring system were involved mainly in cell cycle and carbon metabolism (Fig. 6B).
Discussion
HCC is a major health problem worldwide with poor overall prognosis (25,26). Most patients with HCC are diagnosed at advanced stages (III–IV) (27). Earlier diagnosis and more reliable prognosis, based on suitable biomarkers, are crucial for improving the management and therefore outcomes of patients with HCC. Accumulating evidence has suggested that the abnormal expression of lncRNAs is associated with the recurrence, metastasis and prognosis of HCC (28–30). Since the prognosis in HCC may differ depending on whether it is alcohol-related or not, the present study developed a risk scoring system based on lncRNA expression to evaluate the risk of poor OS or RFS in patients with alcohol-related HCC. The results of the present study suggest that lncRNAs have good potential as prognostic biomarkers in alcohol-related HCC.
The results of the present study demonstrated that the risk-scoring system and vascular invasion were important independent predictors of prognosis in the sample of patients with HCC. The AUCs for the OS and RFS risk scoring systems were 0.721 and 0.777, respectively, suggesting good predictive power. Thus, an lncRNA-based risk scoring system may be used to estimate the risk scores of individual patients with alcohol-related HCC, predict survival and inform treatment decisions.
Previous studies have identified lncRNAs as prognostic biomarkers for HCC using the TCGA database (31,32). To the best of our knowledge, the present study is the first to analyze alcohol-related HCC. The present study identified eight lncRNAs as potential prognostic biomarkers for alcohol-related HCC. Among them, LINC01605 has been demonstrated to be upregulated in bladder cancer tissues and may be associated with poor prognosis (33), whereas ERVH48-1 has been identified as a prognostic biomarker for tongue squamous cell carcinoma (34). The remaining potential biomarkers from the present study (AC012640.1, AC013451.2, AC062004.1, LINC02334, LINC02043, and AL139385) do not appear to have been analyzed in detail. The eight lncRNAs in this model appear to be involved in chemical carcinogenesis, metabolism and the cell cycle. Investigating these lncRNA-mediated pathways may provide new insights into the development of alcohol-related HCC.
There are some limitations in this study. First, HCC treatment types were not included in the multivariate Cox regression due to lack of data. Second, Cox analyses may be less accurate because some clinical data were missing for some patients. Third, the sample was relatively small, and as a result the present study could not divide the samples into training and test datasets for determining and validating the model. Thus, the findings of the present study should be verified and extended in larger studies.
Despite these limitations, the results of the present study suggested that an lncRNA-based risk scoring system may predict the risk of poor prognosis in patients with alcohol-related HCC. Eight lncRNAs were identified as independent prognostic variables for alcohol-related HCC.
Acknowledgements
Not applicable.
Funding
This study was supported by the Guangxi Key Research and Development Program (grant no. GuiKeAB16380215), Guangxi First-class Discipline Project for Basic Medicine Sciences (grant no. GXFCDP-BMS-2018) and Guangxi Medical University Training Program for Distinguished Young Scholars.
Availability of data and material
The datasets used during the present study are included in this published article.
Authors' contributions
YUL and JY designed and performed the research, analyzed and interpreted the data, and drafted the manuscript. JW collected and analyzed the data. YOL and JZ conceived the study, designed the methodology and reviewed the manuscript. All authors read and approved the final version of the manuscript.
Ethics approval and consent to participate
Not applicable.
Patient consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
References
1. Villanueva A: Hepatocellular carcinoma. N Engl J Med. 380:1450–1462. 2019.
2. Matsushita H and Takaki A: Alcohol and hepatocellular carcinoma. BMJ Open Gastroenterol. 6:e000260. 2019.
3. Petrick JL, Kelly SP, Altekruse SF, McGlynn KA and Rosenberg PS: Future of hepatocellular carcinoma incidence in the United States forecast through 2030. J Clin Oncol. 34:1787–1794. 2016.
4. El-Serag HB and Rudolph KL: Hepatocellular carcinoma: Epidemiology and molecular carcinogenesis. Gastroenterology. 132:2557–2576. 2007.
5. Zucman-Rossi J, Villanueva A, Nault JC and Llovet JM: Genetic landscape and biomarkers of hepatocellular carcinoma. Gastroenterology. 149:1226–1239 e1224. 2015.
6. Levrero M and Zucman-Rossi J: Mechanisms of HBV-induced hepatocellular carcinoma. J Hepatol. 64 (1 Suppl):S84–S101. 2016.
7. Louafi S, Boige V, Ducreux M, Bonyhay L, Mansourbakht T, de Baere T, Asnacios A, Hannoun L, Poynard T and Taïeb J: Gemcitabine plus oxaliplatin (GEMOX) in patients with advanced hepatocellular carcinoma (HCC): Results of a phase II study. Cancer. 109:1384–1390. 2007.
8. Hung T and Chang HY: Long noncoding RNA in genome regulation: Prospects and mechanisms. RNA Biol. 7:582–585. 2010.
9. He Y, Meng XM, Huang C, Wu BM, Zhang L, Lv XW and Li J: Long noncoding RNAs: Novel insights into hepatocelluar carcinoma. Cancer Lett. 344:20–27. 2014.
10. Cheetham SW, Gruhl F, Mattick JS and Dinger ME: Long noncoding RNAs and the genetics of cancer. Br J Cancer. 108:2419–2425. 2013.
11. Gibb EA, Brown CJ and Lam WL: The functional role of long non-coding RNA in human carcinomas. Mol Cancer. 10:38. 2011.
12. Hauptman N and Glavac D: Long non-coding RNA in cancer. Int J Mol Sci. 14:4655–4669. 2013.
13. Yang Z, Zhou L, Wu LM, Lai MC, Xie HY, Zhang F and Zheng SS: Overexpression of long non-coding RNA HOTAIR predicts tumor recurrence in hepatocellular carcinoma patients following liver transplantation. Ann Surg Oncol. 18:1243–1250. 2011.
14. Geng YJ, Xie SL, Li Q, Ma J and Wang GY: Large intervening non-coding RNA HOTAIR is associated with hepatocellular carcinoma progression. J Int Med Res. 39:2119–2128. 2011.
15. Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, Tsai MC, Hung T, Argani P, Rinn JL, et al: Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 464:1071–1076. 2010.
16. Yang L, Peng X, Li Y, Zhang X, Ma Y, Wu C, Fan Q, Wei S, Li H and Liu J: Long non-coding RNA HOTAIR promotes exosome secretion by regulating RAB35 and SNAP23 in hepatocellular carcinoma. Mol Cancer. 18:78. 2019.
17. Kim KS and Lee YI: Biallelic expression of the H19 and IGF2 genes in hepatocellular carcinoma. Cancer Lett. 119:143–148. 1997.
18. Zhu XT, Yuan JH, Zhu TT, Li YY and Cheng XY: Long noncoding RNA glypican 3 (GPC3) antisense transcript 1 promotes hepatocellular carcinoma progression via epigenetically activating GPC3. FEBS J. 283:3739–3754. 2016.
19. Ganne-Carrie N and Nahon P: Hepatocellular carcinoma in the setting of alcohol-related liver disease. J Hepatol. 70:284–293. 2019.
20. Pimpin L, Cortez-Pinto H, Negro F, Corbould E, Lazarus JV, Webber L and Sheron N; EASL HEPAHEALTH Steering Committee: Burden of liver disease in Europe: Epidemiology and analysis of risk factors to identify prevention policies. J Hepatol. 69:718–735. 2018.
21. Bucci L, Garuti F, Camelli V, Lenzi B, Farinati F, Giannini EG, Ciccarese F, Piscaglia F, Rapaccini GL, Di Marco M, et al: Comparison between alcohol- and hepatitis C virus-related hepatocellular carcinoma: Clinical presentation, treatment and outcome. Aliment Pharmacol Ther. 43:385–399. 2016.
22. Robinson MD, McCarthy DJ and Smyth GK: edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 26:139–140. 2009.
23. Heagerty PJ, Lumley T and Pepe MS: Time-dependent ROC curves for censored survival data and a diagnostic marker. Biometrics. 56:337–344. 2000.
24. Yu G, Wang LG, Han Y and He QY: clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS. 16:284–287. 2012.
25. Budhu A, Forgues M, Ye QH, Jia HL, He P, Zanetti KA, Kammula US, Chen Y, Qin LX, Tang ZY and Wang XW: Prediction of venous metastases, recurrence, and prognosis in hepatocellular carcinoma based on a unique immune response signature of the liver microenvironment. Cancer Cell. 10:99–111. 2006.
26. Lin Y, Liang R, Qiu Y, Lv Y, Zhang J, Qin G, Yuan C, Liu Z, Li Y, Zou D and Mao Y: Expression and gene regulation network of RBM8A in hepatocellular carcinoma based on data mining. Aging (Albany NY). 11:423–447. 2019.
27. Wu Y, Zheng S, Yao J, Li M, Yang G, Zhang N, Zhang S and Zhong B: Decreased expression of protocadherin 20 is associated with poor prognosis in hepatocellular carcinoma. Oncotarget. 8:3018–3028. 2017.
28. Yang X, Xie X, Xiao YF, Xie R, Hu CJ, Tang B, Li BS and Yang SM: The emergence of long non-coding RNAs in the tumorigenesis of hepatocellular carcinoma. Cancer Lett. 360:119–124. 2015.
29. Xiong H, Li B, He J, Zeng Y, Zhang Y and He F: lncRNA HULC promotes the growth of hepatocellular carcinoma cells via stabilizing COX-2 protein. Biochem Biophys Res Commun. 490:693–699. 2017.
30. Yuan JH, Yang F, Wang F, Ma JZ, Guo YJ, Tao QF, Liu F, Pan W, Wang TT, Zhou CC, et al: A long noncoding RNA activated by TGF-β promotes the invasion-metastasis cascade in hepatocellular carcinoma. Cancer Cell. 25:666–681. 2014.
31. Zhang Z, Li J, He T, Ouyang Y, Huang Y, Liu Q, Wang P and Ding J: The competitive endogenous RNA regulatory network reveals potential prognostic biomarkers for overall survival in hepatocellular carcinoma. Cancer Sci. 110:2905–2923. 2019.
32. Lin P, Wen DY, Li Q, He Y, Yang H and Chen G: Genome-wide analysis of prognostic lncRNAs, miRNAs, and mRNAs forming a competing endogenous RNA network in hepatocellular carcinoma. Cell Physiol Biochem. 48:1953–1967. 2018.
33. Qin Z, Wang Y, Tang J, Zhang L, Li R, Xue J, Han P, Wang W, Qin C, Xing Q, et al: High LINC01605 expression predicts poor prognosis and promotes tumor progression via up-regulation of MMP9 in bladder cancer. Biosci Rep. 38:BSR20180562. 2018.
34. Zhang S, Cao R, Li Q, Yao M, Chen Y and Zhou H: Comprehensive analysis of lncRNA-associated competing endogenous RNA network in tongue squamous cell carcinoma. PeerJ. 7:e6397. 2019.