Open Access

Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods

  • Authors:
    • Youlin Tuo
    • Ning An
    • Ming Zhang

  • Published online on: January 9, 2018     https://doi.org/10.3892/mmr.2018.8398
  • Pages: 4281-4290
  • Copyright: © Tuo et al. This is an open access article distributed under the terms of Creative Commons Attribution License.


Abstract

The aim of the present study was to investigate the feature genes in metastatic breast cancer samples. A total of 5 expression profiles of metastatic breast cancer samples were downloaded from the Gene Expression Omnibus database and analyzed using the MetaQC and MetaDE packages in R. Feature genes distinguishing metastatic from non‑metastatic samples were screened using a threshold of P<0.05. Based on the protein‑protein interactions (PPIs) in the Biological General Repository for Interaction Datasets, the Human Protein Reference Database and the Biomolecular Interaction Network Database, the PPI network of the feature genes was constructed. The feature genes identified by topological characteristics were then used for support vector machine (SVM) classifier training and verification. The accuracy of the SVM classifier was then evaluated using another independent dataset from The Cancer Genome Atlas database. Finally, function and pathway enrichment analyses for genes in the SVM classifier were performed. A total of 541 feature genes were identified between metastatic and non‑metastatic samples. The top 10 genes with the highest betweenness centrality values in the PPI network of feature genes were Nuclear RNA Export Factor 1, cyclin‑dependent kinase 2 (CDK2), myelocytomatosis proto‑oncogene protein (MYC), Cullin 5, SHC Adaptor Protein 1, Clathrin heavy chain, Nucleolin, WD repeat domain 1, proteasome 26S subunit non‑ATPase 2 and telomeric repeat binding factor 2. Cyclin‑dependent kinase inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1) and MYC interacted with CDK2. The SVM classifier constructed from the top 30 feature genes was able to distinguish metastatic samples from non‑metastatic samples [correct rate, specificity, positive predictive value and negative predictive value >0.89; sensitivity >0.84; area under the receiver operating characteristic curve (AUROC) >0.96]. The verification of the SVM classifier in an independent dataset (35 metastatic samples and 143 non‑metastatic samples) revealed an accuracy of 94.38% and an AUROC of 0.958. Cell cycle‑associated functions and pathways were the most significantly enriched terms for the 30 feature genes. An SVM classifier was constructed to assess the possibility of breast cancer metastasis, and it presented high accuracy in several independent datasets. CDK2, CDKN1A, E2F1 and MYC were indicated as the potential feature genes in metastatic breast cancer.
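
The performance measures named in the abstract (correct rate, sensitivity, specificity, positive and negative predictive value, AUROC) can be illustrated with a minimal sketch. This is not the authors' code. The class sizes (35 metastatic, 143 non‑metastatic) come from the abstract, and 94.38% of 178 samples is consistent with 168 correct calls, but the per‑class split of the 10 errors and the toy scores below are hypothetical values chosen only for illustration. The names `ConfusionCounts`, `classifier_metrics` and `auroc` are likewise invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    tp: int  # metastatic samples predicted metastatic
    fn: int  # metastatic samples predicted non-metastatic
    tn: int  # non-metastatic samples predicted non-metastatic
    fp: int  # non-metastatic samples predicted metastatic


def classifier_metrics(c: ConfusionCounts) -> dict:
    """Standard confusion-matrix metrics for a binary classifier."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "accuracy": (c.tp + c.tn) / total,        # "correct rate"
        "sensitivity": c.tp / (c.tp + c.fn),      # recall on metastatic
        "specificity": c.tn / (c.tn + c.fp),      # recall on non-metastatic
        "ppv": c.tp / (c.tp + c.fp),              # positive predictive value
        "npv": c.tn / (c.tn + c.fn),              # negative predictive value
    }


def auroc(scores_pos: list, scores_neg: list) -> float:
    """AUROC as the probability that a positive sample outscores a
    negative one (ties count one half), i.e. the Mann-Whitney estimate."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Class sizes from the abstract: 35 metastatic, 143 non-metastatic.
# HYPOTHETICAL error split: 5 false negatives and 5 false positives
# (the abstract reports only the overall accuracy, not this breakdown).
example = ConfusionCounts(tp=30, fn=5, tn=138, fp=5)
metrics = classifier_metrics(example)

# Toy decision scores, invented for illustration only.
toy_auroc = auroc([0.9, 0.8, 0.4], [0.5, 0.3, 0.2])

if __name__ == "__main__":
    for name, value in metrics.items():
        print(f"{name}: {value:.4f}")
    print(f"toy AUROC: {toy_auroc:.4f}")
```

Because the two classes are imbalanced (35 versus 143), accuracy alone can look high even when sensitivity is modest, which is one reason the abstract reports sensitivity, specificity and predictive values separately.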

March-2018
Volume 17 Issue 3

Print ISSN: 1791-2997
Online ISSN: 1791-3004

Spandidos Publications style
Tuo Y, An N and Zhang M: Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods. Mol Med Rep 17: 4281-4290, 2018
APA
Tuo, Y., An, N., & Zhang, M. (2018). Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods. Molecular Medicine Reports, 17, 4281-4290. https://doi.org/10.3892/mmr.2018.8398
MLA
Tuo, Y., An, N., Zhang, M. "Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods". Molecular Medicine Reports 17.3 (2018): 4281-4290.
Chicago
Tuo, Y., An, N., Zhang, M. "Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods". Molecular Medicine Reports 17, no. 3 (2018): 4281-4290. https://doi.org/10.3892/mmr.2018.8398