Development and validation of an ultra-high sensitive next-generation sequencing assay for molecular diagnosis of clinical oncology
- Authors:
- Published online on: September 26, 2016 https://doi.org/10.3892/ijo.2016.3707
- Pages: 2088-2104
Abstract
Introduction
Cancer is a genomic disease harboring a cocktail of mutated genes. Personalized medicine approaches based on molecular studies and cytogenetic analysis can treat with therapies directly on mutated cancer driving genes (1–4). For example, crizotinib (PF-02341066), a small-molecular inhibitor of the anaplastic lymphoma kinase (ALK), and kinase inhibitor vemurafenib (PLX4032) against BRAF (5–7), both have dramatic effects on most patients with corresponding driver mutations. In fact, hundreds of frequent somatic mutations, which involved in multiple cellular pathways, have been identified in different types of cancer during the past decades (8), and more comprehensive diagnostic approaches are needed to identify the individual driver mutations which have important impact on tumor progression in different cancer patients (9) and thus, could serve as therapeutic targets in clinical treatment. To assess the status of these biomarkers, several approaches have been implemented in clinical diagnosis, such as fluorescence in situ hybridization (FISH), immunohistochemistry (IHC) and Sanger methodology (10–13). However, due to the high cost and technical limitations, it is unaffordable to do the multiplexed assessment of driving somatic alterations.
NGS has already been used to identify hundreds of driving mutations and analyze tens of thousands of tumor samples in a high-throughput with increased performance and decreased costs (14–16), which makes it possible to serve as a clinical testing approach. In reality, commercial NGS-based assays have already been developed and validated to provide comprehensive genomic test in clinic (17–20). These assays usually have a good performance when detecting variants with high mutant allele frequencies (MAF >10%). However, variants with low MAF usually appear in tumor tissues for many reasons, including contaminating normal cells and intra-tumor heterogeneity (21,22). Therefore, it is critical to develop a robust clinical assay that can detect low allele frequency mutations. Here we developed an ultra-high sensitive NGS-based assay, which interrogates all 7011 exons of 483 cancer-related genes and 94 introns of 18 genes with re-arrangement. Using the Illumina HiSeq X platform, hybridization-based capture of target regions reached a high-coverage (>2000X) with acceptable cost. With in-house data analysis approaches, we could identify low MAF (0.5%) variants from sequencing error accurately. We used pools of mixed cell lines with known alterations to perform analytical validation, and 35 FFPE tissue samples to confirm the specificity of low MAF variants detection performance in clinic by dPCR (23). In addition, ARMS-PCR (24) was used to confirm the overall specificity of our assay.
Materials and methods
NCP NGS design
Novo assay was developed to characterize SNV/INDEL, CNV and gene fusion in 483 cancer-related genes. These genes were selected based on My Cancer Genome database (https://www.mycancergenome.org), Catalogue of Somatic Mutations in Cancer (COSMIC) and other sources (18,25). Briefly, genes containing clinically important variants and genes have been reported as cancer-related were included based on a record of reimbursement in sequencing. All exons of these genes were considered which underwent hybridization-based capture from 483 cancer-related genes (Table I). For structural rearrangements detection, introns spanning recurrent fusion breakpoints were also included. Agilent's proprietary algorithm and synthetic process was used to generate the baits. The hybrid selection was done using a pool of 120-mer RNA-based baits (Agilent SureSelect) with overlap excess 3-fold for target region. All 47660 hybrid baits for catching target region constitute 2.3 Mb genomic positions, including 7011 exons and 94 introns.
Clinical specimens
Tumor specimens were collected from non-small cell lung cancer (NSCLC) and breast cancer patients at Chinese PLA General Hospital with informed consent according to the internal Review and rules of Ethics. In the very beginning of this assay, clinical samples should match several standards as follows to ensure downstream analysis. At least 10 slices of 5 μm FFPE sections or tissues with a volume of >1 was required. For each sample, hematoxylin-eosin stained slides (Fig. 1) were prepared and reviewed by a pathologist to estimate tumor purity. All samples with <50% tumor purity were marked for tumor enrichment by microdis-section to minimize contamination from normal cells (Fig. 2).
Cell line sample collection
Normal cell lines harboring the population distribution of known germ line variants were mixed, and multiplexed pools with low MAF variants were used to assess and validate the limit of variant detection. First of all, to get the variants set for assessment, we sequenced 5 cell lines from the 1000 Genomes Project (26) individually and got the SNP and INDEL sites from dbSNP database (build 146) consistent with a homozygous (MAF >90%) or heterozygous (40%<MAF<60%). To estimate the INDEL detection performance, 3 additional cell lines from COSMIC database (http://cancer.sanger.ac.uk/cancergenome/projects/cosmic/) which also were sequenced individually to get the original MAF of cancer-related somatic variants in each sample. All 8 cell lines were mixed together in designed proportions, and the expected MAF of each variant was calculated on the mixed ratios (Table II). Eventually, we achieved the 2625 variants spanning a range of expected MAF (0.5–20%) and INDEL lengths (1–40 base pair, bp) as gold-standard (Table III). Cell lines obtained from Coriell Institute (http://ccr.coriell.org/) and ATCC (http://www.atcc.org/) were routinely cultured in Dulbecco's modified Eagle's media (DMEM) with 10% heat-inactivated fetal bovine serum (FBS; Invitrogen, Waltham, MA, USA) in a 75-cm2 cell culture flask. The cells were seeded into cell culture flasks at a concentration of 1×105 viable cells/ml and incubated at 37°C in a humidified atmosphere containing 5% CO2.
Library preparation and sequencing
Generally, genome DNA extracted was performed using DNeasy Blood & Tissue kit (Qiagen, Hilden, Germany). For FFPE sample special, DNA was isolated using the GeneRead DNA FFPE kit (Qiagen, Valencia, CA, USA) following the protocol. Besides the purification of high yields of DNA from FFPE tissue sections, this kit could remove deaminated cytosine to prevent false results in sequencing (27). The ratio of absorbance at 260 and 280 nm is used to assess the purity of extracted DNA, and we used the Qubit® Quantitation Platform to quantitated DNA. A Covaris S220 focused-ultrasonicator (Covaris, Woburn, MA, USA) was used to fragment genomic DNA (500 ng) and an Agilent Bioanalyzer 2100 (Agilent Technologies) to ensure an average fragment size of 200 to 400 base pair (bp). The library preparation after fragmentation were done using instruction manual of KAPA Hyper Prep kit. The protocol included: i) repairing the DNA ends; ii) adding ‘A’ base to the DNA fragments; iii) ligating the paired-end adaptor; iv) purifying the sample using AMPure XP beads; and v) amplifying the adaptor-ligated library and purifying the sample using AMPure XP beads. Prepared library was hybridized using NCP custom designed baits as described in SureSelectQXT (Agilent Technologies) and the product was then amplified for 14 PCR cycles. The size range of the prepared library was assessed using Agilent 2100 Bioanalyzer and qualified using ABI StepOnePlus. The concentration of each library was quantified using qPCR NGS Library Quantification kit and Protocol was used to calculate the final pooling volume to sequencing. The products were sequenced using the Illumina HiSeq X platform with paired-end sequencing runs (2×150) under Illumnina recommended protocols.
Data analysis
Clean data were generated by data processing steps including base calling, demultiplexing and adapter trimming. All these steps were performed using Illumina HiSeq X vendor software on default parameters. We further performed our in-house software for clean data quality control (QC) which included: i) removing read pairs if any one of the two reads containing base ‘N’ >10%; ii) removing read pairs if any one of the two reads containing base with quality below Q10 >50%; iii) trimming the 3′ end of the read from the first base below Q20; and iv) removing reads shorter than 100 bp. Clean data after QC were mapped to the human reference genome (GRCh37) using BWA aligner v0.7.8 (28) with the default parameters. PCR duplicate read removal was done using Picard 1.119 (http://picard.sourceforge.net/index.html). According to the result, a sequence metric collection was generated including the number of total reads, percentage of reads mapped, on target reads number, average target coverage and percentage of target region with >200X and 1000X coverage. Before SNV and INDEL calling, local realignment was performed using Genome Analysis Toolkit (GATK version 2.7–2-g6bda569) (29,30) with default parameters and recommended ‘known sites’ in GATK best practice (https://software.broadinstitute.org/gatk/best-practices/). For SNV detection, we denote the reference allele and the coverage of each site as r and d and denote the error rate corresponding to the base calling at read i (i = 1…d) as ei. We used a null model to explain the data in which there is no SNV at that site and all non-reference alleles to be sequencing error. The number of variant bases (k) with ei <1e−3 (associated Phred-like quality score qi>30) in each site was then given a binomial distribution. The probability under this null model was given by the following formula:
P(X≥k∣d)=1-∑i=0k-1P(X=i∣d)where P(X = i|d) was the probability of observing i variants in the d reads of the site. Assuming the sequencing errors were independent across reads and occurred with probability e0 (e0 = 1e−3/3) to each non-reference allele. We could obtain
P(X=i∣d)=(dk)e0k(1-e0)d-kThe P-value was then given by P(X≥k|d) and the cut-off (P-value <1e−6) was established to eliminate random sequencing error. For INDEL detection, we simply kept variants supporting reads >10. We also employed several filters to reduce systematic errors. Empirical filters including strand bias (Fisher's exact test, P<1e−6), site median base quality (MBQ >30), site median mapping quality (MMQ >30), variant MAF (MAF >0.5%). Variants pass filters were annotated by dbSNP b146, My Cancer Genome database (https://www.mycancergenome.org) and Oncomine database v1.4.1 to get the clinical relevant information. However, cross library contamination may occur and a report would not be generated once the sample contained >10 variants with low-MAF (MAF ≤10%) in dbSNP. In the report stage, all annotated variants with MAF ≥5% would be reported and other cancer-related variants would be validated by 3dPCR. The whole workflow for the data analysis is outlined in Fig. 3. The parameters and descriptions used are listed in Table IV.
Compared with other software
To measure the effect of our approach, we compared the pooled cell-line result with GATK, a widely used software. We followed the ‘GATK best practice’, the ‘IndelRealigner’ parameter ‘LOD_Threshold_For_Cleaning’ was 0.3, the ‘BaseRecalibrator’ was with default parameters, the SNV/INDEL calling type was ‘HaplotypeCaller’ with parameters ‘standard_min_confidence_threshold_for_emitting’ as 10 and ‘standard_min_confidence_threshold_for_calling’ as 30.
Performance statistics calculation
For sensitivity estimation, variants detected in pools would be assigned as true positive (TP), or false negative (FN) if not detected. Sensitivity was calculated as TP/(TP+FN). For specificity estimation, the pool variants also detected in the pure sample were assigned as true positive (TP), or false positive (FP) if none was detected. PPV was calculated as TP/(TP+FP).
Mutation detection by dPCR
dPCR is a method used in absolute quantification analysis of clonally amplified nucleic acids (including DNA, cDNA, methylated DNA or RNA). With dPCR, a sample is partitioned so that individual nucleic acid molecules within the sample are localized and concentrated within many separate regions. After PCR amplification, nucleic acids may be quantified by counting the regions that contain PCR end-product, positive reactions. Here, we used the QuantStudio™ 3D Digital PCR System platform (Life Technologies) regarding SNP mutation quantitation. For dPCR, the first step is preparing and loading samples onto QuantStudio™ 3D Digital PCR 20K chips. Mutations were analysed by TaqMan® SNP Genotyping Assays (Life Technologies), which containing TaqMan®-MGB probes and primers. We prepared 15 μl reaction mixes according to the manufacturer's instructions, and loaded 14.5 μl onto each chip. The Mix contains ROX® dye, which served as a passive reference. After chips were loaded, we run the Digital PCR 20K Chips with a ProFlex™ 2x Flat PCR System under the following conditions: 96°C for 10 min, 39 cycles at 56°C for 2 min and at 98°C for 30 sec, followed by a final extension step at 56°C for 2 min. After thermo-cycling, we analyzed the prepared chips using dPCR instrument.
Mutation detection by ARMS-PCR
ARMS-PCR is a real-time PCR-based test which covers the 29 EGFR hotspots from exon 18–21. The assay was performed according to the manufacturer's protocol for the ADx EGFR29 Mutation kit (Amoy Diagnostics, Co., Ltd., Xiamen, China) with the MX3000P (Stratagene, La Jolla, CA, USA) real-time PCR system. Template DNA (0.4 μl), 3.6 μl deionized water and 16 μl other reaction components was used in the RT-PCR reaction system. PCR was performed with initial denaturation at 95°C for 10 min, followed by 40 cycles of amplification (at 95°C for 30 sec and 61°C for 1 min). The results were analyzed according to the criteria defined by the manufacturer's instructions. Positive results were defined as [Ct(sample) − Ct(control)] < Ct(cut-off).
Results
Overview
NCP is a NGS-based clinical test for detection of somatic cancer related mutations. DNA was extracted from tumor tissues and FFPE samples, 500 ng of which was fragmented, captured using custom-designed hybridization-based biotinylated cRNA reagents and amplified via limited-cycle PCR to enrich 7,011 exons and 94 introns of 483 cancer related genes (totaling ~2.3 million sites). We used clinical samples to generate the bioinformatics pipeline for data analysis (Table IV) and cell lines to validate the whole work flow. For the 8 single cell lines, using the Illumina HiSeq X platform, achieving an average of 13,330 Mb (SD=3,995 Mb) total bases with 38.09% on-target (SD=4.78%), target regions were sequenced to 2148X (SD=537X) median coverage across targeted bases, with 99.05% (SD=0.28%) of targeted bases covered by at least 200 reads (Table V). The 2453 SNV and 172 INDEL detected in single cell line consistent with database would be used for assessment of SNV/INDEL detection. Pools of mixed cell lines were used to get the relationship between median coverage and performance, which achieved total bases of 4,762, 10,896 and 16,351 Mb, the median coverage of 1,029X, 2,237X and 3,194X (Table VI). Due to the high sensitivity NGS benefit from high coverage, the hotspot mutations with MAF <5% detected by this assay in 35 FFPE samples were confirmed by dPCR. All samples used in this test are summarized in Table VII. Finally, 33 hotspot mutations detected by NGS in FFPE samples with a MAF from 2 to 63% in NGS were tested by ARMS-PCR.
SNV detection performance
SNV detection was performed using a Binomial methodology allowing the detection of low MAF somatic mutations across the 2.3 Mb assayed with high sensitivity. For the mixed cell line pools, overall SNV detection performance was high, the results of different depth are shown in Table VIII, for an average depth of 2237, 100% (95% CI, 95.1–100%) of SNV at MAF >10% were successfully detected, as well as 99% (95% CI, 98.6–100%) of SNV at MAF 5–10%. The detection of SNV with MAF between 0.5–5% performance was 92.2% (95% CI, 90.7–93.5%) (Fig. 4A and C and Table VIIIA). In addition, high sensitivity was accompanied with good PPV (the fraction of SNV calls in the pools can also be detected in any of the individual cell lines; Table VIIIB) 99.2% (95% CI, 99–99.4%). The false positives may be due to variants with such a low MAF (<5%) no difference with sequencing noise could hardly be identified. A dPCR confirmation for cancer-related SNV with MAF <5% reported by NGS is necessary before reporting.
INDEL detection performance
For INDEL detection, we simply discarded the variants supporting less than 10 reads. The results of different depth are shown in Table IX, for an average depth of 2237, 100% (95% CI, 29.2–100%) of INDEL at MAF >10% were successfully detected, as well as 94.7% of INDEL (95% CI, 74–99.9%) with MAF between 5–10%. Low MAF sites detected performance was 91.5% (95% CI, 85–100%), the performance of variants with MAF <0.5% was also calculated (Fig. 4B and D and Table IXA). Few false-positive calls were observed, with a PPV of 98.2% (95% CI, 97.2–98.9%) (Table IXB). Like SNV detection, due to the false positive under 10%, a dPCR confirmation of these cancer-related INDEL with MAF <10% before reporting is needed.
Comparison with other bioinformatics approaches
We evaluated the performance of our bioinformatics pipeline with the cell line models above, focusing on two key steps of our approach. First, we applied statistical models that allow for the identification of a mutation at low MAF from random errors in Illumina sequencing. Second, we used priori knowledge to identify systematic errors always accompanied with specific characteristics, such as strand bias and low base/mapping quality. To measure the effect of our approach, we compared the pooled cell-line result with GATK - widely used software. The GATK detection sensitivity of SNV with MAF >10% was 64.38% (95% CI, 52.3–75.3%), and SNV with 5%<MAF<10% was under 10% but the PPV was 100% (95% CI, 99.7–100%). The sensitivity of INDEL with MAF >10% was 67% (95% CI, 9.4–99.2%), and a high PPV 100% (95% CI, 99–100%) (Tables X and XI), possibly because this widely used tool is designed for whole-genome or whole-exon sequencing data with relatively low depth and variants with high allele frequency, which underline that appropriate filters for ultra-deep sequencing data analysis were critical. Actually, compared with slight performance upgrades under increased coverage depth, the effect of appropriate filters was remarkable in this test.
Concordance between NGS and other approaches
The above studies demonstrate that the NGS-based test has the performance characteristics necessary to accurately detect SNV and INDEL. We further validated test accuracy by comparisons to dPCR for 35 FFPE cancer specimens. To assess the accuracy of low MAF SNV and INDEL detection in routine clinical cancer samples, we selected 35 FFPE resection specimens (31 non-small cell lung cancer, 1 parathyroid carcinoma, 3 breast cancers) previously tested for hotspot mutations in PIK3Ca, EGFR, KRAS and BRAF by NGS, every hotspot mutations detected by NGS, but with MAF <5% would be tested by dPCR. In addition, 32 of 35 (PPV=91.43%, 95% CI, 76.94–98.20%) variants have been supported to be true-positive by dPCR (Tables XII and XIII). Three variants were present at <3% MAF in NGS that were not detected by dPCR. The detected MAF of the two technologies is shown in Fig. 5. Finally, we random selected 33 FFPE samples (NSCLC) with hotspot mutations and performed the ARMS-PCR to verify the overall PPV of our assay. As a result, all 33 mutations could be detected by ARMS-PCR and the PPV was 100% (95% CI, 89.42–100%; Table XIV).
Discussion
Cancer diagnostic is undergoing a rapid development (31), routine tests like FISH and IHC can only detect limited known variants, besides it fully relies on the doctor's experience. PCR-based approach, like Sanger sequencing or dPCR used by us in this study, still cannot test multiple sites in one run. Furthermore, Sanger sequencing cannot detect variants with MAF under 10% (32) and dPCR waste too many samples, which remain problems for clinical application. The NGS-based test with increased access and decreased cost has more advantages in comprehensive detection of the cancer-related mutations (33–35). For detecting mutations with low frequency, NGS-based test with high sensitivity is needed. However, high sensitivity always comes with false-positives, which may lead to suboptimal treatment. Finally, some other factors, like DNA damage and contamination in clinical samples (36,37), make it critical to generate a complex validation of NGS assay.
In the present study, we developed and validated the NGS-based assay, using germ line mutations in 1000 genome cell lines and certain somatic INDEL in cosmic database to simulate the tumor heterogeneity or impurity in clinical samples. We mixed these samples to measure the analytic sensitivity and PPV of NCP assay at low MAF and used 3 pools to obtain the correlation between median coverage and variants detection performance. The performance of our test was high for variants with MAF >5%. In cell line model with 2236X median coverage, sensitivity was 99.8% for SNP, 94.7% for INDEL with a PPV of 99 and 98%. The 0.5%<MAF<5% variant sensitivity was 92.2% for SNV and 91.5% for INDEL which was not desirable. Because of the complexity of 483 genes, it was difficult to ensure such low MAF variant detection sensitivity. On the other hand, we confirmed the low MAF detection by dPCR which could identify rare mutations specifically. We also compared our bioinformatics pipeline with common pipeline GATK (29,30), which is widely used in genotype analysis. The overall PPV was high at the expense of sensitivity, which may be due to these approaches being developed to call germ line variants. The results highlighted that appropriate filtering approach is critical for low MAF variant detection. Actually, the filters were more important than the increase of coverage depth as showed in the different coverage tests. For specificity analysis, each called variant was classified as a false positive if a matching alteration was not detected in the pure sample. However, this approach could not recognize the false positive generated by systematic errors. Given the high sensitivity of this technology, high-throughput clinical trials are required to confirm its reliability for the molecular diagnosis of cancer (38). Therefore, 35 patient specimens previously tested by NCP assay and having low MAF <5% variants were used to test in parallel by dPCR. The correlation coefficient of NGS and dPCR was low (0.78) and 32 of 35 (91.43%) NGS detected variants could be confirmed by dPCR. The discordance was possibly due to the heterogeneity in tumor specimens or false positive in NGS, the dPCR verification is needed for such low MAF variants before reporting. Like low MAF variants, we used ARMS-PCR to test the 33 random selected FFPE samples with hotspot mutations detected by NGS and obtained a high concordance (PPV=100%).
Taken together, we used high sequencing coverage and a statistical test with several hard filters generated from clinical samples to separate low MAF SNV/INDEL from false positives. To balance the cost of NGS and accuracy of variant calls for low MAF variants, we used pooled cell line models with certain germ line SNP in different data size to get the relationship accuracy between data size and variants. From this test, we validated the best target median coverage (2000X) that can meet the analysis requirement, whereas the low MAF variants detection needed to be corrected by dPCR. On the other hand, the overall performance of this assay was good in the ARMS-PCR test. However, our results cannot meet the requirement of different variant types in clinical use like other NGS-based approaches (17–20,39), which is one of the most important aspects for NGS compared to other traditional approaches. Furthermore, due to the DNA requirement of dPCR verification and quantity of extraction in plasma (40,41), this NGS-dPCR combined approach could only be used in FFPE sample but not plasma. With the advantages of non-invasive and overcome tumor-heterogeneity (42–44), the sequencing of plasma sample still needed more study. To reduce the sequencing errors confound with rare mutations, a NGS method termed Duplex sequencing was developed these years and may be useful in future plasma sequencing (45–47). In addition, given the capability of NGS test to detect variants with low MAF, the correlation between the NGS clinical report and the effect of targeted therapy still need further assessment (48). Finally, our NCP assay can give more mutation information and thus expand the treatment choices for patients, but more efforts still need to be done for future cancer diagnostics.
References
Renfro LA, An MW and Mandrekar SJ: Precision oncology: A new era of cancer clinical trials. Cancer Lett. S0304-3835(16)30163-X. 2016. View Article : Google Scholar : PubMed/NCBI | |
Arteaga CL and Baselga J: Impact of genomics on personalized cancer medicine. Clin Cancer Res. 18:612–618. 2012. View Article : Google Scholar : PubMed/NCBI | |
MacConaill LE, Van Hummelen P, Meyerson M and Hahn WC: Clinical implementation of comprehensive strategies to characterize cancer genomes: Opportunities and challenges. Cancer Discov. 1:297–311. 2011. View Article : Google Scholar : PubMed/NCBI | |
Romano E, Schwartz GK, Chapman PB, Wolchock JD and Carvajal RD: Treatment implications of the emerging molecular classification system for melanoma. Lancet Oncol. 12:913–922. 2011. View Article : Google Scholar : PubMed/NCBI | |
Kwak EL, Bang YJ, Camidge DR, Shaw AT, Solomon B, Maki RG, Ou SH, Dezube BJ, Jänne PA, Costa DB, et al: Anaplastic lymphoma kinase inhibition in non-small-cell lung cancer. N Engl J Med. 363:1693–1703. 2010. View Article : Google Scholar : PubMed/NCBI | |
Shaw AT, Kim DW, Nakagawa K, Seto T, Crinó L, Ahn MJ, De Pas T, Besse B, Solomon BJ, Blackhall F, et al: Crizotinib versus chemotherapy in advanced ALK-positive lung cancer. N Engl J Med. 368:2385–2394. 2013. View Article : Google Scholar : PubMed/NCBI | |
Bollag G, Tsai J, Zhang J, Zhang C, Ibrahim P, Nolop K and Hirth P: Vemurafenib: The first drug approved for BRAF-mutant cancer. Nat Rev Drug Discov. 11:873–886. 2012. View Article : Google Scholar : PubMed/NCBI | |
Garraway LA and Lander ES: Lessons from the cancer genome. Cell. 153:17–37. 2013. View Article : Google Scholar : PubMed/NCBI | |
Pao W: New approaches to targeted therapy in lung cancer. Proc Am Thorac Soc. 9:72–73. 2012. View Article : Google Scholar : PubMed/NCBI | |
Thomas RK, Baker AC, Debiasi RM, Winckler W, Laframboise T, Lin WM, Wang M, Feng W, Zander T, MacConaill L, et al: High-throughput oncogene mutation profiling in human cancer. Nat Genet. 39:347–351. 2007. View Article : Google Scholar : PubMed/NCBI | |
MacConaill LE, Campbell CD, Kehoe SM, Bass AJ, Hatton C, Niu L, Davis M, Yao K, Hanna M, Mondal C, et al: Profiling critical cancer gene mutations in clinical tumor samples. PLoS One. 4:e78872009. View Article : Google Scholar : PubMed/NCBI | |
Tao YF, Wu D, Pang L, Zhao WL, Lu J, Wang N, Wang J, Feng X, Li YH, Ni J, et al: Analyzing the gene expression profile of pediatric acute myeloid leukemia with real-time PCR arrays. Cancer Cell Int. 12:1946–1958. 2012. | |
McCourt CM, Boyle D, James J and Salto-Tellez M: Immunohistochemistry in the era of personalised medicine. J Clin Pathol. 66:58–61. 2013. View Article : Google Scholar | |
Forbes SA, Bindal N, Bamford S, Cole C, Kok CY, Beare D, Jia M, Shepherd R, Leung K, Menzies A, et al: COSMIC: Mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 39(Database issue): D945–D950. 2010. View Article : Google Scholar : PubMed/NCBI | |
Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, Meyerson M, Gabriel SB, Lander ES and Getz G: Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. 505:495–501. 2014. View Article : Google Scholar : PubMed/NCBI | |
Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, Carter SL, Stewart C, Mermel CH, Roberts SA, et al: Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 499:214–218. 2013. View Article : Google Scholar : PubMed/NCBI | |
Frampton GM, Fichtenholtz A, Otto GA, Wang K, Downing SR, He J, Schnall-Levin M, White J, Sanford EM, An P, et al: Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat Biotechnol. 31:1023–1031. 2013. View Article : Google Scholar : PubMed/NCBI | |
Hovelson DH, McDaniel AS, Cani AK, Johnson B, Rhodes K, Williams PD, Bandla S, Bien G, Choppa P, Hyland F, et al: Development and validation of a scalable next-generation sequencing system for assessing relevant somatic variants in solid tumors. Neoplasia. 17:385–399. 2015. View Article : Google Scholar : PubMed/NCBI | |
Choudhary A, Mambo E, Sanford T, Boedigheimer M, Twomey B, Califano J, Hadd A, Oliner KS, Beaudenon S, Latham GJ, et al: Evaluation of an integrated clinical workflow for targeted next-generation sequencing of low-quality tumor DNA using a 51-gene enrichment panel. BMC Med Genomics. 7:622014. View Article : Google Scholar : PubMed/NCBI | |
Cheng DT, Mitchell TN, Zehir A, Shah RH, Benayed R, Syed A, Chandramohan R, Liu ZY, Won HH, Scott SN, et al: Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A hybridization capture-based next-generation sequencing clinical assay for solid tumor molecular oncology. J Mol Diagn. 17:251–264. 2015. View Article : Google Scholar : PubMed/NCBI | |
Cibulskis K, McKenna A, Fennell T, Banks E, DePristo M and Getz G: ContEst: Estimating cross-contamination of human samples in next-generation sequencing data. Bioinformatics. 27:2601–2602. 2011.PubMed/NCBI | |
Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, Martinez P, Matthews N, Stewart A, Tarpey P, et al: Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med. 366:883–892. 2012. View Article : Google Scholar : PubMed/NCBI | |
Kinz E, Leiherer A, Lang AH, Drexel H and Muendlein A: Accurate quantitation of JAK2 V617F allele burden by array-based digital PCR. Int J Lab Hematol. 37:217–224. 2015. View Article : Google Scholar | |
Shao D1, Lin Y, Liu J, Wan L, Liu Z, Cheng S, Fei L, Deng R, Wang J, Chen X, et al: A targeted next-generation sequencing method for identifying clinically relevant mutation profiles in lung adenocarcinoma. Sci Rep. 6:223382016. View Article : Google Scholar : PubMed/NCBI | |
Forbes SA1, Tang G, Bindal N, Bamford S, Dawson E, Cole C, Kok CY, Jia M, Ewing R, Menzies A, et al: COSMIC (the Catalogue of Somatic Mutations in Cancer): A resource to investigate acquired mutations in human cancer. Nucleic Acids Res. 8(Database issue): D652–D657. 2009. | |
Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME and McVean GA; 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature. 467:1061–1073. 2010. View Article : Google Scholar : PubMed/NCBI | |
Warrick JI, Hovelson DH, Amin A, Liu CJ, Cani AK, McDaniel AS, Yadati V, Quist MJ, Weizer AZ, Brenner JC, et al: Tumor evolution and progression in multifocal and paired non-invasive/invasive urothelial carcinoma. Virchows Arch. 466:297–311. 2015. View Article : Google Scholar | |
Li H and Durbin R: Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 26:589–595. 2010. View Article : Google Scholar : PubMed/NCBI | |
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al: The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20:1297–1303. 2010. View Article : Google Scholar : PubMed/NCBI | |
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, et al: A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 43:491–498. 2011. View Article : Google Scholar : PubMed/NCBI | |
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G and Durbin R; 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics. 25:2078–2079. 2009. View Article : Google Scholar : PubMed/NCBI | |
Arsenic R, Treue D, Lehmann A, Hummel M, Dietel M, Denkert C and Budczies J: Comparison of targeted next-generation sequencing and Sanger sequencing for the detection of PIK3CA mutations in breast cancer. BMC Clin Pathol. 15:202015. View Article : Google Scholar : PubMed/NCBI | |
Borad MJ, Champion MD, Egan JB, Liang WS, Fonseca R, Bryce AH, McCullough AE, Barrett MT, Hunt K, Patel MD, et al: Integrated genomic characterization reveals novel, therapeutically relevant drug targets in FGFR and EGFR pathways in sporadic intrahepatic cholangiocarcinoma. PLoS Genet. 10:e10041352014. View Article : Google Scholar : PubMed/NCBI | |
Hadd AG, Houghton J, Choudhary A, Sah S, Chen L, Marko AC, Sanford T, Buddavarapu K, Krosting J, Garmire L, et al: Targeted, high-depth, next-generation sequencing of cancer genes in formalin-fixed, paraffin-embedded and fine-needle aspiration tumor specimens. J Mol Diagn. 15:234–247. 2013. View Article : Google Scholar : PubMed/NCBI | |
Roychowdhury S, Iyer MK, Robinson DR, Lonigro RJ, Wu YM, Cao X, Kalyana-Sundaram S, Sam L, Balbin OA, Quist MJ, et al: Personalized oncology through integrative high-throughput sequencing: A pilot study. Sci Transl Med. 3:111ra1212011. View Article : Google Scholar : PubMed/NCBI | |
Kerick M, Isau M, Timmermann B, Sültmann H, Herwig R, Krobitsch S, Schaefer G, Verdorfer I, Bartsch G, Klocker H, et al: Targeted high throughput sequencing in clinical cancer settings: Formaldehyde fixed-paraffin embedded (FFPE) tumor tissues, input amount and tumor heterogeneity. BMC Med Genomics. 4:682011. View Article : Google Scholar : PubMed/NCBI | |
Schweiger MR, Kerick M, Timmermann B, Albrecht MW, Borodina T, Parkhomchuk D, Zatloukal K and Lehrach H: Genome-wide massively parallel sequencing of formaldehyde fixed-paraffin embedded (FFPE) tumor tissues for copy-number- and mutation-analysis. PLoS One. 4:e55482009. View Article : Google Scholar : PubMed/NCBI | |
Chevrier S, Arnould L, Ghiringhelli F, Coudert B, Fumoleau P and Boidot R: Next-generation sequencing analysis of lung and colon carcinomas reveals a variety of genetic alterations. Int J Oncol. 45:1167–1174. 2014.PubMed/NCBI | |
Cottrell CE, Al-Kateb H, Bredemeyer AJ, Duncavage EJ, Spencer DH, Abel HJ, Lockwood CM, Hagemann IS, O'Guin SM, Burcea LC, et al: Validation of a next-generation sequencing assay for clinical molecular oncology. J Mol Diagn. 16:89–105. 2014. View Article : Google Scholar | |
Haber DA and Velculescu VE: Blood-based analyses of cancer: Circulating tumor cells and circulating tumor DNA. Cancer Discov. 4:650–661. 2014. View Article : Google Scholar : PubMed/NCBI | |
Arnedos M, Vicier C, Loi S, Lefebvre C, Michiels S, Bonnefoi H and Andre F: Precision medicine for metastatic breast cancer--limitations and solutions. Nat Rev Clin Oncol. 12:693–704. 2015. View Article : Google Scholar : PubMed/NCBI | |
Newman AM, Bratman SV, To J, Wynne JF, Eclov NC, Modlin LA, Liu CL, Neal JW, Wakelee HA, Merritt RE, et al: An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage. Nat Med. 20:548–554. 2014. View Article : Google Scholar : PubMed/NCBI | |
Ignatiadis M and Dawson SJ: Circulating tumor cells and circulating tumor DNA for precision medicine: Dream or reality? Ann Oncol. 25:2304–2313. 2014. View Article : Google Scholar : PubMed/NCBI | |
Lipson EJ, Velculescu VE, Pritchard TS, Sausen M, Pardoll DM, Topalian SL and Diaz LA Jr: Circulating tumor DNA analysis as a real-time method for monitoring tumor burden in melanoma patients undergoing treatment with immune checkpoint blockade. J Immunother Cancer. 2:422014. View Article : Google Scholar : PubMed/NCBI | |
Schmitt MW, Kennedy SR, Salk JJ, Fox EJ, Hiatt JB and Loeb LA: Detection of ultra-rare mutations by next-generation sequencing. Proc Natl Acad Sci USA. 109:14508–14513. 2012. View Article : Google Scholar : PubMed/NCBI | |
Kennedy SR, Schmitt MW, Fox EJ, Kohrn BF, Salk JJ, Ahn EH, Prindle MJ, Kuong KJ, Shen JC, Risques RA, et al: Detecting ultralow-frequency mutations by Duplex Sequencing. Nat Protoc. 9:2586–2606. 2014. View Article : Google Scholar : PubMed/NCBI | |
Newman AM, Lovejoy AF, Klass DM, Kurtz DM, Chabon JJ, Scherer F, Stehr H, Liu CL, Bratman SV, Say C, et al: Integrated digital error suppression for improved detection of circulating tumor DNA. Nat Biotechnol. 34:547–555. 2016. View Article : Google Scholar : PubMed/NCBI | |
Luo H, Li H, Hu Z, Wu H, Liu C, Li Y, Zhang X, Lin P, Hou Q, Ding G, et al: Noninvasive diagnosis and monitoring of mutations by deep sequencing of circulating tumor DNA in esophageal squamous cell carcinoma. Biochem Biophys Res Commun. 471:596–602. 2016. View Article : Google Scholar : PubMed/NCBI |