Integrating genomic features for non-invasive early lung cancer detection

Radiologic screening of high-risk adults reduces lung-cancer-related mortality 1 , 2 ; however, a small minority of eligible individuals undergo such screening in the United States 3 , 4 . The availability of blood-based tests could increase screening uptake. Here we introduce improvements to cancer personalized profiling by deep sequencing (CAPP-Seq) 5 , a method for the analysis of circulating tumour DNA (ctDNA), to better facilitate screening applications. We show that, although levels are very low in early-stage lung cancers, ctDNA is present prior to treatment in most patients and its presence is strongly prognostic. We also find that the majority of somatic mutations in the cell-free DNA (cfDNA) of patients with lung cancer and of risk-matched controls reflect clonal haematopoiesis and are non-recurrent. Compared with tumour-derived mutations, clonal haematopoiesis mutations occur on longer cfDNA fragments and lack mutational signatures that are associated with tobacco smoking. Integrating these findings with other molecular features, we develop and prospectively validate a machine-learning method termed ‘lung cancer likelihood in plasma’ (Lung-CLiP), which can robustly discriminate early-stage lung cancer patients from risk-matched controls. This approach achieves performance similar to that of tumour-informed ctDNA detection and enables tuning of assay specificity in order to facilitate distinct clinical applications. Our findings establish the potential of cfDNA for lung cancer screening and highlight the importance of risk-matching cases and controls in cfDNA-based screening studies. Circulating tumour DNA in blood is analysed to identify genomic features that distinguish early-stage lung cancer patients from risk-matched controls, and these are integrated into a machine-learning method for blood-based lung cancer screening.

[1]  C. Lam,et al.  Predominant hematopoietic origin of cell-free DNA in plasma and serum after sex-mismatched bone marrow transplantation. , 2002, Clinical chemistry.

[2]  Richard Simon,et al.  Roadmap for developing and validating therapeutically relevant genomic classifiers. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[3]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[4]  Jeff Mellen,et al.  High-Throughput Droplet Digital PCR System for Absolute Quantitation of DNA Copy Number , 2011, Analytical chemistry.

[5]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[6]  Masahiro Tsuboi,et al.  International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society International Multidisciplinary Classification of Lung Adenocarcinoma , 2011, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[7]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[8]  Wendy S. W. Wong,et al.  Strelka: accurate somatic small-variant calling from sequenced tumor-normal sample pairs , 2012, Bioinform..

[9]  Steven J. M. Jones,et al.  Comprehensive genomic characterization of squamous cell lung cancers , 2012, Nature.

[10]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[11]  P. Marcus,et al.  Use of Lung Cancer Screening Tests in the United States: Results from the 2010 National Health Interview Survey , 2012, Cancer Epidemiology, Biomarkers & Prevention.

[12]  D. Aberle,et al.  Results of initial low-dose computed tomographic screening for lung cancer. , 2013, The New England journal of medicine.

[13]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[14]  Ahmedin Jemal,et al.  Annual number of lung cancer deaths potentially avertable by screening in the United States , 2013, Cancer.

[15]  A. Sivachenko,et al.  Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples , 2013, Nature Biotechnology.

[16]  V. Moyer Screening for Lung Cancer: U.S. Preventive Services Task Force Recommendation Statement , 2014, Annals of Internal Medicine.

[17]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[18]  E. Kazerooni,et al.  Performance of Lung-RADS in the National Lung Screening Trial , 2015, Annals of Internal Medicine.

[19]  B. Ebert,et al.  Clonal hematopoiesis of indeterminate potential and its distinction from myelodysplastic syndromes. , 2015, Blood.

[20]  V. Wong,et al.  Lengthening and shortening of plasma DNA in hepatocellular carcinoma patients , 2015, Proceedings of the National Academy of Sciences.

[21]  Ash A. Alizadeh,et al.  Integrated digital error suppression for improved detection of circulating tumor DNA , 2016, Nature Biotechnology.

[22]  B. Taylor,et al.  deconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution , 2016, Genome Biology.

[23]  P. Hainaut,et al.  Somatic TP53 Mutations in the Era of Genome Sequencing. , 2016, Cold Spring Harbor perspectives in medicine.

[24]  Universal Patterns of Selection in Cancer and Somatic Tissues , 2017, Cell.

[25]  Ash A. Alizadeh,et al.  Early Detection of Molecular Residual Disease in Localized Lung Cancer by Circulating Tumor DNA Profiling. , 2017, Cancer Discovery.

[26]  Sam Angiuoli,et al.  Direct detection of early-stage cancers using circulating tumor DNA , 2017, Science Translational Medicine.

[27]  Ashwini Naik,et al.  Phylogenetic ctDNA analysis depicts early stage lung cancer evolution , 2017, Nature.

[28]  Ahmedin Jemal,et al.  Lung Cancer Screening With Low-Dose Computed Tomography in the United States—2010 to 2015 , 2017, JAMA oncology.

[29]  Yue Han,et al.  AfterQC: automatic filtering, trimming, error removing and quality control for fastq data , 2017, BMC Bioinformatics.

[30]  Keval Patel,et al.  Enhanced detection of circulating tumor DNA by fragment size analysis , 2018, Science Translational Medicine.

[31]  Ludmila V. Danilova,et al.  Detection and localization of surgically resectable cancers with a multi-analyte blood test , 2018, Science.

[32]  Rajat Singhania,et al.  Sensitive tumour detection and classification using plasma cell-free DNA methylomes , 2018, Nature.

[33]  M. Ladanyi,et al.  Prevalence of Clonal Hematopoiesis Mutations in Tumor-Only Clinical Genomic Profiling of Solid Tumors , 2018, JAMA oncology.

[34]  Steven J. M. Jones,et al.  Comprehensive Characterization of Cancer Driver Genes and Mutations , 2018, Cell.

[35]  PIWI proteins play oncogenic functions in pancreatic cancer. , 2019 .

[36]  David R. Jones,et al.  High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants , 2019, Nature Medicine.

[37]  H. Nielsen,et al.  Genome-wide cell-free DNA fragmentation in patients with cancer , 2019, Nature.

[38]  Ash A. Alizadeh,et al.  Dynamic Risk Profiling Using Serial Tumor Biomarkers for Personalized Outcome Prediction , 2019, Cell.

[39]  1PPIWI proteins play oncogenic functions in pancreatic cancer. , 2019, Annals of oncology : official journal of the European Society for Medical Oncology.

[40]  M. Ye,et al.  Biological background of the genomic variations of cf-DNA in healthy individuals , 2019, Annals of oncology : official journal of the European Society for Medical Oncology.

[41]  Harry J de Koning,et al.  Reduced Lung-Cancer Mortality with Volume CT Screening in a Randomized Trial. , 2020, The New England journal of medicine.

[42]  Ash A. Alizadeh,et al.  Circulating tumor DNA dynamics predict benefit from consolidation immunotherapy in locally advanced non-small-cell lung cancer , 2020, Nature Cancer.