Copy-number aberrations (CNAs) and whole-genome duplications (WGDs) are frequent somatic mutations in cancer but their quantification from DNA sequencing of bulk tumor samples is challenging. Standard methods for CNA inference analyze tumor samples individually; however, DNA sequencing of multiple samples from a cancer patient has recently become more common. We introduce HATCHet (Holistic Allele-specific Tumor Copy-number Heterogeneity), an algorithm that infers allele- and clone-specific CNAs and WGDs jointly across multiple tumor samples from the same patient. We show that HATCHet outperforms current state-of-the-art methods on multi-sample DNA sequencing data that we simulate using MASCoTE (Multiple Allele-specific Simulation of Copy-number Tumor Evolution). Applying HATCHet to 84 tumor samples from 14 prostate and pancreas cancer patients, we identify subclonal CNAs and WGDs that are more plausible than previously published analyses and more consistent with somatic single-nucleotide variants (SNVs) and small indels in the same samples.

Zaccaria, S., Raphael, B. (2020). Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data. NATURE COMMUNICATIONS, 11(1) [10.1038/s41467-020-17967-y].

Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data

Zaccaria S.;
2020

Abstract

Copy-number aberrations (CNAs) and whole-genome duplications (WGDs) are frequent somatic mutations in cancer but their quantification from DNA sequencing of bulk tumor samples is challenging. Standard methods for CNA inference analyze tumor samples individually; however, DNA sequencing of multiple samples from a cancer patient has recently become more common. We introduce HATCHet (Holistic Allele-specific Tumor Copy-number Heterogeneity), an algorithm that infers allele- and clone-specific CNAs and WGDs jointly across multiple tumor samples from the same patient. We show that HATCHet outperforms current state-of-the-art methods on multi-sample DNA sequencing data that we simulate using MASCoTE (Multiple Allele-specific Simulation of Copy-number Tumor Evolution). Applying HATCHet to 84 tumor samples from 14 prostate and pancreas cancer patients, we identify subclonal CNAs and WGDs that are more plausible than previously published analyses and more consistent with somatic single-nucleotide variants (SNVs) and small indels in the same samples.
Articolo in rivista - Articolo scientifico
Breast Neoplasms; Datasets as Topic; DNA Copy Number Variations; Female; Gene Duplication; High-Throughput Nucleotide Sequencing; Humans; INDEL Mutation; Male; Mutation Rate; Neoplasm Metastasis; Pancreatic Neoplasms; Polymorphism, Single Nucleotide; Prostatic Neoplasms; Single-Cell Analysis; Whole Exome Sequencing
English
2020
11
1
4301
open
Zaccaria, S., Raphael, B. (2020). Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data. NATURE COMMUNICATIONS, 11(1) [10.1038/s41467-020-17967-y].
File in questo prodotto:
File Dimensione Formato  
Zaccaria-2020-Nature Communications-VoR.pdf

accesso aperto

Descrizione: CC BY 4.0
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 3.08 MB
Formato Adobe PDF
3.08 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/508686
Citazioni
  • Scopus 57
  • ???jsp.display-item.citation.isi??? 56
Social impact