CNAViz: An interactive webtool for user-guided segmentation of tumor DNA sequencing data

Zaccaria S.; 2022

Abstract

Copy-number aberrations (CNAs) are genetic alterations that amplify or delete large genomic segments, changing their number of copies. CNAs are ubiquitous in cancer and therefore a critical area of current cancer research, but identifying them from DNA sequencing data is challenging: the genome must be partitioned into segments that share the same copy-number state, and such segments may not be contiguous. Existing segmentation algorithms address this challenge either locally, by leveraging information shared among neighboring genomic regions, or globally, by grouping genomic regions affected by similar CNAs across the entire genome. Both approaches have limitations: local segmentation tends to overcluster, while global segmentation can omit clusters corresponding to focal CNAs. Importantly, inaccurate segmentation leads to inaccurate identification of CNAs. For this reason, most pan-cancer research studies rely on manual procedures of quality control and anomaly correction. To improve copy-number segmentation, we introduce CNAViz, a web-based tool that enables the user to perform local and global segmentation simultaneously, thus overcoming the limitations of each approach. Using simulated data, we demonstrate that, by several metrics, CNAViz allows the user to obtain more accurate segmentation than existing local and global segmentation methods. Moreover, we analyze six bulk DNA sequencing samples from three breast cancer patients. By validating against parallel single-cell DNA sequencing data from the same samples, we show that, by using CNAViz, our user was able to obtain more accurate segmentation and improved accuracy in downstream copy-number calling.
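The contrast between local and global segmentation described in the abstract can be illustrated with a small toy sketch. This is not the CNAViz algorithm or any published method: the read-depth values, the tolerance `tol`, and the function names `local_segments` and `global_clusters` are all hypothetical, chosen only to show how two non-contiguous regions with the same signal end up as separate local segments but share one global cluster.

```python
# Toy illustration only; NOT the CNAViz algorithm or any published method.
# Each value is a hypothetical read-depth ratio for one genomic bin,
# listed in genomic order. Two underlying states (~1.0 and ~1.5) each
# appear in two non-contiguous stretches.
rdr = [1.0, 1.02, 0.98, 1.5, 1.52, 1.49, 1.01, 0.99, 1.51, 1.48]


def local_segments(values, tol=0.1):
    """Merge adjacent bins whose values differ by at most `tol`.

    Returns a list of (start, end) index pairs, with `end` exclusive.
    """
    segments = []
    start = 0
    for i in range(1, len(values)):
        if abs(values[i] - values[i - 1]) > tol:
            segments.append((start, i))
            start = i
    segments.append((start, len(values)))
    return segments


def global_clusters(values, tol=0.1):
    """Group bins by value alone, ignoring genomic position.

    Greedy one-pass clustering: a bin joins the first cluster whose
    running mean is within `tol`, otherwise it starts a new cluster.
    Returns one integer cluster label per bin.
    """
    means, counts, labels = [], [], []
    for v in values:
        for k, m in enumerate(means):
            if abs(v - m) <= tol:
                means[k] = (m * counts[k] + v) / (counts[k] + 1)
                counts[k] += 1
                labels.append(k)
                break
        else:
            means.append(v)
            counts.append(1)
            labels.append(len(means) - 1)
    return labels


segments = local_segments(rdr)
labels = global_clusters(rdr)
print("local segments:", segments)
print("global labels: ", labels)
```

In this toy input the local pass reports four segments while the global pass reports two clusters, which mirrors the kind of mismatch a user would need to reconcile when combining the two views.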
Type: Journal article - Scientific article
Keywords: Algorithms; Breast Neoplasms; DNA Copy Number Variations; DNA, Neoplasm; Female; Humans; Neoplasms; Sequence Analysis, DNA
Language: English
Date: 13 Oct 2022
Year: 2022
Volume: 18
Issue: 10
Article number: e1010614
Access: open
Lalani, Z., Chu, G., Hsu, S., Kagawa, S., Xiang, M., Zaccaria, S., et al. (2022). CNAViz: An interactive webtool for user-guided segmentation of tumor DNA sequencing data. PLOS Computational Biology, 18(10) [10.1371/journal.pcbi.1010614].
Files in this record:
File: Lalani-2022-PLoS Computational Biology-VoR.pdf
Access: open access
Description: CC BY 4.0 This is an open access article distributed under the terms of the Creative Commons Attribution License,
Attachment type: Publisher’s Version (Version of Record, VoR)
License: Creative Commons
Size: 2.22 MB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/508659