Bicocca Open Archive

Beretta, S., Bonizzoni, P., Della Vedova, G., Pirola, Y., Rizzi, R. (2014). Modeling Alternative Splicing Variants from RNA-Seq Data with Isoform Graphs. Journal of Computational Biology, 21(1), 16-40 [10.1089/cmb.2013.0112].

Modeling Alternative Splicing Variants from RNA-Seq Data with Isoform Graphs

Beretta, Stefano; Bonizzoni, Paola; Della Vedova, Gianluca; Pirola, Yuri; Rizzi, Raffaella

2014

Abstract

Next-generation sequencing (NGS) technologies call for new methodologies for alternative splicing (AS) analysis. Current computational methods for AS analysis from NGS data are mainly based on aligning short reads against a reference genome, while methods that do not need a reference genome remain largely underdeveloped. In this context, the main tools developed for NGS data focus on de novo transcriptome assembly (Grabherr et al., 2011; Schulz et al., 2012). Although these tools are widely applied in biological investigations, and their results often reveal intrinsic shortcomings, a theoretical investigation of the inherent computational limits of transcriptome analysis from NGS data, when a reference genome is unknown or highly unreliable, is still missing. Moreover, methods for computing the gene structures due to AS events under these assumptions are still lacking, a problem that we start to tackle in this article. More precisely, based on the notion of isoform graph (Lacroix et al., 2008), we define a compact representation of gene structures, called the splicing graph, and investigate the computational problem of building a splicing graph that is (i) compatible with NGS data and (ii) isomorphic to the isoform graph. We characterize when there is only one representative splicing graph compatible with the input data, and we propose an efficient algorithmic approach to compute this graph.
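To make the graph representation mentioned in the abstract more concrete, the following is a minimal illustrative sketch, not the algorithm proposed in the article. It assumes that each isoform is already known and given as an ordered list of block identifiers, whereas the article addresses the harder problem of reconstructing the graph from NGS reads. The function name `build_block_graph` and the example gene are hypothetical.

```python
from collections import defaultdict


def build_block_graph(isoforms):
    """Build a directed graph over sequence blocks.

    Assumption (for illustration only): an edge (u, v) is added when
    block u is immediately followed by block v in at least one isoform.
    Returns a dict mapping each block to a sorted list of successors.
    """
    graph = defaultdict(set)
    for isoform in isoforms:
        for block in isoform:
            _ = graph[block]  # register the node even if it has no successor
        for u, v in zip(isoform, isoform[1:]):
            graph[u].add(v)
    return {node: sorted(successors) for node, successors in graph.items()}


# Hypothetical gene with four blocks; the second isoform skips block "B".
isoforms = [["A", "B", "C", "D"], ["A", "C", "D"]]
graph = build_block_graph(isoforms)
for node, successors in graph.items():
    print(node, "->", successors)
```

In this toy example the skipped block shows up as two outgoing edges from `A`, which is the kind of branching a graph of gene structures is meant to capture.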

Subtype: Journal article - Scientific article
Keywords: alternative splicing; splicing graph
Content language: English
Publication date: 2014
Journal: Journal of Computational Biology
Volume: 21
Issue: 1
First page: 16
Last page: 40
Article DOI: https://dx.doi.org/10.1089/cmb.2013.0112
Full text: open
Appears in collections: 01 - Journal article

Files in this item:

File	Description	Access	Size	Format
journ-art-14-jcb.pdf	Main article	Open access	929.03 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/49820
