Genomic annotations describing structural and functional features of genes and gene products through controlled terminologies and ontologies are extremely valuable, especially for computational analyses aimed at inferring new biomedical knowledge, which rely on available annotations. Yet, they are incomplete, especially for recently studied genomes, and only some of available annotations represent highly reliable human curated information. In order to help and speedup the time-consuming curation process and improve available annotations, computational methods able to provide prioritized lists of predicted annotations are paramount. Starting from a previous work on automatic prediction of Gene Ontology annotations based on singular value decomposition (SVD) of gene-to-term annotation matrix, here we propose a novel prediction algorithm that incorporates gene clustering based on gene functional similarity computed on Gene Ontology annotations. We tested both prediction methods performing k-fold cross-validation on two organism genomes, Saccharomyces cerevisiae (SGD) and Drosophila melanogaster (FlyBase). Results demonstrate effectiveness of our approach.

Masseroli, M., Tagliasacchi, M., Chicco, D. (2011). Semantically improved genome-wide prediction of Gene Ontology annotations. In Proceedings of the 11th IEEE International Conference on Intelligent Systems Design and Applications (ISDA 2011) (pp.1080-1085). IEEE [10.1109/ISDA.2011.6121802].

Semantically improved genome-wide prediction of Gene Ontology annotations

Chicco, D
2011

Abstract

Genomic annotations describing structural and functional features of genes and gene products through controlled terminologies and ontologies are extremely valuable, especially for computational analyses aimed at inferring new biomedical knowledge, which rely on available annotations. Yet, they are incomplete, especially for recently studied genomes, and only some of available annotations represent highly reliable human curated information. In order to help and speedup the time-consuming curation process and improve available annotations, computational methods able to provide prioritized lists of predicted annotations are paramount. Starting from a previous work on automatic prediction of Gene Ontology annotations based on singular value decomposition (SVD) of gene-to-term annotation matrix, here we propose a novel prediction algorithm that incorporates gene clustering based on gene functional similarity computed on Gene Ontology annotations. We tested both prediction methods performing k-fold cross-validation on two organism genomes, Saccharomyces cerevisiae (SGD) and Drosophila melanogaster (FlyBase). Results demonstrate effectiveness of our approach.
paper
Annotation prediction; gene similarity metrics; Singular Value Decomposition;
English
2011 11th International Conference on Intelligent Systems Design and Applications, ISDA'11 - 22 November 2011 through 24 November 2011
2011
Proceedings of the 11th IEEE International Conference on Intelligent Systems Design and Applications (ISDA 2011)
9781457716751
2011
1080
1085
6121802
reserved
Masseroli, M., Tagliasacchi, M., Chicco, D. (2011). Semantically improved genome-wide prediction of Gene Ontology annotations. In Proceedings of the 11th IEEE International Conference on Intelligent Systems Design and Applications (ISDA 2011) (pp.1080-1085). IEEE [10.1109/ISDA.2011.6121802].
File in questo prodotto:
File Dimensione Formato  
Masseroli-2011-ISDA'11-VoR.pdf

Solo gestori archivio

Descrizione: Intervento a convegno
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 234.37 kB
Formato Adobe PDF
234.37 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/435463
Citazioni
  • Scopus 10
  • ???jsp.display-item.citation.isi??? ND
Social impact