Gene annotations are a key concept in bioinformatics and computational methods able to predict them are a fundamental contribution to the field. Several machine learning algorithms are available in this domain; they include relevant parameters that might influence the output list of predicted gene annotations. The amount that the variation of these key parameters affect the output gene annotation lists remains an open aspect to be evaluated. Here, we provide support for such evaluation by introducing two list correlation measures; they are based on and extend the Spearman ρ correlation coefficient and Kendall τ distance, respectively. The application of these measures to some gene annotation lists, predicted from Gene Ontology annotation datasets of different organisms’ genes, showed interesting patterns between the predicted lists. Additionally, they allowed expressing some useful considerations about the prediction parameters and algorithms used.

Chicco, D., Ciceri, E., Masseroli, M. (2015). Extended spearman and Kendall coefficients for gene annotation list correlation. In C. DI Serio, P. Liò, A. Nonis, R. Tagliaferri (a cura di), Computational Intelligence Methods for Bioinformatics and Biostatistics. 11th International Meeting, CIBB 2014, Cambridge, UK, June 26-28, 2014, Revised Selected Papers (pp. 19-32). Springer Verlag [10.1007/978-3-319-24462-4_2].

Extended spearman and Kendall coefficients for gene annotation list correlation

Chicco, D
Primo
;
2015

Abstract

Gene annotations are a key concept in bioinformatics and computational methods able to predict them are a fundamental contribution to the field. Several machine learning algorithms are available in this domain; they include relevant parameters that might influence the output list of predicted gene annotations. The amount that the variation of these key parameters affect the output gene annotation lists remains an open aspect to be evaluated. Here, we provide support for such evaluation by introducing two list correlation measures; they are based on and extend the Spearman ρ correlation coefficient and Kendall τ distance, respectively. The application of these measures to some gene annotation lists, predicted from Gene Ontology annotation datasets of different organisms’ genes, showed interesting patterns between the predicted lists. Additionally, they allowed expressing some useful considerations about the prediction parameters and algorithms used.
Capitolo o saggio
Biomolecular annotations; Kendall distance; Spearman coefficient; Top-K queries
English
Computational Intelligence Methods for Bioinformatics and Biostatistics. 11th International Meeting, CIBB 2014, Cambridge, UK, June 26-28, 2014, Revised Selected Papers
DI Serio, C; Liò, P; Nonis, A; Tagliaferri, R
18-nov-2015
2015
9783319244617
8623
Springer Verlag
19
32
Chicco, D., Ciceri, E., Masseroli, M. (2015). Extended spearman and Kendall coefficients for gene annotation list correlation. In C. DI Serio, P. Liò, A. Nonis, R. Tagliaferri (a cura di), Computational Intelligence Methods for Bioinformatics and Biostatistics. 11th International Meeting, CIBB 2014, Cambridge, UK, June 26-28, 2014, Revised Selected Papers (pp. 19-32). Springer Verlag [10.1007/978-3-319-24462-4_2].
reserved
File in questo prodotto:
File Dimensione Formato  
Chicco-2015-CIBB-VoR.pdf

Solo gestori archivio

Descrizione: Contributo in libro
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 290.42 kB
Formato Adobe PDF
290.42 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/433778
Citazioni
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 6
Social impact