Quantifying the diversity content of an incidence matrix is a challenge in several scientific fields. The existing indices capture different facets of diversity, so comparing their behaviour is not straightforward. For example, one application of diversity measures involves ensembles of classifiers, which in real applications usually contain missing values. We therefore analysed 14 statistics and, after making them comparable and able to deal with missing values, applied them to more than one hundred incidence matrices in order to examine the relationships among the measures themselves. In particular, we highlighted the importance of the inter-row agreement of factors and the general agreement of incident factors, as well as the influence on the indices of the proportion of missing values and the matrix dimensions, the sensitivity to missing values, the uniform distribution of entries and the invariance to matrix transposition.

Valsecchi, C., Todeschini, R. (2020). Similarity/Diversity Indices on Incidence Matrices Containing Missing Values. MATCH, 83(2), 239-260.

Similarity/Diversity Indices on Incidence Matrices Containing Missing Values

Valsecchi, C.; Todeschini, R.
2020

Journal article - Scientific article
Keywords: Incidence matrix, missing values, diversity, similarity
Language: English
Year: 2020
Volume: 83
Issue: 2
First page: 239
Last page: 260
Access: open
Files in this item:
File: match83n2_239-260.pdf
Access: open access
Description: Main article
Attachment type: Publisher’s Version (Version of Record, VoR)
Size: 901.93 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/262935