Several similarity/diversity measures for data mining, chemometrics, and chemoinformatics are presented and discussed toward the different data they are applied to. After a short presentation of the axioms for dissimilarity and similarity functions, their relationships and the required data pretreatment, the theoretical definitions and formulas of distance and similarity measures for real‐valued, binary, ranked, frequency, and mixed‐type data are provided along with the main concepts on distances between sets and meta‐distances. Simple examples of calculation are given and extended comparisons are performed on the distances defined for real valued and binary data.
Todeschini, R., Ballabio, D., Consonni, V. (2020). Distances and Similarity Measures in Chemometrics and Chemoinformatics. In Encyclopedia of Analytical Chemistry (pp. 1-40). R.A. Meyers [10.1002/9780470027318.a9438.pub2].
Distances and Similarity Measures in Chemometrics and Chemoinformatics
Todeschini, R;Ballabio, D;Consonni, V
2020
Abstract
Several similarity/diversity measures for data mining, chemometrics, and chemoinformatics are presented and discussed toward the different data they are applied to. After a short presentation of the axioms for dissimilarity and similarity functions, their relationships and the required data pretreatment, the theoretical definitions and formulas of distance and similarity measures for real‐valued, binary, ranked, frequency, and mixed‐type data are provided along with the main concepts on distances between sets and meta‐distances. Simple examples of calculation are given and extended comparisons are performed on the distances defined for real valued and binary data.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.