In many research fields, there is, nowadays, a lot of readily available information, however, it needs processing. This is the case of the field of Quantitative Structure-Activity Relationships (QSAR), which exploits several thousand molecular descriptors, and quality control and multivariate calibration where hundreds of spectroscopic signals are easily obtained from spectroscopic methods. Genetic Algorithms, Simulated Annealing, and Tabu Search are some of the methods that are widely used to process available information to find sets of optimal models. In this case, the problem that arises is how to compare the selected models. This work proposes a new measure of the distance between two models, and we will demonstrate that this model distance allows clusters of similar models to be found and the most diverse models to be caught in such a way as to preserve maximum information and diversity. © 2003 Elsevier B.V. All rights reserved.
Todeschini, R., Consonni, V., Pavan, M. (2004). A Distance Measure between Models: a Tool for Similarity/Diversity Analsysis of Model Populations. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 70, 55-61 [10.1016/j.chemolab.2003.10.003].
A Distance Measure between Models: a Tool for Similarity/Diversity Analsysis of Model Populations
TODESCHINI, ROBERTO;CONSONNI, VIVIANA;
2004
Abstract
In many research fields, there is, nowadays, a lot of readily available information, however, it needs processing. This is the case of the field of Quantitative Structure-Activity Relationships (QSAR), which exploits several thousand molecular descriptors, and quality control and multivariate calibration where hundreds of spectroscopic signals are easily obtained from spectroscopic methods. Genetic Algorithms, Simulated Annealing, and Tabu Search are some of the methods that are widely used to process available information to find sets of optimal models. In this case, the problem that arises is how to compare the selected models. This work proposes a new measure of the distance between two models, and we will demonstrate that this model distance allows clusters of similar models to be found and the most diverse models to be caught in such a way as to preserve maximum information and diversity. © 2003 Elsevier B.V. All rights reserved.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.