Chemometrics plays a fundamental role in quantitative structure-activity relationships (QSARs) and quantitative structure-property relationships (QSPRs) methods, which aim at empirically linking the molecular structure of chemicals to experimentally-measurable properties. In fact, chemometrics statistics and chemoinformatics are the basic tools for finding meaningful mathematical relationships between the molecular structure and biological, physico-chemical, toxicological and environmental properties of chemicals. The key elements of QSAR/QSPR are molecular descriptors, which are numerical indices encoding information related to the structure of chemicals and are used as independent variables in the subsequent modeling, thus connecting the QSAR approaches to the multivariate and chemometric world. In this article, historical and novel QSAR modeling approaches are presented. After describing the historical background of QSAR and the classical approaches, molecular descriptors are illustrated, with state-of-the-art as well as novel description methodologies. Additionally, the key principles of QSAR are introduced, along with specific elements of the QSAR modeling workflow, e.g., variable reduction and selection, similarity-based approaches, validation, the definition of applicability domain and consensus modeling.
Todeschini, R., Consonni, V., Ballabio, D., Grisoni, F. (2020). 4.25 - Chemometrics for QSAR Modeling. In S. Brown, R. Tauler, B. Walczak (a cura di), Comprehensive Chemometrics: Chemical and Biochemical Data Analysis, Second Edition: Four Volume Set (pp. 599-634). Elsevier [10.1016/B978-0-12-409547-2.14703-1].
4.25 - Chemometrics for QSAR Modeling
Todeschini, Roberto;Consonni, Viviana;Ballabio, Davide;
2020
Abstract
Chemometrics plays a fundamental role in quantitative structure-activity relationships (QSARs) and quantitative structure-property relationships (QSPRs) methods, which aim at empirically linking the molecular structure of chemicals to experimentally-measurable properties. In fact, chemometrics statistics and chemoinformatics are the basic tools for finding meaningful mathematical relationships between the molecular structure and biological, physico-chemical, toxicological and environmental properties of chemicals. The key elements of QSAR/QSPR are molecular descriptors, which are numerical indices encoding information related to the structure of chemicals and are used as independent variables in the subsequent modeling, thus connecting the QSAR approaches to the multivariate and chemometric world. In this article, historical and novel QSAR modeling approaches are presented. After describing the historical background of QSAR and the classical approaches, molecular descriptors are illustrated, with state-of-the-art as well as novel description methodologies. Additionally, the key principles of QSAR are introduced, along with specific elements of the QSAR modeling workflow, e.g., variable reduction and selection, similarity-based approaches, validation, the definition of applicability domain and consensus modeling.File | Dimensione | Formato | |
---|---|---|---|
Todeschini-2020b.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Dimensione
1.84 MB
Formato
Adobe PDF
|
1.84 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.