A novelty detection model can be seen as a supervised classifier, trained on a fully labeled training set, that allows for the presence of new classes in the test set not previously observed among the training units. When dealing with functional data, this requires learning the main patterns for the curves in the known classes, whilst being able to isolate signals that possess distinctive characteristics in the unlabeled set. In order to tackle this challenging problem, we propose a two-stage Bayesian semi-parametric novelty detector [2]. In the first stage, robust estimates are extracted from the training set via the Minimum Regularized Covariance Determinant (MRCD) estimator [1]. In the second stage, such information is employed to elicit informative priors within a Bayesian mixture of known groups plus a novelty term. To reflect the lack of knowledge on the latter component, we resort to a Dirichlet Process mixture model, thus overcoming the problematic a priori specification of the expected number of novelties that may be present in the test set. The described methodology is applied to a spectroscopic dataset within a food authenticity study.
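As a rough sketch of the structure just described, and not necessarily the exact specification used by the authors, the second-stage model can be read as a mixture of $J$ known-group components plus one novelty component that receives a Dirichlet Process prior. The symbols $\pi_j$, $\Theta_j$, $f$, $\gamma$, $H$ and $\hat{\Theta}_j^{\mathrm{MRCD}}$ are notation assumed here for illustration; none of them appears in the abstract.

```latex
% Assumed notation: J known classes, y a curve from the test set,
% f a generic kernel density, Theta_j the parameters of known class j.
\begin{align*}
  y \mid \boldsymbol{\pi}, \Theta, G
    &\sim \pi_0\, f_{\mathrm{nov}}(y \mid G)
      + \sum_{j=1}^{J} \pi_j\, f(y \mid \Theta_j),
    \qquad \sum_{j=0}^{J} \pi_j = 1, \\
  f_{\mathrm{nov}}(y \mid G)
    &= \int f(y \mid \Theta)\, G(\mathrm{d}\Theta),
    \qquad G \sim \mathrm{DP}(\gamma, H), \\
  \Theta_j
    &\sim p\bigl(\Theta_j \mid \hat{\Theta}_j^{\mathrm{MRCD}}\bigr),
    \qquad j = 1, \dots, J.
\end{align*}
```

Here $\hat{\Theta}_j^{\mathrm{MRCD}}$ stands for the robust first-stage estimates for class $j$, which the informative priors are built around. Because a draw $G$ from a Dirichlet Process is almost surely discrete, the novelty term can split into several new groups without their number being fixed in advance.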
Denti, F., Cappozzo, A., Greselin, F. (2022). Outlier and Novelty Detection for Functional Data: a Semiparametric Bayesian Approach. In Classification and Data Science in the Digital Age - Book of Abstracts IFCS 2022 (p. 42). Printed in Portugal by Instituto Nacional de Estatística.
Outlier and Novelty Detection for Functional Data: a Semiparametric Bayesian Approach
Francesco Denti; Andrea Cappozzo; Francesca Greselin
2022
File | Description | Attachment type | Access | Size | Format
---|---|---|---|---|---
IFCS2022 BoA ISBN DCG Outlier and Novelty Detection for Functional Data a Semiparametric Bayesian Approach .pdf | Book of Abstracts | Publisher's Version (Version of Record, VoR) | Open access | 845.49 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.