A novelty detection model can be seen as a supervised classifier, trained on a fully- labeled training set, that allows for the presence of new classes in the test set not previously observed among the training units. When dealing with functional data, this requires learning the main patterns for the curves in the known classes, whilst being able to isolate signals that possess distinctive characteristics in the unlabeled set. In order to tackle this challenging problem, we propose a two-stage Bayesian semi-parametric novelty detector [2]. In the first stage, robust estimates are extracted from the training set via the Minimum Regularized Covariance Determinant (MRCD) estimator [1]. In the second stage, such information is employed to elicit informative priors within a Bayesian mixture of known groups plus a novelty term. To reflect the lack of knowledge on the latter component, we resort to a Dirichlet Process mixture model, thus overcoming the problematic a-priori specification of the expected number of novelties that may be present in the test set. The described methodology is applied to a spectroscopic dataset within a food authenticity study.

Denti, F., Cappozzo, A., Greselin, F. (2022). Outlier and Novelty Detection for Functional Data: a Semiparametric BayesianApproach. In Classification and Data Science in the Digital Age - Book of Abstracts IFCS 2022 (pp. 42-42). Printed in Portugal by Instituto Nacional de Estatística.

Outlier and Novelty Detection for Functional Data: a Semiparametric BayesianApproach

Francesco Denti;Andrea Cappozzo;Francesca Greselin
2022

Abstract

A novelty detection model can be seen as a supervised classifier, trained on a fully- labeled training set, that allows for the presence of new classes in the test set not previously observed among the training units. When dealing with functional data, this requires learning the main patterns for the curves in the known classes, whilst being able to isolate signals that possess distinctive characteristics in the unlabeled set. In order to tackle this challenging problem, we propose a two-stage Bayesian semi-parametric novelty detector [2]. In the first stage, robust estimates are extracted from the training set via the Minimum Regularized Covariance Determinant (MRCD) estimator [1]. In the second stage, such information is employed to elicit informative priors within a Bayesian mixture of known groups plus a novelty term. To reflect the lack of knowledge on the latter component, we resort to a Dirichlet Process mixture model, thus overcoming the problematic a-priori specification of the expected number of novelties that may be present in the test set. The described methodology is applied to a spectroscopic dataset within a food authenticity study.
Capitolo o saggio
bayesian mixture model, dirichlet process mixture model, functional data, minimum regularized covariance determinant;
English
Classification and Data Science in the Digital Age - Book of Abstracts IFCS 2022
2022
978-989-98955-9-1
Printed in Portugal by Instituto Nacional de Estatística
42
42
Denti, F., Cappozzo, A., Greselin, F. (2022). Outlier and Novelty Detection for Functional Data: a Semiparametric BayesianApproach. In Classification and Data Science in the Digital Age - Book of Abstracts IFCS 2022 (pp. 42-42). Printed in Portugal by Instituto Nacional de Estatística.
open
File in questo prodotto:
File Dimensione Formato  
IFCS2022 BoA ISBN DCG Outlier and Novelty Detection for Functional Data a Semiparametric Bayesian Approach .pdf

accesso aperto

Descrizione: Book of Abstracts
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 845.49 kB
Formato Adobe PDF
845.49 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/389286
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact