Bicocca Open Archive

Three important issues are often encountered in Supervised and Semi-Supervised Classification: class memberships are unreliable for some training units (label noise), a proportion of observations might depart from the main structure of the data (outliers) and new groups in the test set may have not been encountered earlier in the learning phase (unobserved classes). The present work introduces a robust and adaptive Discriminant Analysis rule, capable of handling situations in which one or more of the aforementioned problems occur. Two EM-based classifiers are proposed: the first one that jointly exploits the training and test sets (transductive approach), and the second one that expands the parameter estimation using the test set, to complete the group structure learned from the training set (inductive approach). Experiments on synthetic and real data, artificially adulterated, are provided to underline the benefits of the proposed method.

Cappozzo, A., Greselin, F., Murphy, T. (2020). Anomaly and Novelty detection for robust semi-supervised learning. STATISTICS AND COMPUTING, 30(5), 1545-1571 [10.1007/s11222-020-09959-1].

Anomaly and Novelty detection for robust semi-supervised learning

Cappozzo, A^Primo;Greselin, F;Murphy, TB

2020

Abstract

Three important issues are often encountered in Supervised and Semi-Supervised Classification: class memberships are unreliable for some training units (label noise), a proportion of observations might depart from the main structure of the data (outliers) and new groups in the test set may have not been encountered earlier in the learning phase (unobserved classes). The present work introduces a robust and adaptive Discriminant Analysis rule, capable of handling situations in which one or more of the aforementioned problems occur. Two EM-based classifiers are proposed: the first one that jointly exploits the training and test sets (transductive approach), and the second one that expands the parameter estimation using the test set, to complete the group structure learned from the training set (inductive approach). Experiments on synthetic and real data, artificially adulterated, are provided to underline the benefits of the proposed method.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Impartial trimming; Inductive inference; Label noise; Model-based classification; Novelty detection; Outliers detection; Robust estimation; Transductive inference; Unobserved classes;
			
	Parole chiave
	
				Impartial Trimming ; Inductive Inference ; Label Noise ; Model - Based Classification ; Novelty Detection ; Outliers Detection ; Robust Estimation ; Transductive Inference ; Unobserved Classes
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				30-giu-2020
			
	Data di pubblicazione
	
				2020
			
	Rivista
	
				STATISTICS AND COMPUTING
			
	Numero del volume
	
				30
			
	Fascicolo
	
				5
			
	Pagina iniziale
	
				1545
			
	Pagina finale
	
				1571
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1007/s11222-020-09959-1
			
	URL alternativo
	
				https://link.springer.com/article/10.1007/s11222-020-09959-1
			
	Fulltext
	
				none
			
	Citazione
	
				Cappozzo, A., Greselin, F., Murphy, T. (2020). Anomaly and Novelty detection for robust semi-supervised learning. STATISTICS AND COMPUTING, 30(5), 1545-1571 [10.1007/s11222-020-09959-1].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/276200

Citazioni

18

15

Social impact