Algorithmic imputation techniques for missing
data: performance comparisons and
development perspectives

Solaro, N; Barbiero, A; Manzi, G; Ferrari, P

In recent years, much research has been devoted to solve the problem of missing data imputation. Although most of the novel proposals look attractive for some reason, less attention has been paid to the problem of when and why a particular method should be chosen while discarding the others. This matter is far crucial in applications, given that unsuitable solutions could heavily affect the reliability of statistical analyses. Starting from this, this work is addressed to study how well several algorithmic-type imputation methods perform in the case of quantitative data. We focus on three different logics of imputing, based respectively on the use of random forests, iterative PCA, and the forward procedure. In particular, the latter, having initially been introduced for ordinal data, has required us to develop an original adaptation so that it handles missing quantitative values.

Solaro, N., Barbiero, A., Manzi, G., Ferrari, P. (2012). Algorithmic imputation techniques for missing data: performance comparisons and development perspectives. In Analysis and Modeling of Complex Data in Behavioural and Social Sciences - Book of Short Papers (pp.1-4). Padova : CLEUP.

Algorithmic imputation techniques for missing data: performance comparisons and development perspectives

SOLARO, NADIA;Barbiero, A;Manzi, G;Ferrari, PA

2012

Abstract

In recent years, much research has been devoted to solve the problem of missing data imputation. Although most of the novel proposals look attractive for some reason, less attention has been paid to the problem of when and why a particular method should be chosen while discarding the others. This matter is far crucial in applications, given that unsuitable solutions could heavily affect the reliability of statistical analyses. Starting from this, this work is addressed to study how well several algorithmic-type imputation methods perform in the case of quantitative data. We focus on three different logics of imputing, based respectively on the use of random forests, iterative PCA, and the forward procedure. In particular, the latter, having initially been introduced for ordinal data, has required us to develop an original adaptation so that it handles missing quantitative values.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				multivariate exponential power distribution, multivariate skew-normal
distribution, nearest neighbour, principal component analysis, random forest
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				Convegno JCS - CLADAG 2012, "Analysis and Modeling of Complex Data in Behavioural and Social Sciences"
			
	Anno del convegno
	
				2012
			
	Curatori della monografia
	
				Okada, A; Vicari, D; Ragozini, G
			
	Titolo degli atti
	
				Analysis and Modeling of Complex Data in Behavioural and Social Sciences - Book of Short Papers
			
	ISBN del volume degli atti
	
				978-88-6129-916-0
			
	Data di pubblicazione
	
				ago-2012
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				4
			
	URL alternativo
	
				http://www.jcs-cladag12.tk/
			
	Fulltext
	
				none
			
	Citazione
	
				Solaro, N., Barbiero, A., Manzi, G., Ferrari, P. (2012). Algorithmic imputation techniques for missing
data: performance comparisons and
development perspectives. In Analysis and Modeling of Complex Data in Behavioural and Social Sciences - Book of Short Papers (pp.1-4). Padova : CLEUP.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/36518

Citazioni

ND

ND

Bicocca Open Archive

Algorithmic imputation techniques for missing data: performance comparisons and development perspectives

SOLARO, NADIA;Barbiero, A;Manzi, G;Ferrari, PA

2012

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

Social impact

Bicocca Open Archive

Algorithmic imputation techniques for missing data: performance comparisons and development perspectives

SOLARO, NADIA;Barbiero, A;Manzi, G;Ferrari, PA

2012

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Citazioni

Social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)