Bicocca Open Archive

Mezzanzanica, M., Boselli, R., Cesarini, M., Mercorio, F. (2012). Data Quality Sensitivity Analysis on Aggregate Indicators. In DATA 2012 - Proceedings of the International Conference on Data Technologies and Applications (pp.97-108). SciTePress [10.5220/0004040300970108].

Data Quality Sensitivity Analysis on Aggregate Indicators

Mezzanzanica, Mario; Boselli, Roberto; Cesarini, Mirko; Mercorio, Fabio

2012

Abstract

Decision-making activities stress data and information quality requirements. The quality of data sources is frequently very poor, so a cleansing process is required before such data can be used for decision making. When alternative (and more trusted) data sources are not available, data can be cleansed only by using business rules derived from domain knowledge. Business rules focus on fixing inconsistencies, but an inconsistency can be cleansed in different ways (i.e., the correction may be non-deterministic), so the choice of how to cleanse data can affect, even strongly, the aggregate values computed for decision-making purposes. The paper proposes a methodology that exploits Finite State Systems to quantitatively estimate how computed variables and indicators might be affected by the uncertainty related to low data quality, independently of the data cleansing methodology used. The methodology has been implemented and tested on a real case scenario, providing effective results.
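
The idea that one inconsistency admits several valid corrections, each yielding a different aggregate value, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two-state system (`idle`/`working`), the `start`/`end` event model, the "days worked" indicator, and all function names are hypothetical. The sketch enumerates every consistent repair of a sequence with a missing `end` event and reports the range the indicator can take.

```python
from dataclasses import dataclass

# Hypothetical two-state system: a worker is either "idle" or "working".
TRANSITIONS = {
    ("idle", "start"): "working",
    ("working", "end"): "idle",
}


@dataclass(frozen=True)
class Event:
    day: int
    kind: str  # "start" or "end"


def is_consistent(events):
    """Return True if the sequence only follows allowed transitions."""
    state = "idle"
    for ev in events:
        key = (state, ev.kind)
        if key not in TRANSITIONS:
            return False
        state = TRANSITIONS[key]
    return True


def candidate_repairs(events):
    """Yield each consistent sequence obtained by inserting one missing 'end'."""
    for i in range(1, len(events)):
        prev, nxt = events[i - 1], events[i]
        if prev.kind == "start" and nxt.kind == "start":
            # The missing 'end' may fall on any day up to the next 'start'.
            for day in range(prev.day + 1, nxt.day + 1):
                repaired = events[:i] + [Event(day, "end")] + events[i:]
                if is_consistent(repaired):
                    yield repaired


def days_worked(events):
    """Illustrative aggregate indicator: total days between start/end pairs."""
    total, started = 0, None
    for ev in events:
        if ev.kind == "start":
            started = ev.day
        else:
            total += ev.day - started
            started = None
    return total


def indicator_range(events):
    """Smallest and largest indicator value over all candidate repairs."""
    values = [days_worked(r) for r in candidate_repairs(events)]
    return min(values), max(values)


events = [Event(0, "start"), Event(10, "start"), Event(30, "end")]
print(is_consistent(events))    # False: two consecutive 'start' events
print(indicator_range(events))  # (21, 30)
```

In this toy case the indicator varies between 21 and 30 days depending only on where the missing event is placed, which is the kind of spread a sensitivity analysis would need to quantify.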

Contribution type: paper
Keywords: Data Quality, Data Cleansing, Sensitivity Analysis, Inconsistent Databases, Aggregate Indicators, Uncertainty Assessment
Content language: English
Conference name: International Conference on Data Technologies and Applications (DATA), 25-27 July
Conference year: 2012
Proceedings title: DATA 2012 - Proceedings of the International Conference on Data Technologies and Applications
Proceedings ISBN: 978-989-8565-18-1
Publication date: 2012
First page: 97
Last page: 108
DOI: https://dx.doi.org/10.5220/0004040300970108
Alternative URL: www.dataconference.org
Full text: open
Appears in types: 02 - Conference contribution

Files in this item:

File	Access	Attachment type	Size	Format
data2012cr.pdf	open access	Author’s Accepted Manuscript, AAM (Post-print)	216.15 kB	Adobe PDF

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/36311
