In this paper we present a preliminary report on a domain independent strategy to reduce duplicated records by means of the knowledge stored in the schema. According to different kinds of relationships, we propose specific techniques to build and compare the knowledge networks by means of graph-based similarity techniques

Maurino, A., Li, P. (2009). Schema based deduplication. In Proceedings of the 2009 International Conference on Information Quality, ICIQ 2009 (pp.1-12).

Schema based deduplication

Maurino, A
;
Li, P
2009

Abstract

In this paper we present a preliminary report on a domain independent strategy to reduce duplicated records by means of the knowledge stored in the schema. According to different kinds of relationships, we propose specific techniques to build and compare the knowledge networks by means of graph-based similarity techniques
poster + paper
schema based deduplication, record linkage
English
International Conference on Information Quality, ICIQ 2009 7-8 November
2009
Proceedings of the 2009 International Conference on Information Quality, ICIQ 2009
2009
1
12
none
Maurino, A., Li, P. (2009). Schema based deduplication. In Proceedings of the 2009 International Conference on Information Quality, ICIQ 2009 (pp.1-12).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/11916
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact