Many data sets contain temporal records which span a long period of time; each record is associated with a time stamp and describes some aspects of a real-world entity at a particular time (e. g., author information in DBLP). In such cases, we often wish to identify records that describe the same entity over time and so be able to perform interesting longitudinal data analysis. However, existing record linkage techniques ignore temporal information and fall short for temporal data. This article studies linking temporal records. First, we apply time decay to capture the effect of elapsed time on entity value evolution. Second, instead of comparing each pair of records locally, we propose clustering methods that consider the time order of the records and make global decisions. Experimental results show that our algorithms significantly outperform traditional linkage methods on various temporal data sets. © 2012 Higher Education Press and Springer-Verlag Berlin Heidelberg.

Li, P., Dong, X., Maurino, A., Srivastava, D. (2012). Linking temporal records. FRONTIERS OF COMPUTER SCIENCE, 6(3), 293-312 [10.1007/s11704-012-2002-5].

Linking temporal records

LI, PEI;MAURINO, ANDREA;
2012

Abstract

Many data sets contain temporal records which span a long period of time; each record is associated with a time stamp and describes some aspects of a real-world entity at a particular time (e. g., author information in DBLP). In such cases, we often wish to identify records that describe the same entity over time and so be able to perform interesting longitudinal data analysis. However, existing record linkage techniques ignore temporal information and fall short for temporal data. This article studies linking temporal records. First, we apply time decay to capture the effect of elapsed time on entity value evolution. Second, instead of comparing each pair of records locally, we propose clustering methods that consider the time order of the records and make global decisions. Experimental results show that our algorithms significantly outperform traditional linkage methods on various temporal data sets. © 2012 Higher Education Press and Springer-Verlag Berlin Heidelberg.
Articolo in rivista - Articolo scientifico
temporal record linkage
English
2012
6
3
293
312
none
Li, P., Dong, X., Maurino, A., Srivastava, D. (2012). Linking temporal records. FRONTIERS OF COMPUTER SCIENCE, 6(3), 293-312 [10.1007/s11704-012-2002-5].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/37780
Citazioni
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 2
Social impact