In the big data context, it is very frequent to manage the analysis of missing values. This is especially relevant in the field of statistical analysis, where this represents a thorny issue. This study proposes a strategy for data enrichment in presence of sparse matrices. The research objective consists in the evaluation of a possible distinction of behaviour among observations in sparse matrices with missing data. After selecting among the multiple imputation methods, an innovative technique will be presented to impute missing observations as a negative position or a neutral opinion. This method has been applied to a dataset measuring the interaction between users and social network pages for some Italian newspapers.

Mariani, P., Marletta, A., Locci, M. (2024). Missing values and data enrichment: an application to social media liking. COMPUTATIONAL STATISTICS, 39(1), 217-237 [10.1007/s00180-022-01261-0].

Missing values and data enrichment: an application to social media liking

Paolo Mariani;Andrea Marletta
;
2024

Abstract

In the big data context, it is very frequent to manage the analysis of missing values. This is especially relevant in the field of statistical analysis, where this represents a thorny issue. This study proposes a strategy for data enrichment in presence of sparse matrices. The research objective consists in the evaluation of a possible distinction of behaviour among observations in sparse matrices with missing data. After selecting among the multiple imputation methods, an innovative technique will be presented to impute missing observations as a negative position or a neutral opinion. This method has been applied to a dataset measuring the interaction between users and social network pages for some Italian newspapers.
Articolo in rivista - Articolo scientifico
Data enrichment; Missing values; Multiple imputations; Social network data;
English
9-ago-2022
2024
39
1
217
237
open
Mariani, P., Marletta, A., Locci, M. (2024). Missing values and data enrichment: an application to social media liking. COMPUTATIONAL STATISTICS, 39(1), 217-237 [10.1007/s00180-022-01261-0].
File in questo prodotto:
File Dimensione Formato  
Mariani2022_Article_MissingValuesAndDataEnrichment.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 1.12 MB
Formato Adobe PDF
1.12 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/390292
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact