During last years, Big Data appears as one of the most innovative and growing scientific area of interest. In this field, finding reliable methods to make accurate predictions represents one of the most inspirational challenges. The way to make prediction in the following paper is the use of ROC (Receiver Operating Characteristic) Curve, a binary prediction tool, often used for medical tests. The attention is focused in particular on the implementation of ROC Curve in GAMLSS (Generalized Additive Models for Location Scale and Shape), semi-parametric models suitable for huge and flexible dataset. An application will be shown where the class of GAMLSS is applied to Twitter data in order to predict number of interactions for a tweet given a set of explanatory variables.

Mariani, P., Marletta, A., Sciandra, M. (2017). GAMLSS for Big Data: Roc Curve prediction using Twitter data. Intervento presentato a: CLADAG Scientific Meeting of the CLAssification and Data Analysis Group, Milano, Italy.

GAMLSS for Big Data: Roc Curve prediction using Twitter data

MARIANI, PAOLO;MARLETTA, ANDREA
;
2017

Abstract

During last years, Big Data appears as one of the most innovative and growing scientific area of interest. In this field, finding reliable methods to make accurate predictions represents one of the most inspirational challenges. The way to make prediction in the following paper is the use of ROC (Receiver Operating Characteristic) Curve, a binary prediction tool, often used for medical tests. The attention is focused in particular on the implementation of ROC Curve in GAMLSS (Generalized Additive Models for Location Scale and Shape), semi-parametric models suitable for huge and flexible dataset. An application will be shown where the class of GAMLSS is applied to Twitter data in order to predict number of interactions for a tweet given a set of explanatory variables.
abstract + slide
GAMLSS, ROC curve, Twitter, Big Data
English
CLADAG Scientific Meeting of the CLAssification and Data Analysis Group
2017
2017
open
Mariani, P., Marletta, A., Sciandra, M. (2017). GAMLSS for Big Data: Roc Curve prediction using Twitter data. Intervento presentato a: CLADAG Scientific Meeting of the CLAssification and Data Analysis Group, Milano, Italy.
File in questo prodotto:
File Dimensione Formato  
RevGAMLSS for Big Data.pdf

accesso aperto

Dimensione 118.89 kB
Formato Adobe PDF
118.89 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/171385
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact