Bicocca Open Archive

The Social Web supports and fosters social interactions by means of different social media, which allow the spread of the so called User-Generated Content (UGC). In this context, characterized by the absence of trusted third parties that verify the reliability of the sources and the believability of the content generated, the issue of assessing the credibility of the information diffused by means of social media is receiving increasing attention. In the literature, this issue has been mainly tackled as a classification problem; information is categorized into genuine and fake, usually by implementing or applying classifiers that consider multiple kinds of features (mainly textual and non-Textual) to be evaluated in terms of credibility. In this article, unlike prior research, textual features are considered separately with respect to other kinds of features during the classification process. In particular, an Ensemble Method that combines the results produced by two text classifiers and the ones returned by another classifier acting on non-Textual features is proposed. This allows to have better results with respect to the use of a single classifier on multiple features together. The effectiveness of the Ensemble Method has been assessed in the context of review sites, by means of a labeled dataset gathered from the Yelp.com site, where on-line reviews are already classified as recommended and not recommended.

Fontanarava, J., Pasi, G., Viviani, M. (2017). An ensemble method for the credibility assessment of user-generated content. In Proceedings of the International Conference on Web Intelligence (pp.863-868). Association for Computing Machinery, Inc [10.1145/3106426.3106464].

An ensemble method for the credibility assessment of user-generated content

Fontanarava, J;PASI, GABRIELLA^Secondo;VIVIANI, MARCO^Ultimo

2017

Abstract

The Social Web supports and fosters social interactions by means of different social media, which allow the spread of the so called User-Generated Content (UGC). In this context, characterized by the absence of trusted third parties that verify the reliability of the sources and the believability of the content generated, the issue of assessing the credibility of the information diffused by means of social media is receiving increasing attention. In the literature, this issue has been mainly tackled as a classification problem; information is categorized into genuine and fake, usually by implementing or applying classifiers that consider multiple kinds of features (mainly textual and non-Textual) to be evaluated in terms of credibility. In this article, unlike prior research, textual features are considered separately with respect to other kinds of features during the classification process. In particular, an Ensemble Method that combines the results produced by two text classifiers and the ones returned by another classifier acting on non-Textual features is proposed. This allows to have better results with respect to the use of a single classifier on multiple features together. The effectiveness of the Ensemble Method has been assessed in the context of review sites, by means of a labeled dataset gathered from the Yelp.com site, where on-line reviews are already classified as recommended and not recommended.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				slide + paper
			
	Parole chiave
	
				Classification; Credibility; Ensemble learning; Language models; Social media; Socialweb; Text mining;
			
	Parole chiave
	
				Credibility, Social Web, Social Media, Classification, Ensemble Learning, Text Mining, Language Models.
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				16th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2017
			
	Anno del convegno
	
				2017
			
	Titolo degli atti
	
				Proceedings of the International Conference on Web Intelligence
			
	ISBN del volume degli atti
	
				9781450349512
			
	Data di pubblicazione
	
				2017
			
	Pagina iniziale
	
				863
			
	Pagina finale
	
				868
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1145/3106426.3106464
			
	Fulltext
	
				none
			
	Citazione
	
				Fontanarava, J., Pasi, G., Viviani, M. (2017). An ensemble method for the credibility assessment of user-generated content. In Proceedings of the International Conference on Web Intelligence (pp.863-868). Association for Computing Machinery, Inc [10.1145/3106426.3106464].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/168473

Citazioni

9

5

Social impact