Bicocca Open Archive


Fersini, E., Pozzi, F., Messina, V. (2015). Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers. In Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 (pp.981-988). Institute of Electrical and Electronics Engineers Inc. [10.1109/DSAA.2015.7344888].

Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers

Fersini, Elisabetta (first author); Pozzi, Federico Alberto (second author); Messina, Vincenzina (last author)

2015

Abstract

The automatic detection of sarcasm and irony in user-generated content is one of the most challenging tasks in Natural Language Processing. In this paper we address this problem by introducing Bayesian Model Averaging (BMA), an ensemble approach that combines several classifiers according to their reliabilities and their marginal probability predictions. The impact of the most commonly used expressive signals (pragmatic particles and POS tags) has been evaluated in baseline models (traditional classifiers and majority voting) as well as in the proposed BMA approach. Experimental results highlight two main findings: (1) not all features are equally able to characterize sarcasm and irony, and (2) BMA not only outperforms traditional state-of-the-art models, but also shows notable generalization capabilities on both ironic and sarcastic text.
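The weighting idea described in the abstract can be illustrated with a minimal sketch. This record contains only the abstract, so the details below are assumptions rather than the authors' implementation: each classifier's reliability is taken to be a non-negative score (for example, a validation-set measure), the scores are normalized into weights, and the ensemble probability of a label is the weighted sum of the classifiers' marginal probabilities. The function names `bma_predict` and `majority_vote`, the label names, and all numbers are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def bma_predict(model_probs: List[Dict[str, float]],
                reliabilities: List[float]) -> Dict[str, float]:
    """Combine per-classifier label probabilities, weighted by reliability.

    Assumption: reliabilities are non-negative scores that are normalized
    here so they sum to 1 and act as model weights.
    """
    if len(model_probs) != len(reliabilities):
        raise ValueError("one reliability score is needed per classifier")
    total = sum(reliabilities)
    if total <= 0:
        raise ValueError("reliabilities must sum to a positive value")
    weights = [r / total for r in reliabilities]
    labels = model_probs[0].keys()
    # Weighted sum of each classifier's marginal probability for each label.
    return {
        label: sum(w * probs[label] for w, probs in zip(weights, model_probs))
        for label in labels
    }


def majority_vote(model_probs: List[Dict[str, float]]) -> str:
    """Baseline: each classifier votes for its most probable label."""
    votes = Counter(max(probs, key=probs.get) for probs in model_probs)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Hypothetical outputs of three classifiers for a single message.
    probs = [
        {"ironic": 0.90, "not_ironic": 0.10},
        {"ironic": 0.45, "not_ironic": 0.55},
        {"ironic": 0.40, "not_ironic": 0.60},
    ]
    # Hypothetical reliability scores, one per classifier.
    reliabilities = [0.8, 0.6, 0.6]

    combined = bma_predict(probs, reliabilities)
    print("weighted:", {k: round(v, 3) for k, v in combined.items()})
    print("weighted label:", max(combined, key=combined.get))
    print("majority vote:", majority_vote(probs))
```

In this made-up example the two less confident classifiers outvote the first one under majority voting, while the weighted combination follows the more reliable and more confident classifier. That contrast is the kind of behavior the abstract attributes to taking reliabilities and marginal probabilities into account.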


Contribution type: paper
Keywords: Artificial Intelligence; Information Systems and Management; Information Systems
Content language: English
Conference name: IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, Oct 19-21
Conference year: 2015
Proceedings title: Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015
Proceedings ISBN: 9781467382731
Publication date: 2015
First page: 981
Last page: 988
Article number: 7344888
DOI: https://dx.doi.org/10.1109/DSAA.2015.7344888
Fulltext: none
Appears in types: 02 - Conference contribution

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/135766
