Bicocca Open Archive

Personalization in Information Retrieval is a topic studied for a long time. Nevertheless, there is still a lack of high-quality, real-world datasets to conduct large-scale experiments and evaluate models for personalized search. This paper contributes to filling this gap by introducing SE-PQA(StackExchange - Personalized Question Answering), a new curated resource to design and evaluate personalized models related to the task of community Question Answering (cQA). The contributed dataset includes more than 1 million queries and 2 million answers, annotated with a rich set of features modeling the social interactions among the users of a popular cQA platform. We describe the characteristics of SE-PQA and detail the features associated with questions and answers. We also provide reproducible baseline methods for the cQA task based on the resource, including deep learning models and personalization approaches. The results of the preliminary experiments conducted show the appropriateness of SE-PQA to train effective cQA models; they also show that personalization remarkably improves the effectiveness of all the methods tested. Furthermore, we show the benefits in terms of robustness and generalization of combining data from multiple communities for personalization purposes.

Kasela, P., Braga, M., Pasi, G., Perego, R. (2024). SE-PQA: Personalized Community Question Answering. In WWW '24: Companion Proceedings of the ACM Web Conference 2024 (pp.1095-1098). Association for Computing Machinery, Inc [10.1145/3589335.3651445].

SE-PQA: Personalized Community Question Answering

Kasela, Pranav^Primo;Braga, Marco;Pasi, Gabriella;Perego, Raffaele

2024

Abstract

Personalization in Information Retrieval is a topic studied for a long time. Nevertheless, there is still a lack of high-quality, real-world datasets to conduct large-scale experiments and evaluate models for personalized search. This paper contributes to filling this gap by introducing SE-PQA(StackExchange - Personalized Question Answering), a new curated resource to design and evaluate personalized models related to the task of community Question Answering (cQA). The contributed dataset includes more than 1 million queries and 2 million answers, annotated with a rich set of features modeling the social interactions among the users of a popular cQA platform. We describe the characteristics of SE-PQA and detail the features associated with questions and answers. We also provide reproducible baseline methods for the cQA task based on the resource, including deep learning models and personalization approaches. The results of the preliminary experiments conducted show the appropriateness of SE-PQA to train effective cQA models; they also show that personalization remarkably improves the effectiveness of all the methods tested. Furthermore, we show the benefits in terms of robustness and generalization of combining data from multiple communities for personalization purposes.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Personalization; Question Answering; User Model;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				33rd ACM Web Conference, WWW 2024 - 13 May 2024 through 17 May 2024
			
	Anno del convegno
	
				2024
			
	Titolo degli atti
	
				WWW '24: Companion Proceedings of the ACM Web Conference 2024
			
	ISBN del volume degli atti
	
				9798400701726
			
	Data di pubblicazione
	
				2024
			
	Numero del volume
	
				9
			
	Pagina iniziale
	
				1095
			
	Pagina finale
	
				1098
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1145/3589335.3651445
			
	Fulltext
	
				open
			
	Citazione
	
				Kasela, P., Braga, M., Pasi, G., Perego, R. (2024). SE-PQA: Personalized Community Question Answering. In WWW '24: Companion Proceedings of the ACM Web Conference 2024 (pp.1095-1098). Association for Computing Machinery, Inc [10.1145/3589335.3651445].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
3589335.3651445.pdf accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 1.02 MB Formato Adobe PDF Visualizza/Apri	1.02 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/521139

Citazioni

2

ND

Social impact