Bicocca Open Archive

With the increasing influence of social media platforms, it has become crucial to develop automated systems capable of detecting instances of sexism and other disrespectful and hateful behaviors to promote a more inclusive and respectful online environment. Nevertheless, these tasks are considerably challenging considering different hate categories and the author’s intentions, especially under the learning with disagreements regime. This paper describes AI-UPV team’s participation in the EXIST (sEXism Identification in Social neTworks) Lab at CLEF 2023 [1, 2]. The proposed approach aims at addressing the task of sexism identification and characterization under the learning with disagreements paradigm by training directly from the data with disagreements, without using any aggregated label. Yet, performances considering both soft and hard evaluations are reported. The proposed system uses large language models (i.e., mBERT and XLM-RoBERTa) and ensemble strategies for sexism identification and classification in English and Spanish. In particular, our system is articulated in three different pipelines. The ensemble approach outperformed the individual large language models obtaining the best performances both adopting a soft and a hard label evaluation. This work describes the participation in all the three EXIST tasks, considering a soft evaluation, it obtained fourth place in Task 2 at EXIST and first place in Task 3, with the highest ICM-Soft of −2.32 and a normalized ICM-Soft of 0.79. The source code of our approaches is publicly available at https://github.com/AngelFelipeMP/Sexism-LLM-Learning-With-Disagreement.

de Paula, A., Rizzi, G., Fersini, E., Spina, D. (2023). AI-UPV at EXIST 2023 – Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime. In CLEF 2023 Working Notes - Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023) (pp.985-999). CEUR-WS.

AI-UPV at EXIST 2023 – Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime

de Paula A. F. M.;Rizzi G.;Fersini E.;Spina D.

2023

Abstract

With the increasing influence of social media platforms, it has become crucial to develop automated systems capable of detecting instances of sexism and other disrespectful and hateful behaviors to promote a more inclusive and respectful online environment. Nevertheless, these tasks are considerably challenging considering different hate categories and the author’s intentions, especially under the learning with disagreements regime. This paper describes AI-UPV team’s participation in the EXIST (sEXism Identification in Social neTworks) Lab at CLEF 2023 [1, 2]. The proposed approach aims at addressing the task of sexism identification and characterization under the learning with disagreements paradigm by training directly from the data with disagreements, without using any aggregated label. Yet, performances considering both soft and hard evaluations are reported. The proposed system uses large language models (i.e., mBERT and XLM-RoBERTa) and ensemble strategies for sexism identification and classification in English and Spanish. In particular, our system is articulated in three different pipelines. The ensemble approach outperformed the individual large language models obtaining the best performances both adopting a soft and a hard label evaluation. This work describes the participation in all the three EXIST tasks, considering a soft evaluation, it obtained fourth place in Task 2 at EXIST and first place in Task 3, with the highest ICM-Soft of −2.32 and a normalized ICM-Soft of 0.79. The source code of our approaches is publicly available at https://github.com/AngelFelipeMP/Sexism-LLM-Learning-With-Disagreement.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Ensemble; Large Language Models; Learning with Disagreements; Sexism Characterization;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				24th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF-WN 2023 - 18 September 2023 through 21 September 2023
			
	Anno del convegno
	
				2023
			
	Curatori della monografia
	
				Aliannejadi, M; Faggioli, G; Ferro, N; Vlachos, M
			
	Titolo degli atti
	
				CLEF 2023 Working Notes - Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023)
			
	Collana o serie
	
				CEUR WORKSHOP PROCEEDINGS
			
	Data di pubblicazione
	
				2023
			
	Numero del volume
	
				3497
			
	Pagina iniziale
	
				985
			
	Pagina finale
	
				999
			
	URL alternativo
	
				https://ceur-ws.org/Vol-3497/
			
	Fulltext
	
				none
			
	Citazione
	
				de Paula, A., Rizzi, G., Fersini, E., Spina, D. (2023). AI-UPV at EXIST 2023 – Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime. In CLEF 2023 Working Notes - Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023) (pp.985-999). CEUR-WS.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/451402

Citazioni

2

ND

Social impact