Bicocca Open Archive

Hate speech is a major challenge in Indonesia, a diverse country with multiple languages and a dynamic online landscape. This research explores the phenomenon of hate speech and its detection, particularly in language contexts with limited resources. We introduce a new abusive words lexicon, created by collecting words from various sources, adapted for Indonesian, Javanese and Sundanese. Our study investigates the practical implementation of this lexicon. We conducted extensive experiments using different datasets and machine learning models, aiming to improve hate speech detection. The results consistently show a positive impact of the lexicon, which significantly improves detection, especially in languages with fewer resources. But this research paves the way for further exploration. The lexicon can be expanded, broadening its scope. Additionally, we suggest investigating more sophisticated models, such as transformer-based models, to more effectively detect hate speech. In a world where hate speech is a growing problem, our research provides valuable insights and tools to combat it effectively in Indonesia and other countries.

Pamungkas, E., Purworini, D., Putri, D., Akhtar, S. (2024). Enhancing hate speech detection in Indonesian using abusive words lexicon. INDONESIAN JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 33(1), 450-462 [10.11591/ijeecs.v33.i1.pp450-462].

Enhancing hate speech detection in Indonesian using abusive words lexicon

Pamungkas E. W.;Purworini D.;Putri D. G. P.;Akhtar S.

2024

Abstract

Hate speech is a major challenge in Indonesia, a diverse country with multiple languages and a dynamic online landscape. This research explores the phenomenon of hate speech and its detection, particularly in language contexts with limited resources. We introduce a new abusive words lexicon, created by collecting words from various sources, adapted for Indonesian, Javanese and Sundanese. Our study investigates the practical implementation of this lexicon. We conducted extensive experiments using different datasets and machine learning models, aiming to improve hate speech detection. The results consistently show a positive impact of the lexicon, which significantly improves detection, especially in languages with fewer resources. But this research paves the way for further exploration. The lexicon can be expanded, broadening its scope. Additionally, we suggest investigating more sophisticated models, such as transformer-based models, to more effectively detect hate speech. In a world where hate speech is a growing problem, our research provides valuable insights and tools to combat it effectively in Indonesia and other countries.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Abusive language; Hate speech detection; Low-resource languages; Machine learning; Social media;
			
	Lingua del contenuto
	
				English
			
	Data di pubblicazione
	
				2024
			
	Rivista
	
				INDONESIAN JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE
			
	Numero del volume
	
				33
			
	Fascicolo
	
				1
			
	Pagina iniziale
	
				450
			
	Pagina finale
	
				462
			
	DOI dell'articolo
	
				https://dx.doi.org/10.11591/ijeecs.v33.i1.pp450-462
			
	Fulltext
	
				open
			
	Citazione
	
				Pamungkas, E., Purworini, D., Putri, D., Akhtar, S. (2024). Enhancing hate speech detection in Indonesian using abusive words lexicon. INDONESIAN JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 33(1), 450-462 [10.11591/ijeecs.v33.i1.pp450-462].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
"35178-70512-1-PB.pdf" accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 1.57 MB Formato Unknown Visualizza/Apri	1.57 MB	Unknown	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/530050

Citazioni

0

ND

Social impact