Bicocca Open Archive

Datasets that include alignments between natural language and Knowledge Graphs are fundamental to a wide variety of Natural Language Processing and Generation tasks. Current state-of-the-art aligned datasets, though, are significantly impacted by reduced size and scarcity of covered domains, and their quality is difficult to evaluate. To compensate for these issues, we introduce SEALIon, a tool for extracting RDF triples from natural language textual corpora based on a human-in-the-loop approach. We present our first results of SEALIon’s approach, paving the way for further researches in the field of human-in-the-loop triple extraction.

Amianto Barbato, J., Cremaschi, M., Rula, A., Maurino, A. (2024). Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation. In Intelligent Systems and Applications Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1 (pp.84-101). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-47721-8_6].

Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation

Amianto Barbato, J;Cremaschi, M;Rula, A;Maurino, A

2024

Abstract

Datasets that include alignments between natural language and Knowledge Graphs are fundamental to a wide variety of Natural Language Processing and Generation tasks. Current state-of-the-art aligned datasets, though, are significantly impacted by reduced size and scarcity of covered domains, and their quality is difficult to evaluate. To compensate for these issues, we introduce SEALIon, a tool for extracting RDF triples from natural language textual corpora based on a human-in-the-loop approach. We present our first results of SEALIon’s approach, paving the way for further researches in the field of human-in-the-loop triple extraction.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Human-in-the-loop; Natural language generation; Natural language processing; Relation extraction;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				Intelligent Systems Conference, IntelliSys 2023 - 7 September 2023 through 8 September 2023
			
	Anno del convegno
	
				2023
			
	Titolo degli atti
	
				Intelligent Systems and Applications
Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1
			
	ISBN del volume degli atti
	
				9783031477201
			
	Collana o serie
	
				LECTURE NOTES IN NETWORKS AND SYSTEMS
			
	Data di pubblicazione
	
				2024
			
	Numero del volume
	
				822 LNNS
			
	Pagina iniziale
	
				84
			
	Pagina finale
	
				101
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1007/978-3-031-47721-8_6
			
	Fulltext
	
				none
			
	Citazione
	
				Amianto Barbato, J., Cremaschi, M., Rula, A., Maurino, A. (2024). Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation. In Intelligent Systems and Applications
Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1 (pp.84-101). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-47721-8_6].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/465058

Citazioni

0

0

Social impact