Datasets that include alignments between natural language and Knowledge Graphs are fundamental to a wide variety of Natural Language Processing and Generation tasks. Current state-of-the-art aligned datasets, though, are significantly impacted by reduced size and scarcity of covered domains, and their quality is difficult to evaluate. To compensate for these issues, we introduce SEALIon, a tool for extracting RDF triples from natural language textual corpora based on a human-in-the-loop approach. We present our first results of SEALIon’s approach, paving the way for further researches in the field of human-in-the-loop triple extraction.

Amianto Barbato, J., Cremaschi, M., Rula, A., Maurino, A. (2024). Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation. In Intelligent Systems and Applications Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1 (pp.84-101). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-47721-8_6].

Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation

Amianto Barbato, J
;
Cremaschi, M;Rula, A;Maurino, A
2024

Abstract

Datasets that include alignments between natural language and Knowledge Graphs are fundamental to a wide variety of Natural Language Processing and Generation tasks. Current state-of-the-art aligned datasets, though, are significantly impacted by reduced size and scarcity of covered domains, and their quality is difficult to evaluate. To compensate for these issues, we introduce SEALIon, a tool for extracting RDF triples from natural language textual corpora based on a human-in-the-loop approach. We present our first results of SEALIon’s approach, paving the way for further researches in the field of human-in-the-loop triple extraction.
paper
Human-in-the-loop; Natural language generation; Natural language processing; Relation extraction;
English
Intelligent Systems Conference, IntelliSys 2023 - 7 September 2023 through 8 September 2023
2023
Intelligent Systems and Applications Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1
9783031477201
2024
822 LNNS
84
101
none
Amianto Barbato, J., Cremaschi, M., Rula, A., Maurino, A. (2024). Toward a Human-in-the-Loop Approach to Create Training Datasets for RDF Lexicalisation. In Intelligent Systems and Applications Proceedings of the 2023 Intelligent Systems Conference (IntelliSys) Volume 1 (pp.84-101). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-47721-8_6].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/465058
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact