Large language models (LLMs) have revolutionized human-machine interaction with their ability to converse and perform various language tasks. This study investigates the potential of LLMs for knowledge formalization using well-defined vocabularies, specifically focusing on OntoLex-Lemon. As a preliminary exploration, we test four languages (English, Italian, Albanian, Romanian) and analyze the formalization quality of nine words with varying characteristics applying a multidimensional evaluation approach. While manual validation provided initial insights, it highlights the need for developing scalable evaluation methods for future large-scale experiments. This research aims to initiate a discussion on the potential and challenges of utilizing LLMs for knowledge formalization within the Semantic Web framework.

Pia di Buono, M., Spahiu, B., Mititelu, V. (2024). Evaluating Large Language Models for Linguistic Linked Data Generation. In Proceedings of the Workshop on DLnLD 2024: Deep Learning and Linked Data at LREC-COLING 2024 - Workshop Proceedings (pp.66-75). European Language Resources Association (ELRA).

Evaluating Large Language Models for Linguistic Linked Data Generation

Spahiu B.
;
2024

Abstract

Large language models (LLMs) have revolutionized human-machine interaction with their ability to converse and perform various language tasks. This study investigates the potential of LLMs for knowledge formalization using well-defined vocabularies, specifically focusing on OntoLex-Lemon. As a preliminary exploration, we test four languages (English, Italian, Albanian, Romanian) and analyze the formalization quality of nine words with varying characteristics applying a multidimensional evaluation approach. While manual validation provided initial insights, it highlights the need for developing scalable evaluation methods for future large-scale experiments. This research aims to initiate a discussion on the potential and challenges of utilizing LLMs for knowledge formalization within the Semantic Web framework.
paper
Knowledge Formalisation; Large Language Models; Linguistic Data; Semantic Web
English
2024 Workshop on Deep Learning and Linked Data, DLnLD 2024 - 21 May 2024
2024
Serasset, G; Oliveira, HG; Oleskeviciene, GV
Proceedings of the Workshop on DLnLD 2024: Deep Learning and Linked Data at LREC-COLING 2024 - Workshop Proceedings
9782493814166
2024
66
75
https://aclanthology.org/2024.dlnld-1.6/
open
Pia di Buono, M., Spahiu, B., Mititelu, V. (2024). Evaluating Large Language Models for Linguistic Linked Data Generation. In Proceedings of the Workshop on DLnLD 2024: Deep Learning and Linked Data at LREC-COLING 2024 - Workshop Proceedings (pp.66-75). European Language Resources Association (ELRA).
File in questo prodotto:
File Dimensione Formato  
Pia di Buono-2024-Proceedings of the Workshop on DLnLD 2024-VoR.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 181.89 kB
Formato Adobe PDF
181.89 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/488839
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact