The incompleteness of Knowledge Graphs (KGs) is a crucial issue affecting the quality of AI-based services. In the scholarly domain, KGs describing research publications typically lack important information, hindering our ability to analyse and predict research dynamics. In recent years, link prediction approaches based on Knowledge Graph Embedding models have become a first remedy for this issue. In this work, we present Trans4E, a novel embedding model that is particularly well suited to KGs that include N to M relations with N ≫ M. This is typical of KGs that categorize a large number of entities (e.g., research articles, patents, persons) according to a relatively small set of categories. Trans4E was applied to two large-scale knowledge graphs, the Academia/Industry DynAmics (AIDA) and Microsoft Academic Graph (MAG), to complete the information about Fields of Study (e.g., ‘neural networks’, ‘machine learning’, ‘artificial intelligence’) and affiliation types (e.g., ‘education’, ‘company’, ‘government’), improving the scope and accuracy of the resulting data. We evaluated our approach against alternative solutions on AIDA, MAG, and four other benchmarks (FB15k, FB15k-237, WN18, and WN18RR). Trans4E outperforms the other models when using low embedding dimensions and obtains competitive results in high dimensions.
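
To make the idea of embedding-based link prediction concrete, the sketch below ranks candidate Fields of Study for a paper with a generic TransE-style translational score. It is not the Trans4E scoring function, which the abstract does not specify. The entity names, the `has_field` relation, and the 2-dimensional vectors are all hypothetical values chosen by hand for illustration.

```python
import math

def transe_score(head, relation, tail):
    """TransE-style plausibility: negative L2 distance between head + relation and tail."""
    return -math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))

def rank_tails(head, relation, candidates):
    """Return candidate tail names sorted from most to least plausible."""
    return sorted(
        candidates,
        key=lambda name: transe_score(head, relation, candidates[name]),
        reverse=True,
    )

# Hypothetical embeddings (assumption: hand-picked, not learned from AIDA or MAG).
papers = {"paper_a": [0.9, 0.1], "paper_b": [1.1, -0.1]}
has_field = [-1.0, 1.0]
fields = {"machine learning": [0.0, 1.0], "databases": [1.0, -1.0]}

# Link prediction: which field most plausibly completes (paper_a, has_field, ?)
ranking = rank_tails(papers["paper_a"], has_field, fields)
print(ranking)
```

In this toy setting both papers are assigned to the same field only because their vectors sit close to `tail - relation`. With a purely translational score, every head that shares a tail under one relation is pulled toward that same point, which gives some intuition for why relations with N ≫ M are demanding at low embedding dimensions.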

Nayyeri, M., Cil, G., Vahdati, S., Osborne, F., Rahman, M., Angioni, S., et al. (2021). Trans4E: Link prediction on scholarly knowledge graphs. NEUROCOMPUTING, 461(21 October 2021), 530-542 [10.1016/j.neucom.2021.02.100].

Trans4E: Link prediction on scholarly knowledge graphs

Osborne F; Salatino A
2021

Type: Journal article - Scientific article
Keywords: Knowledge graph embedding; Link prediction; Metaresearch queries; Research of research; Scholarly communication; Scholarly knowledge graph; Science graph
Language: English
Year: 2021
Volume: 461
Issue: 21 October 2021
First page: 530
Last page: 542
Access: open
Files in this item:
File: Nayyeri-2021-Neurocomput-preprint.pdf
Access: open access
Attachment type: Submitted Version (Pre-print)
License: Creative Commons
Size: 1.43 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/374651
Citations
  • Scopus 41
  • Web of Science (ISI) 31