In recent years, we saw the emergence of several approaches for producing machine-readable, semantically rich, interlinked description of the content of research publications, typically encoded as knowledge graphs. A common limitation of these solutions is that they address a low number of articles, either because they rely on human experts to summarize information from the literature or because they focus on specific research areas. In this paper, we introduce the Computer Science Knowledge Graph (CS-KG), a large-scale knowledge graph composed by over 350M RDF triples describing 41M statements from 6.7M articles about 10M entities linked by 179 semantic relations. It was automatically generated and will be periodically updated by applying an information extraction pipeline on a large repository of research papers. CS-KG is much larger than all comparable solutions and offers a very comprehensive representation of tasks, methods, materials, and metrics in Computer Science. It can support a variety of intelligent services, such as advanced literature search, document classification, article recommendation, trend forecasting, hypothesis generation, and many others. CS-KG was evaluated against a benchmark of manually annotated statements, yielding excellent results.

Dessi, D., Osborne, F., Reforgiato Recupero, D., Buscaldi, D., Motta, E. (2022). CS-KG: A Large-Scale Knowledge Graph of Research Entities and Claims in Computer Science. In The Semantic Web – ISWC 2022 21st International Semantic Web Conference, Virtual Event, October 23–27, 2022, Proceedings (pp.678-696). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-19433-7_39].

CS-KG: A Large-Scale Knowledge Graph of Research Entities and Claims in Computer Science

Osborne F.;
2022

Abstract

In recent years, we saw the emergence of several approaches for producing machine-readable, semantically rich, interlinked description of the content of research publications, typically encoded as knowledge graphs. A common limitation of these solutions is that they address a low number of articles, either because they rely on human experts to summarize information from the literature or because they focus on specific research areas. In this paper, we introduce the Computer Science Knowledge Graph (CS-KG), a large-scale knowledge graph composed by over 350M RDF triples describing 41M statements from 6.7M articles about 10M entities linked by 179 semantic relations. It was automatically generated and will be periodically updated by applying an information extraction pipeline on a large repository of research papers. CS-KG is much larger than all comparable solutions and offers a very comprehensive representation of tasks, methods, materials, and metrics in Computer Science. It can support a variety of intelligent services, such as advanced literature search, document classification, article recommendation, trend forecasting, hypothesis generation, and many others. CS-KG was evaluated against a benchmark of manually annotated statements, yielding excellent results.
paper
Artificial Intelligence; Information extraction; Knowledge graph; Natural language processing; Scholarly data; Semantic Web;
English
21st International Semantic Web Conference, ISWC 2022 - 23 October 2022through 27 October 2022
2022
Sattler, U; Hogan, A; Keet, M; Presutti, V; Almeida, JPA; Takeda, H; Monnin, P; Pirrò, G; d’Amato, C
The Semantic Web – ISWC 2022 21st International Semantic Web Conference, Virtual Event, October 23–27, 2022, Proceedings
978-3-031-19432-0
2022
13489 LNCS
678
696
open
Dessi, D., Osborne, F., Reforgiato Recupero, D., Buscaldi, D., Motta, E. (2022). CS-KG: A Large-Scale Knowledge Graph of Research Entities and Claims in Computer Science. In The Semantic Web – ISWC 2022 21st International Semantic Web Conference, Virtual Event, October 23–27, 2022, Proceedings (pp.678-696). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-19433-7_39].
File in questo prodotto:
File Dimensione Formato  
Dessi-2022-Lect Notes Computer Sci-preprint.pdf

accesso aperto

Descrizione: Intervento a convegno
Tipologia di allegato: Submitted Version (Pre-print)
Licenza: Creative Commons
Dimensione 1.88 MB
Formato Adobe PDF
1.88 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/412219
Citazioni
  • Scopus 20
  • ???jsp.display-item.citation.isi??? 5
Social impact