DBLP-QuAD: A Question Answering Dataset over the DBLP Scholarly Knowledge Graph
Abstract
In this work we create a question answering dataset over the DBLP scholarly knowledge graph (KG). DBLP is an on-line reference for bibliographic information on major computer science publications that indexes over 4.4 million publications published by more than 2.2 million authors. Our dataset consists of 10,000 question answer pairs with the corresponding SPARQL queries which can be executed over the DBLP KG to fetch the correct answer. DBLP-QuAD is the largest scholarly question answering dataset.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2023
- DOI:
- 10.48550/arXiv.2303.13351
- arXiv:
- arXiv:2303.13351
- Bibcode:
- 2023arXiv230313351B
- Keywords:
-
- Computer Science - Digital Libraries;
- Computer Science - Computation and Language
- E-Print:
- 12 pages ceur-ws 1 column accepted at International Bibliometric Information Retrieval Workshp @ ECIR 2023