A Collection of Question Answering Datasets for Norwegian

This paper introduces a new suite of question answering datasets for Norwegian: NorOpenBookQA, NorCommonSenseQA, NorTruthfulQA, and NRK-Quiz-QA. The data covers a wide range of skills and knowledge domains, including world knowledge, commonsense reasoning, truthfulness, and knowledge about Norway. Covering both written standards of Norwegian, Bokmål and Nynorsk, our datasets comprise over 10k question-answer pairs created by native speakers. We detail our dataset creation approach and present the results of evaluating 11 language models (LMs) in zero- and few-shot regimes. Most LMs perform better in Bokmål than in Nynorsk, struggle most with commonsense reasoning, and are often untruthful when generating answers to questions. All our datasets and annotation materials are publicly available.

Publication:

arXiv e-prints

Pub Date:

January 2025

arXiv:

arXiv:2501.11128

Bibcode:

2025arXiv250111128M

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence

E-Print:

Accepted for NoDaLiDa / Baltic-HLT 2025
