A repository with information about the Reykjavik University Question-Answering Dataset

cadia-lvl/RUQuAD

RUQuAD

A repository with information about the Reykjavik University Question-Answering Dataset (RUQuAD).

The first version of RUQuAD (RUQuAD 22.02) was collected in 2021-2022 by about 1,000 crowd-workers, who used the GameQA mobile app platform to generate about 23,000 questions, of which about 20,800 passed a double peer review. For these 20,800 verified questions, the crowd-workers annotated about 12,700 answers, drawn from five sources in four separate domains: the Icelandic Wikipedia, the Icelandic Web of Science, the news websites mbl.is and visir.is, and the Icelandic Government Information website.
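As a rough illustration of what these figures imply, the sketch below derives two ratios from the rounded counts given above. The constant names and the `share` helper are hypothetical, and the second ratio assumes at most one annotated answer per verified question, which this README does not state explicitly. The results are therefore approximations, not official dataset statistics.

```python
# Approximate, rounded counts for RUQuAD 22.02 as reported in this README.
QUESTIONS_GENERATED = 23_000
QUESTIONS_VERIFIED = 20_800
ANSWERS_ANNOTATED = 12_700


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Share of generated questions that passed the double peer review.
review_pass_rate = share(QUESTIONS_VERIFIED, QUESTIONS_GENERATED)

# Share of verified questions with an annotated answer
# (assumption: at most one answer per question).
answer_coverage = share(ANSWERS_ANNOTATED, QUESTIONS_VERIFIED)

print(f"Passed double peer review: about {review_pass_rate}% of generated questions")
print(f"Verified questions with an annotated answer: about {answer_coverage}%")
```

Because the inputs are themselves rounded ("about 23,000", "about 20,800", "about 12,700"), the computed percentages should be read as estimates only.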

Please refer to the following paper regarding GameQA and the compilation of RUQuAD:

Njáll Skarphéðinsson, Breki Guðmundsson, Steinar Smári, Marta Kristín Lárusdóttir, Hafsteinn Einarsson, Abuzar Khan, Eric Nyberg, and Hrafn Loftsson. 2023. GameQA: Gamified Mobile App for Building Multiple-Domain Question-Answering Datasets. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL): System Demonstrations. Dubrovnik, Croatia.

RUQuAD 22.02 is available for download from CLARIN.is as two separate datasets: RUQuAD-1 and RUQuAD-2.
