simple_questions_freebase[1] is a dataset built on a subset of Freebase (FB5M). It was manually created and annotated against Freebase. It consists of 108,442 questions written in natural language by human English-speaking annotators, each paired with a corresponding fact from FB2M that provides the answer and explains it. The questions are randomly shuffled and split into a training set with 70% of them (75,910), a validation set with 10% (10,845), and a test set with the remaining 20% (21,687).
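The shuffle-and-split procedure described above can be sketched as follows. This is a minimal illustration, not the authors' script: the function name `split_70_10_20`, the seed, and the choice to round the 70% and 10% cut points up are assumptions. Rounding up happens to reproduce the reported training and validation sizes, but the rounding rule and random seed actually used for the dataset are not stated in the source.

```python
import random


def split_70_10_20(items, seed=0):
    """Shuffle items and split them 70% / 10% / 20% (hypothetical helper)."""
    shuffled = list(items)
    # A fixed seed makes the shuffle reproducible; the value 0 is arbitrary.
    random.Random(seed).shuffle(shuffled)

    n = len(shuffled)
    # Integer ceiling division: assumed rounding rule, chosen because it
    # matches the reported 75,910 / 10,845 sizes for n = 108,442.
    n_train = -(-n * 7 // 10)
    n_valid = -(-n // 10)

    train = shuffled[:n_train]
    valid = shuffled[n_train:n_train + n_valid]
    test = shuffled[n_train + n_valid:]  # whatever remains, about 20%
    return train, valid, test


# Stand-in for the 108,442 question/fact pairs: plain integer ids.
train, valid, test = split_70_10_20(range(108442))
print(len(train), len(valid), len(test))  # 75910 10845 21687
```

To compare against the leaderboard results, use the split distributed with the dataset rather than a freshly shuffled one, since a different shuffle yields a different test set.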

The dataset is available for download at this link.

Leaderboard

| Model / System | Year | Precision | Recall | F1 | Language | Reported by |
| --- | --- | --- | --- | --- | --- | --- |
| STaG-QA_pre | 2023 | 60.20 | 63.20 | 61.70 | EN | Badenes-Olmedo and Corcho |
| MuHeQA | 2023 | 59.70 | 56.33 | 57.97 | EN | Badenes-Olmedo and Corcho |
| SYGMA | 2023 | 42.00 | 55.00 | 44.00 | EN | Badenes-Olmedo and Corcho |
| Falcon 2.0 | 2023 | 34.00 | 41.10 | 36.30 | EN | Badenes-Olmedo and Corcho |

References

[1] Antoine Bordes, Nicolas Usunier, Sumit Chopra, Jason Weston. Large-scale Simple Question Answering with Memory Networks. arXiv preprint arXiv:1603.06807 (2016).

[2] We only update system performance after 01.01.2023.

Go back to the README