simple_questions_freebase^[1] is an created built on a subset of Freebase - FB5M. The dataset is manually created and annotated against Freebase. This dataset consists of a total of 108,442 questions written in natu-al language by human English-speaking annota- tors each paired with a corresponding fact from FB2M that provides the answer and explains it. These questions are randomy shufffled, and seperated by 70% of them (75910) as training set, 10% as validation set (10845), and the remaining 20% as test set.

The dataset is available for download at this link.

Leaderboard

Model / System	Year	Precision	Recall	F1	Language	Reported by
STaG-QA_pre	2023	60.20	63.20	61.70	EN	Badenes-Olmedo and Corcho
MuHeQA	2023	59.70	56.33	57.97	EN	Badenes-Olmedo and Corcho
SYGMA	2023	42.00	55.00	44.00	EN	Badenes-Olmedo and Corcho
Falcon 2.0	2023	34.00	41.10	36.30	EN	Badenes-Olmedo and Corcho

References

[1] Antoine Bordes, Nicolas Usunier, Sumit Chopra, Jason Weston. Large-scale Simple Question Answering with Memory Networks arXiv preprint arXiv:1603.06807 (2016). [2] We only update system performance after 01.01.2023.

Go back to the README

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simple_questions_freebase.md

simple_questions_freebase.md

Leaderboard

References

Files

simple_questions_freebase.md

Latest commit

History

simple_questions_freebase.md

File metadata and controls

Leaderboard

References