Knowledge Base Datasets
There are 13 knowledge base datasets in our directory, 1 of which are benchmarks. Each links to its source, paper, and download — browse the full list below or filter by language.
Knowledge Base is a machine-learning task covered in our directory. We catalog 13 datasets for it.
Updated June 2026
- GrailQAQuestion Answering, Knowledge BaseEnglish
- Complex Sequential Question Answering (CSQA)Question Answering, Knowledge BaseEnglish
- WebQuestionsQuestion Answering, Knowledge BaseEnglish
- Kensho Derived Wikimedia Dataset (KDWD)Text Corpora, Knowledge BaseEnglish
- Wikidata NE datasetNamed Entity Recognition, Knowledge BaseGerman, English
- ReVerb45k, Base and AmbiguousInformation Retrieval, Knowledge BaseEnglish
- An Open Information Extraction Corpus (OPIEC)Knowledge Base, Information ExtractionEnglish
- Aristo Tuple KBKnowledge BaseEnglish
- ComplexWebQuestionsQuestion Answering, Knowledge BaseEnglish
- DbpediaKnowledge BaseMulti-Lingual
- The SimpleQuestions DatasetQuestion Answering, Knowledge BaseEnglish
- TupleInf Open IE DatasetKnowledge BaseEnglish
- The Semantic Scholar Open Research Corpus (S2ORC)Text Corpora, Knowledge BaseEnglishBenchmark