Kensho Derived Wikimedia Dataset (KDWD)
Text CorporaKnowledge BaseEnglish
Kensho Derived Wikimedia Dataset (KDWD) is a text corpora dataset in English from Kensho R&D in CSV, JSON format.
About Kensho Derived Wikimedia Dataset (KDWD)
Dataset contains two main components - a link annotated corpus of English Wikipedia pages and a compact sample of the Wikidata knowledge base.
Details
- Task
- Text Corpora, Knowledge Base
- Language
- English
- Format
- CSV, JSON
- Rows / instances
- n/a
- Creator
- Kensho R&D
- Year
- 2020