Skip to content

Kensho Derived Wikimedia Dataset (KDWD)

Text CorporaKnowledge BaseEnglish

Kensho Derived Wikimedia Dataset (KDWD) is a text corpora dataset in English from Kensho R&D in CSV, JSON format.

About Kensho Derived Wikimedia Dataset (KDWD)

Dataset contains two main components - a link annotated corpus of English Wikipedia pages and a compact sample of the Wikidata knowledge base.

Details

Task
Text Corpora, Knowledge Base
Language
English
Format
CSV, JSON
Rows / instances
n/a
Creator
Kensho R&D
Year
2020
Download

Related Text Corpora, Knowledge Base datasets

FAQ