Skip to content

WikiAnn

Named Entity Recognition (NER)Multi-Lingual

The WikiAnn dataset is a Multi-Lingual Named Entity Recognition (NER) resource from Pan et al. at 2017 comprising 95,924 examples.

About WikiAnn

Dataset with NER annotations for PER, ORG and LOC. It has been constructed using the linked entities in Wikipedia pages for 282 different languages.

Details

Task
Named Entity Recognition (NER)
Language
Multi-Lingual
Format
JSON
Rows / instances
95,924
Creator
Pan et al.
Year
2017
Download Paper

Related Named Entity Recognition (NER) datasets

FAQ