Named Entity Recognition (NER) Datasets
Our directory catalogs 14 datasets for Named Entity Recognition (NER), a machine-learning task. Each entry links to its source, paper, and download. Browse the full list below or filter by language.
Updated June 2026
- CoNLL 2003 ++ | Named Entity Recognition (NER) | English
- WNUT 2016 | Named Entity Recognition (NER) | English
- BioCreative II Gene Mention Recognition (BC2GM) | Information Extraction, Named Entity Recognition (NER) | English
- BC5CDR Drug/Chemical (BC5-Chem) | Information Extraction, Named Entity Recognition (NER) | English
- BC5CDR Disease (BC5-Disease) | Information Extraction, Named Entity Recognition (NER) | English
- JNLPBA | Information Extraction, Named Entity Recognition (NER) | English
- NCBI Disease Corpus | Information Extraction, Named Entity Recognition (NER) | English
- KALIMAT Multipurpose Arabic Corpus | Summarization, Named Entity Recognition (NER), Part-of-Speech (POS) | Arabic
- HAREM | Named Entity Recognition (NER) | Portuguese
- BSNLP-2019 | Named Entity Recognition (NER), Entity Linking | Multi-Lingual
- WikiAnn | Named Entity Recognition (NER) | Multi-Lingual
- Conference on Computational Natural Language Learning (CoNLL 2002) | Named Entity Recognition (NER) | Spanish, Dutch
- Named Entity Model for German, Politics (NEMGP) | Named Entity Recognition (NER) | German
- Conference on Computational Natural Language Learning (CoNLL 2003) | Named Entity Recognition (NER), Part-of-Speech (POS) | German, English