BG Datasets
We catalog 5 datasets for NLP and machine learning that include Bulgarian (BG) data. Browse the list below or narrow down by task.
Updated June 2026
- dennlinger/eur-lex-sum: Translation, Summarization (BG, HR, CS)
- FBK-MT/mosel: Automatic Speech Recognition, Text To Speech (EN, BG, HR)
- neulab/PangeaInstruct: Visual Question Answering, Question Answering (AM, AR, BG)
- Helsinki-NLP/open_subtitles: Translation (AF, AR, BG)
- papluca/language-identification: Text Classification (AR, BG, DE)