Skip to content

JW300

Machine TranslationMulti-Lingual

JW300 is a machine translation dataset in Multi-Lingual from Agic et al. with 105.11 records in XML format.

About JW300

Dataset is parallel corpus of over 300 languages with around 100 thousand parallel sentences per language pair on average.

Details

Task
Machine Translation
Language
Multi-Lingual
Format
XML
Rows / instances
105.11M
Creator
Agic et al.
Year
2019
Download Paper

Related Machine Translation datasets

FAQ