Skip to content

DOGC

Text CorporaMachine TranslationCatalan, Spanish

Created by Tiedemann et al. at 2012, the DOGC is a text corpora dataset in Catalan, Spanish containing 21.87 records in XML format.

About DOGC

A collection of documents from the official journal of the Catalan Goverment in Catalan and Spanish.

Details

Task
Text Corpora, Machine Translation
Language
Catalan, Spanish
Format
XML
Rows / instances
21.87M
Creator
Tiedemann et al.
Year
2012
Download Paper

Related Text Corpora, Machine Translation datasets

FAQ