Watan-2004 Corpus
Text CorporaArabic
Created by Abbas et al. at 2004, the Watan-2004 Corpus is a text corpora dataset in Arabic containing 20 records in HTML format.
About Watan-2004 Corpus
Dataset contains about 20,000 articles talking about 6 topics: culture, religion, economy, local news, international news and sports.
Details
- Task
- Text Corpora
- Language
- Arabic
- Format
- HTML
- Rows / instances
- 20
- Creator
- Abbas et al.
- Year
- 2004