BlogSet-BR
Text CorporaText ClassificationPortuguese
The BlogSet-BR dataset is a Portuguese text corpora resource from Henrique et al. at 2018 comprising 7.4 examples.
About BlogSet-BR
This dataset is a collection of blog posts crawled from Blogspot platform, containing texts by brazilian authors.
Details
- Task
- Text Corpora, Text Classification
- Language
- Portuguese
- Format
- CSV
- Rows / instances
- 7.40M
- Creator
- Henrique et al.
- Year
- 2018