Skip to content

BlogSet-BR

Text CorporaText ClassificationPortuguese

The BlogSet-BR dataset is a Portuguese text corpora resource from Henrique et al. at 2018 comprising 7.4 examples.

About BlogSet-BR

This dataset is a collection of blog posts crawled from Blogspot platform, containing texts by brazilian authors.

Details

Task
Text Corpora, Text Classification
Language
Portuguese
Format
CSV
Rows / instances
7.40M
Creator
Henrique et al.
Year
2018
Download Paper

Related Text Corpora, Text Classification datasets

FAQ