Skip to content

data-is-better-together/fineweb-c

Text ClassificationLVS, KOR, KIN

Data-is-better-together/fineweb-c is a text classification-focused dataset in LVS, KOR, KIN that provides 89,699 labeled examples distributed in Parquet format. And falls in the 10K<n<100K size category, and has been downloaded 2.5K times.

About data-is-better-together/fineweb-c

FineWeb-C: Educational content in many languages, labelled by the community Multilingual data is better together! Note: We are not actively working on this project anymore. You can continue to contribute annotations and we'll occasio...

Details

Task
Text Classification
Language
LVS, KOR, KIN
Format
Parquet
Rows / instances
89699
Size
10K<n<100K
Creator
data-is-better-together
Year
2024
Downloads
2530
Likes
60
Download Homepage

Related Text Classification datasets

FAQ