data-is-better-together/fineweb-c
Text ClassificationLVS, KOR, KIN
Data-is-better-together/fineweb-c is a text classification-focused dataset in LVS, KOR, KIN that provides 89,699 labeled examples distributed in Parquet format. And falls in the 10K<n<100K size category, and has been downloaded 2.5K times.
About data-is-better-together/fineweb-c
FineWeb-C: Educational content in many languages, labelled by the community
Multilingual data is better together!
Note: We are not actively working on this project anymore. You can continue to contribute annotations and we'll occasio...
Details
- Task
- Text Classification
- Language
- LVS, KOR, KIN
- Format
- Parquet
- Rows / instances
- 89699
- Size
- 10K<n<100K
- Creator
- data-is-better-together
- Year
- 2024
- Downloads
- 2530
- Likes
- 60