Skip to content

arcee-ai/EvolKit-20k

General NLPEnglishBenchmarkmit

Arcee-ai/EvolKit-20k is a General NLP-focused benchmark dataset in English distributed in Parquet format. It is distributed under the mit license and falls in the 10K<n<100K size category, and has been downloaded 86 times.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About arcee-ai/EvolKit-20k

EvolKit-20k This is a subset of a larger dataset generated for the purpose of training our Llama-3.1-SuperNova model. It utilized our EvolKit repository: https://github.com/arcee-ai/EvolKit.

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
10K<n<100K
Creator
arcee-ai
Year
2024
License
mit
Downloads
86
Likes
62
Download Homepage

Related General NLP datasets

FAQ