tasksource/mmlu
Text ClassificationMultiple ChoiceQuestion AnsweringENBenchmark
Tasksource/mmlu is a text classification benchmark dataset in EN from tasksource in Parquet format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About tasksource/mmlu
MMLU (hendrycks_test on huggingface) without auxiliary train. It is much lighter (7MB vs 162MB) and faster than the original implementation, in which auxiliary train is loaded (+ duplicated!) by default for all the configs in the original version,...
Details
- Task
- Text Classification, Multiple Choice, Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- tasksource
- Year
- 2023