Skip to content

BelleGroup/train_1M_CN

General NLPZHgpl-3.0

Created by BelleGroup at 2023, the BelleGroup/train_1M_CN is a General NLP dataset in ZH in Parquet format. With 1.2K downloads and 157 likes, it is actively used by the community. It is released under the gpl-3.0 license and is a 100K<n<1M-scale dataset.

About BelleGroup/train_1M_CN

内容 包含约100万条由BELLE项目生成的中文指令数据。 样例 { "instruction": "给定一个文字输入,将其中的所有数字加1。\n“明天的会议在9点开始,记得准时到达。”\n", "input": "", "output": "“明天的会议在10点开始,记得准时到达。”" } 字段: instruction: 指令 input: 输入(本数据集均为空) output: 输出 使用限制 仅...

Details

Task
General NLP
Language
ZH
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
BelleGroup
Year
2023
License
gpl-3.0
Downloads
1180
Likes
157
Download Homepage

Related General NLP datasets

FAQ