BelleGroup/train_1M_CN
General NLP · ZH · gpl-3.0
BelleGroup/train_1M_CN is a general NLP dataset in Chinese (ZH), created by BelleGroup in 2023 and distributed in Parquet format. It has 1,180 downloads and 157 likes. It is released under the gpl-3.0 license and is listed in the 100K<n<1M size category.
About BelleGroup/train_1M_CN
Contents
Contains approximately 1 million Chinese instruction examples generated by the BELLE project.
Example
{
"instruction": "给定一个文字输入,将其中的所有数字加1。\n“明天的会议在9点开始,记得准时到达。”\n",
"input": "",
"output": "“明天的会议在10点开始,记得准时到达。”"
}
Fields:
instruction: the instruction
input: the input (empty for every record in this dataset)
output: the output
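The three-field record layout can be checked with a few lines of Python. The following is a minimal sketch using only the standard library and the sample record shown above. The helper names `validate_record` and `build_prompt` are hypothetical, and joining `instruction` and `input` with a blank line is an assumption made for illustration, not a convention stated by the dataset authors.

```python
import json

# Sample record copied from the example above (raw string keeps the JSON escapes intact).
RAW_RECORD = r'''
{
  "instruction": "给定一个文字输入,将其中的所有数字加1。\n“明天的会议在9点开始,记得准时到达。”\n",
  "input": "",
  "output": "“明天的会议在10点开始,记得准时到达。”"
}
'''

EXPECTED_FIELDS = ("instruction", "input", "output")


def validate_record(record: dict) -> None:
    """Raise ValueError unless the record has exactly the three string fields."""
    if set(record) != set(EXPECTED_FIELDS):
        raise ValueError(f"unexpected fields: {sorted(record)}")
    for name in EXPECTED_FIELDS:
        if not isinstance(record[name], str):
            raise ValueError(f"field {name!r} must be a string")


def build_prompt(record: dict) -> str:
    """Hypothetical helper: combine instruction and (optional) input into one prompt.

    Assumption: a blank line separates the two parts when `input` is non-empty.
    """
    instruction = record["instruction"].strip()
    extra = record["input"].strip()
    return instruction if not extra else f"{instruction}\n\n{extra}"


record = json.loads(RAW_RECORD)
validate_record(record)
prompt = build_prompt(record)
```

Because `input` is empty in this dataset, `build_prompt` simply returns the stripped instruction here; the two-part branch only matters if the same helper is reused on data where `input` is populated.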
Usage restrictions
Only...
Details
- Task: General NLP
- Language: ZH
- Format: Parquet
- Rows / instances: N/A
- Size: 100K<n<1M
- Creator: BelleGroup
- Year: 2023
- License: gpl-3.0
- Downloads: 1180
- Likes: 157