Skip to content

BAAI/COIG

General NLPZHapache-2.0

BAAI/COIG is a General NLP dataset in ZH from BAAI in Parquet format. It is distributed under the apache-2.0 license and falls in the 100K<n<1M size category, and has been downloaded 800 times.

About BAAI/COIG

We propose the Chinese Open Instruction Generalist (COIG) project to maintain a harmless, helpful, and diverse set of Chinese instruction corpora. We welcome all researchers in the community to contribute to the corpus set and collaborate with us....

Details

Task
General NLP
Language
ZH
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
BAAI
Year
2023
License
apache-2.0
Downloads
800
Likes
460
Download Homepage

Related General NLP datasets

FAQ