BAAI/IndustryCorpus2
General NLP | EN, ZH | apache-2.0
BAAI/IndustryCorpus2 is a general NLP dataset in English and Chinese (EN, ZH), distributed in Parquet format under the apache-2.0 license. It falls in the 100M<n<1B size category and has been downloaded about 8.8K times.
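Because the corpus falls in the 100M<n<1B size category, streaming a few rows is usually more practical than downloading everything. The following is a minimal sketch, not an official loading recipe. It assumes the dataset is hosted on the Hugging Face Hub under the repo ID `BAAI/IndustryCorpus2`, that the `datasets` library is installed, and that a `train` split exists. The helper name `peek_rows` is hypothetical, and the column names are not listed on this page, so the code does not assume any.

```python
from itertools import islice

# Repo ID taken from the page title; assumed to be a Hugging Face Hub dataset.
REPO_ID = "BAAI/IndustryCorpus2"


def peek_rows(n=3, split="train"):
    """Return the first `n` rows without downloading the whole corpus.

    `split="train"` is an assumption; adjust it if the repo uses another
    split name or requires a configuration name.
    """
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    # streaming=True yields rows lazily instead of materializing all
    # Parquet shards on disk first.
    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(stream, n))
```

Calling `peek_rows(2)` requires network access. Printing `sorted(row.keys())` for each returned row is a quick way to discover the actual schema.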
About BAAI/IndustryCorpus2
Industry models play a vital role in promoting the intelligent transformation and innovative development of enterprises. High-quality industry data is the key to improving the performance of large models and realizing the implementation of industr...
Details
- Task: General NLP
- Language: EN, ZH
- Format: Parquet
- Rows / instances: N/A
- Size: 100M<n<1B
- Creator: BAAI
- Year: 2024
- License: apache-2.0
- Downloads: 8785
- Likes: 73
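The size label `100M<n<1B` is a range tag rather than an exact row count, which is consistent with the row count being listed as N/A. As a small illustration, the sketch below converts such a two-sided label into numeric bounds. The function name `parse_size_category` and the suffix table are assumptions made for this example, and one-sided labels such as `n<1K` are deliberately not handled.

```python
import re

# Multipliers for the suffixes used in size labels (assumed convention).
_SUFFIX = {"": 1, "K": 10**3, "M": 10**6, "B": 10**9, "T": 10**12}

_PATTERN = re.compile(r"(\d+)([KMBT]?)<n<(\d+)([KMBT]?)")


def parse_size_category(label):
    """Convert a label like '100M<n<1B' into (lower, upper) integer bounds."""
    match = _PATTERN.fullmatch(label)
    if match is None:
        raise ValueError(f"unsupported size label: {label!r}")
    low = int(match.group(1)) * _SUFFIX[match.group(2)]
    high = int(match.group(3)) * _SUFFIX[match.group(4)]
    return low, high


low, high = parse_size_category("100M<n<1B")
print(low, high)  # 100000000 1000000000
```

In other words, the dataset is tagged as having between 100 million and 1 billion instances.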