Skip to content

BAAI/IndustryCorpus2

General NLPEN, ZHapache-2.0

BAAI/IndustryCorpus2 is a General NLP-focused dataset in EN, ZH distributed in Parquet format. It is distributed under the apache-2.0 license and falls in the 100M<n<1B size category, and has been downloaded 8.8K times.

About BAAI/IndustryCorpus2

Industry models play a vital role in promoting the intelligent transformation and innovative development of enterprises. High-quality industry data is the key to improving the performance of large models and realizing the implementation of industr...

Details

Task
General NLP
Language
EN, ZH
Format
Parquet
Rows / instances
N/A
Size
100M<n<1B
Creator
BAAI
Year
2024
License
apache-2.0
Downloads
8785
Likes
73
Download Homepage

Related General NLP datasets

FAQ