Skip to content

applied-ai-018/pretraining_v1-omega_books

General NLPEnglish

The applied-ai-018/pretraining_v1-omega_books dataset is a English General NLP resource from applied-ai-018 at 2026 comprising 51,901,183 examples. With 364.5K downloads and 7 likes, it is actively used by the community and is a 100M<n<1B-scale dataset.

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
51901183
Size
100M<n<1B
Creator
applied-ai-018
Year
2026
Downloads
364474
Likes
7
Download Homepage

Related General NLP datasets

FAQ