Skip to content

Google Books N-grams

ClassificationClusteringMulti-Lingual

Google Books N-grams is a classification dataset in Multi-Lingual from Google with 2.2 TB of text records in Text format.

Details

Task
Classification, Clustering
Language
Multi-Lingual
Format
Text
Rows / instances
2.2 TB of text
Creator
Google
Year
2011
Download Paper

Related Classification, Clustering datasets

FAQ