Tencent AI Lab Embedding Corpus

Embeddings, Chinese

Created by Song et al. in 2018, the Tencent AI Lab Embedding Corpus is an embeddings dataset for Chinese containing over 8 million records in text format.

About Tencent AI Lab Embedding Corpus

The dataset provides 200-dimensional vector representations (embeddings) for over 8 million Chinese words and phrases.
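As a rough illustration of working with embeddings stored as text, the sketch below parses a small in-memory sample and compares two vectors by cosine similarity. It assumes a word2vec-style text layout (a header line with the entry count and dimension, then one token followed by its vector components per line); this page only states that the format is text, so check the layout against the actual download. The function names `load_text_embeddings` and `cosine` and the 3-dimensional sample are hypothetical and stand in for the real 200-dimensional data.

```python
import io
import math


def load_text_embeddings(stream, limit=None):
    """Parse word2vec-style text (an assumed layout, not confirmed by this page).

    First line: "<count> <dim>". Each later line: "<token> v1 v2 ... v_dim".
    Returns (dim, {token: [floats]}).
    """
    header = stream.readline().split()
    dim = int(header[1])
    vectors = {}
    for line in stream:
        if limit is not None and len(vectors) >= limit:
            break  # stop early; the full corpus has millions of entries
        # Split from the right so a token containing spaces stays intact.
        parts = line.rstrip().rsplit(" ", dim)
        if len(parts) != dim + 1:
            continue  # skip malformed or empty lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return dim, vectors


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Tiny hypothetical sample with 3 dimensions instead of 200.
sample = io.StringIO("2 3\n你好 1.0 0.0 0.0\n世界 0.0 1.0 0.0\n")
dim, vectors = load_text_embeddings(sample)
similarity = cosine(vectors["你好"], vectors["世界"])
```

For the real file, the same function could be given a handle opened with `open(path, encoding="utf-8")`, where `path` is wherever the corpus was saved. Passing a `limit` keeps memory use manageable, since holding all 8 million vectors as Python lists would be costly.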

Details

Task: Embeddings
Language: Chinese
Format: Text
Rows / instances: 8M
Creator: Song et al.
Year: 2018
