Tencent AI Lab Embedding Corpus
EmbeddingsChinese
Created by Song et al. at 2018, the Tencent AI Lab Embedding Corpus is a embeddings dataset in Chinese containing 8 records in Text format.
About Tencent AI Lab Embedding Corpus
Dataset provides 200-dimension vector representations, a.k.a. embeddings, for over 8 million Chinese words and phrases.
Details
- Task
- Embeddings
- Language
- Chinese
- Format
- Text
- Rows / instances
- 8M
- Creator
- Song et al.
- Year
- 2018