Paraphrase and Semantic Similarity in Twitter (PIT)
ClassificationEnglish
Paraphrase and Semantic Similarity in Twitter (PIT) is a classification-focused dataset in English that provides 18,762 labeled examples distributed in Text format.
About Paraphrase and Semantic Similarity in Twitter (PIT)
Dataset focuses on whether tweets have (almost) same meaning/information or not.
Details
- Task
- Classification
- Language
- English
- Format
- Text
- Rows / instances
- 18,762
- Creator
- Xu et al.
- Year
- 2015