Paraphrase Adversaries from Word Scrambling (PAWS-X)

Paraphrasing Identification, Multi-Lingual

The Paraphrase Adversaries from Word Scrambling (PAWS-X) dataset is a multilingual paraphrase identification resource by Yang et al. (2019) comprising more than 300,000 examples.

About Paraphrase Adversaries from Word Scrambling (PAWS-X)

The dataset contains 23,659 human-translated PAWS evaluation pairs and 296,406 machine-translated training pairs in six typologically distinct languages: French, Spanish, German, Chinese, Japanese, and Korean. All translated pairs are sourced from examples in PAWS-Wiki.

Details

Task: Paraphrasing Identification
Language: Multi-Lingual
Format: TSV
Rows / instances: 300,000+
Creator: Yang et al.
Year: 2019
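
Since the data is distributed as TSV, a minimal sketch of reading sentence pairs may be useful. It assumes a header row with the columns id, sentence1, sentence2, and label (1 for paraphrase, 0 otherwise); this page does not state that layout, so check it against the downloaded files. The sample rows and the read_pairs helper are hypothetical and only illustrate the parsing.

```python
import csv
import io

# Hypothetical sample rows in the ASSUMED column layout (not taken from the dataset).
SAMPLE_TSV = (
    "id\tsentence1\tsentence2\tlabel\n"
    "1\tThe cat chased the dog.\tThe dog chased the cat.\t0\n"
    "2\tShe said \"hello\" to him.\tTo him she said \"hello\".\t1\n"
)


def read_pairs(handle):
    """Parse tab-separated sentence pairs from an open text handle."""
    # QUOTE_NONE keeps literal quotation marks inside sentences intact
    # instead of treating them as CSV quoting characters.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [
        {
            "id": row["id"],
            "sentence1": row["sentence1"],
            "sentence2": row["sentence2"],
            "is_paraphrase": row["label"] == "1",
        }
        for row in reader
    ]


pairs = read_pairs(io.StringIO(SAMPLE_TSV))
for pair in pairs:
    print(pair["id"], pair["is_paraphrase"], pair["sentence1"])
```

For a real file, the same helper could be called on open(path, encoding="utf-8", newline=""), where path points to one of the downloaded TSV files.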