Skip to content

google-research-datasets/paws-x

Text ClassificationDE, EN, ESBenchmark

Google-research-datasets/paws-x is a text classification benchmark dataset in DE, EN, ES from google-research-datasets with 373,807 records in Parquet format. It is distributed under the other license and falls in the 100K<n<1M size category, and has been downloaded 3.8K times.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About google-research-datasets/paws-x

Dataset Card for PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification Dataset Summary This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine translated training pairs in six typol...

Details

Task
Text Classification
Language
DE, EN, ES
Format
Parquet
Rows / instances
373807
Size
100K<n<1M
Creator
google-research-datasets
Year
2022
License
other
Downloads
3830
Likes
51
Download Homepage

Related Text Classification datasets

FAQ