google-research-datasets/paws-x
Text ClassificationDE, EN, ESBenchmark
Google-research-datasets/paws-x is a text classification benchmark dataset in DE, EN, ES from google-research-datasets with 373,807 records in Parquet format. It is distributed under the other license and falls in the 100K<n<1M size category, and has been downloaded 3.8K times.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About google-research-datasets/paws-x
Dataset Card for PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification
Dataset Summary
This dataset contains 23,659 human translated PAWS evaluation pairs and
296,406 machine translated training pairs in six typol...
Details
- Task
- Text Classification
- Language
- DE, EN, ES
- Format
- Parquet
- Rows / instances
- 373807
- Size
- 100K<n<1M
- Creator
- google-research-datasets
- Year
- 2022
- License
- other
- Downloads
- 3830
- Likes
- 51