Skip to content

allenai/real-toxicity-prompts

General NLPENapache-2.0

The allenai/real-toxicity-prompts dataset is a EN General NLP resource from allenai at 2022. With 10.8K downloads and 121 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 10K<n<100K-scale dataset.

About allenai/real-toxicity-prompts

Dataset Card for Real Toxicity Prompts Dataset Summary RealToxicityPrompts is a dataset of 100k sentence snippets from the web for researchers to further address the risk of neural toxic degeneration in models. Languages E...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Size
10K<n<100K
Creator
allenai
Year
2022
License
apache-2.0
Downloads
10826
Likes
121
Download Homepage

Related General NLP datasets

FAQ