allenai/soda
General NLPENcc-by-4.0
Allenai/soda is a General NLP-focused dataset in EN distributed in Parquet format. It is distributed under the cc-by-4.0 license and falls in the 1M<n<10M size category, and has been downloaded 1.6K times.
About allenai/soda
Dataset Card for 🥤SODA
Dataset Summary
🥤SODA is the first publicly available, million-scale, high-quality dialogue dataset covering a wide range of social interactions. Dialogues are distilled from a PLM (InstructGPT; Ouyang et al., ...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- allenai
- Year
- 2023
- License
- cc-by-4.0
- Downloads
- 1581
- Likes
- 154