codeparrot/self-instruct-starcoder
General NLPENBenchmarkbigscience-openrail-m
Created by codeparrot at 2023, the codeparrot/self-instruct-starcoder is a General NLP benchmark dataset in EN containing 9,631 records in Parquet format. With 313 downloads and 63 likes, it is actively used by the community. It is released under the bigscience-openrail-m license and is a 1K<n<10K-scale dataset.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About codeparrot/self-instruct-starcoder
Self-instruct-starcoder
Summary
Self-instruct-starcoder is a dataset that was generated by prompting starcoder to generate new instructions based on some human-written seed instructions.
The underlying process is explained in the pap...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- 9631
- Size
- 1K<n<10K
- Creator
- codeparrot
- Year
- 2023
- License
- bigscience-openrail-m
- Downloads
- 313
- Likes
- 63