mlabonne/orpo-dpo-mix-40k
Text GenerationEN
Mlabonne/orpo-dpo-mix-40k is a text generation-focused dataset in EN distributed in Parquet format.
About mlabonne/orpo-dpo-mix-40k
ORPO-DPO-mix-40k v1.2
This dataset is designed for ORPO or DPO training.
See Fine-tune Llama 3 with ORPO for more information about how to use it.
It is a combination of the following high-quality DPO datasets:
argilla/Capybara-Preferences: h...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- mlabonne
- Year
- 2024