Skip to content

DPO on Pythia-2.8B

Stanford UniversityCZ Biohub NetworkLanguage modeling/generationQuestion answering

DPO on Pythia-2.8B is language modeling/generation model published by Stanford University,CZ Biohub Network in 2023 featuring 2800000000.0 parameters.

About DPO on Pythia-2.8B

While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of their behavior is difficult due to the completely unsupervised nature of their training. Existing methods for gai

Details

Provider
Stanford University,CZ Biohub Network
Task
Language modeling/generation,Question answering
Parameters
2800000000.0
Released
2023-05-29
Open weights
No
View model source

Explore

FAQ