
GPT3-2.7B (FlashAttention-2)

Stanford University · Princeton University · Language modeling/generation

GPT3-2.7B (FlashAttention-2) is a language modeling/generation model published by Stanford University and Princeton University in 2023, with 2.7 billion parameters.

About GPT3-2.7B (FlashAttention-2)

Scaling Transformers to longer sequence lengths has been a major problem in the last several years, promising to improve performance in language modeling and high-resolution image understanding, as well as to unlock new applications in code, audio, a

Details

Provider: Stanford University, Princeton University
Task: Language modeling/generation
Parameters: 2,700,000,000 (2.7B)
Released: 2023-07-18
Open weights: No
