GPT3-2.7B (FlashAttention-2)
Stanford University, Princeton University | Language modeling/generation
GPT3-2.7B (FlashAttention-2) is a language modeling/generation model published by Stanford University and Princeton University in 2023, with 2.7 billion parameters.
About GPT3-2.7B (FlashAttention-2)
Scaling Transformers to longer sequence lengths has been a major problem in the last several years, promising to improve performance in language modeling and high-resolution image understanding, as well as to unlock new applications in code, audio, a
Details
- Provider: Stanford University, Princeton University
- Task: Language modeling/generation
- Parameters: 2.7 billion (2,700,000,000)
- Released: 2023-07-18
- Open weights: No