Skip to content

Chinchilla

DeepMindLanguage modeling

Developed by DeepMind in 2022, Chinchilla is a language modeling model with 70000000000.0 parameters.

About Chinchilla

We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus on scaling

Details

Provider
DeepMind
Task
Language modeling
Parameters
70000000000.0
Released
2022-03-29
Open weights
No
View model source

Explore

FAQ