
Chinchilla

By Google
Chinchilla is a dense decoder-only Transformer built to test compute-optimal scaling. Instead of pushing parameter count ever higher, DeepMind kept the model moderate in size and dramatically increased the training corpus, landing near an optimal ratio of about 20 training tokens per parameter. Trained on roughly 1.4 trillion tokens with around 70 billion parameters, it outperformed larger predecessors such as Gopher on a wide range of benchmarks, while being faster and cheaper to serve at inference. The result reshaped industry practice: for a fixed training budget, allocate more data and fewer parameters, aim for long training runs with strong regularization, and you can get better generalization, stronger few-shot performance, and more practical deployment costs. Chinchilla’s findings influenced later model families that emphasized token budgets, data quality, and extended pretraining over sheer parameter scale.
Released: March 29, 2022

Overview

Chinchilla is DeepMind’s 2022 language model that showed smaller models trained on far more tokens can beat much larger ones. It has about 70B parameters and was trained on roughly 1.4T tokens, setting a new compute-optimal recipe and improving accuracy while cutting inference cost.
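The compute-optimal recipe above can be sketched numerically. The snippet uses the widely cited ~20-tokens-per-parameter heuristic and the common C ≈ 6·N·D approximation for training FLOPs; the constants are rough rules of thumb distilled from the paper, not DeepMind's exact fitted scaling laws.

```python
# Sketch of the Chinchilla compute-optimal heuristic:
# for a fixed training budget, use roughly 20 training tokens per parameter.
TOKENS_PER_PARAM = 20  # approximate ratio reported for Chinchilla

def optimal_tokens(n_params: float) -> float:
    """Training tokens suggested by the ~20:1 heuristic for N parameters."""
    return TOKENS_PER_PARAM * n_params

def optimal_params(compute_flops: float) -> float:
    """Given C ~ 6*N*D and D ~ 20*N, solve C ~ 120*N^2 for N."""
    return (compute_flops / (6 * TOKENS_PER_PARAM)) ** 0.5

# Chinchilla itself: 70B parameters -> ~1.4T training tokens
print(optimal_tokens(70e9))   # 1.4e12
```

Plugging the implied budget back in (C ≈ 120 × (70e9)² FLOPs) recovers the 70B-parameter choice, which is the point of the recipe: data and parameters are set jointly by the compute budget rather than maximizing parameter count.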

About Google

At Google, we think that AI can meaningfully improve people's lives and that the biggest impact will come when everyone can access it.

Industry: Research
Company Size: 182,000-190,000
Location: Mountain View, CA, US
Website: ai.google

Tools using Chinchilla

Last updated: February 18, 2026