Llama 3.2 3b Instruct vs Gemma 4 31B Garnet
Llama 3.2 3b Instruct is cheaper on output tokens, while Gemma 4 31B Garnet offers a larger context window. Choose Llama 3.2 3b Instruct or Gemma 4 31B Garnet based on the trade-off between cost, context, and the benchmarks that matter for your use case.
| Spec | Llama 3.2 3b Instruct | Gemma 4 31B Garnet |
|---|---|---|
| Provider | NanoGPT | NanoGPT |
| Input / 1M tokens | $0.03 | $0.31 |
| Output / 1M tokens | $0.05 | $0.31 |
| Context window | 131K | 262K |
| Parameters | — | — |
| Open weights | No | No |
| Released | Sep 2024 | May 2026 |
FAQ
Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.