Best LLMs for Coding
The best LLMs for coding are ranked here by their scores on coding benchmarks — measuring code generation, bug fixing, and software-engineering ability. A higher score means stronger performance on programming tasks. Pair benchmark strength with context window (for large codebases) and price when choosing a coding model.
Ranked list (top 25)
| # | Model | Coding score | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|---|
| 1 | OpenAI o3-pro (2025-06-10) NanoGPT | 84.9 | $10.00 | $19.99 | 200K |
| 2 | Google: Gemini 2.5 Pro Preview 06-05 Google | 83.1 | $1.25 | $10.00 | 1M |
| 3 | Doubao-Seed-Code ZenMux | 78.8 | $0.17 | $1.12 | 256K |
| 4 | Google: Gemini 2.5 Pro Preview 05-06 Google | 76.9 | $1.25 | $10.00 | 1M |
| 5 | Claude 4 Sonnet NanoGPT | 76.8 | $2.99 | $14.99 | 200K |
| 6 | Gemini 3 Flash Thinking NanoGPT | 75.8 | $0.50 | $3.00 | 1M |
| 7 | MiniMax M2.5 NanoGPT | 75.8 | $0.30 | $1.20 | 205K |
| 8 | Anthropic: Claude Opus 4.6 (Fast) Anthropic | 75.6 | $30.00 | $150.00 | 1M |
| 9 | GPT 4.1 NanoGPT | 74.6 | $2.00 | $8.00 | 1M |
| 10 | OpenAI o4-mini high NanoGPT | 74.4 | $1.10 | $4.40 | 200K |
| 11 | DeepSeek: DeepSeek V3.2 DeepSeek | 74.2 | $0.23 | $0.34 | 131K |
| 12 | Claude 4 Opus Thinking (1K) NanoGPT | 73.2 | $14.99 | $75.00 | 200K |
| 13 | GPT-5.1 Poe | 66.0 | $1.10 | $9.00 | 400K |
| 14 | Qwen: Qwen3 Coder 30B A3B Instruct Qwen | 60.4 | $0.07 | $0.27 | 160K |
| 15 | Devstral Small MistralOpen weights | 56.4 | $0.10 | $0.30 | 128K |
| 16 | gemini-2.5-flash-preview-05-20 Jiekou.AI | 55.1 | $0.14 | $3.15 | 1M |
| 17 | Devstral 2 MistralOpen weights | 53.8 | $0.40 | $2.00 | 262K |
| 18 | Grok 3 Mini GitHub Models | 49.3 | $0.00 | $0.00 | 128K |
| 19 | Mistral Devstral Small 2505 NanoGPT | 46.8 | $0.06 | $0.06 | 33K |
| 20 | chatgpt-4o-latest 302.AI | 45.3 | $5.00 | $15.00 | 128K |
| 21 | Amazon: Nova Premier 1.0 Amazon | 42.4 | $2.50 | $12.50 | 1M |
| 22 | Qwen: Qwen3 32B Qwen | 40.0 | $0.08 | $0.28 | 131K |
| 23 | GPT 4.1 Mini NanoGPT | 32.4 | $0.40 | $1.60 | 1M |
| 24 | Gemini 2.5 Flash Preview NanoGPT | 28.7 | $0.15 | $0.60 | 1M |
| 25 | Qwen Max Alibaba (China) | 21.8 | $0.35 | $1.38 | 131K |
Prices are per 1M tokens (USD); confirm with the provider. Updated regularly.