Which has the larger context window, GPT OSS 120B High Throughput or Qwen3 30B A3B Thinking 2507?

Qwen3 30B A3B Thinking 2507 has the larger context window (262K tokens).

GPT OSS 120B High Throughput vs Qwen3 30B A3B Thinking 2507

Q: Is GPT OSS 120B High Throughput or Qwen3 30B A3B Thinking 2507 cheaper?

GPT OSS 120B High Throughput is cheaper on output tokens ($0.36 vs $1.30 per 1M).

GPT OSS 120B High Throughput is cheaper on output tokens, while Qwen3 30B A3B Thinking 2507 offers a larger context window. Choose GPT OSS 120B High Throughput or Qwen3 30B A3B Thinking 2507 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	GPT OSS 120B High Throughput	Qwen3 30B A3B Thinking 2507
Provider	Clarifai	Clarifai
Input / 1M tokens	$0.09	$0.36
Output / 1M tokens	$0.36	$1.30
Context window	131K	262K
Parameters	—	—
Open weights	Yes	Yes
Released	Aug 2025	Jul 2025

GPT OSS 120B High Throughput details →Qwen3 30B A3B Thinking 2507 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.