GPT OSS 120B High Throughput vs Qwen3 30B A3B Thinking 2507

GPT OSS 120B High Throughput is cheaper on both input tokens ($0.09 vs $0.36 per 1M) and output tokens ($0.36 vs $1.30 per 1M), while Qwen3 30B A3B Thinking 2507 offers a larger context window (262K vs 131K tokens). Choose between them based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec | GPT OSS 120B High Throughput | Qwen3 30B A3B Thinking 2507
Provider | Clarifai | Clarifai
Input / 1M tokens | $0.09 | $0.36
Output / 1M tokens | $0.36 | $1.30
Context window | 131K | 262K
Parameters | not listed | not listed
Open weights | Yes | Yes
Released | Aug 2025 | Jul 2025
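
To turn the per-million-token prices into a per-request figure, the arithmetic can be sketched as below. This is a minimal illustration, not a provider API: the `ModelPricing` class, its field names, and the 10,000-input / 2,000-output workload are hypothetical. It also assumes that "131K" and "262K" mean roughly 131,000 and 262,000 tokens and that input and output tokens both count toward the context window, neither of which the page confirms.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Hypothetical container for one column of the comparison table."""
    name: str
    input_usd_per_million: float
    output_usd_per_million: float
    context_window_tokens: int  # assumption: "131K" ~ 131_000, "262K" ~ 262_000

    def cost_usd(self, input_tokens: int, output_tokens: int) -> float:
        """Cost of one request: tokens times price per token, summed over both directions."""
        total = (input_tokens * self.input_usd_per_million
                 + output_tokens * self.output_usd_per_million)
        return total / 1_000_000

    def fits(self, input_tokens: int, output_tokens: int) -> bool:
        """Assumption: input and output tokens share the context window."""
        return input_tokens + output_tokens <= self.context_window_tokens


# Prices and context sizes copied from the table; indicative only.
GPT_OSS = ModelPricing("GPT OSS 120B High Throughput", 0.09, 0.36, 131_000)
QWEN3 = ModelPricing("Qwen3 30B A3B Thinking 2507", 0.36, 1.30, 262_000)

if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens per request.
    for model in (GPT_OSS, QWEN3):
        cost = model.cost_usd(10_000, 2_000)
        print(f"{model.name}: ${cost:.5f} per request, fits={model.fits(10_000, 2_000)}")
```

The same helper can be called with your own token counts. A prompt near 200,000 tokens, for example, would pass `fits` only for the model with the larger window under these assumptions.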

Pricing is indicative; confirm with the provider before production use.