Is Qwen3 Embedding 8B or GPT OSS 120B cheaper?

Qwen3 Embedding 8B is cheaper on output tokens ($0.12 vs $0.92 per 1M).

Which has the larger context window, Qwen3 Embedding 8B or GPT OSS 120B?

GPT OSS 120B has the larger context window (66K tokens).

Qwen3 Embedding 8B vs GPT OSS 120B

Qwen3 Embedding 8B is cheaper on output tokens, while GPT OSS 120B offers a larger context window. Choose Qwen3 Embedding 8B or GPT OSS 120B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3 Embedding 8B	GPT OSS 120B
Provider	evroc	evroc
Input / 1M tokens	$0.12	$0.23
Output / 1M tokens	$0.12	$0.92
Context window	41K	66K
Parameters	—	117B
Open weights	Yes	Yes
Released	Jul 2025	Aug 2025

Qwen3 Embedding 8B details →GPT OSS 120B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.