Is GPT OSS 120B or Qwen3 Embedding 8B cheaper?

Qwen3 Embedding 8B is cheaper on output tokens ($0.12 vs $0.92 per 1M).

Which has the larger context window, GPT OSS 120B or Qwen3 Embedding 8B?

GPT OSS 120B has the larger context window (66K tokens).

GPT OSS 120B vs Qwen3 Embedding 8B

Qwen3 Embedding 8B is cheaper on output tokens, while GPT OSS 120B offers a larger context window. Choose GPT OSS 120B or Qwen3 Embedding 8B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	GPT OSS 120B	Qwen3 Embedding 8B
Provider	evroc	evroc
Input / 1M tokens	$0.23	$0.12
Output / 1M tokens	$0.92	$0.12
Context window	66K	41K
Parameters	117B	—
Open weights	Yes	Yes
Released	Aug 2025	Jul 2025

GPT OSS 120B details →Qwen3 Embedding 8B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.