Is Qwen3 235B A22B Thinking 2507 or GLM 4.5 FP8 cheaper?

Qwen3 235B A22B Thinking 2507 is cheaper on output tokens ($0.60 vs $0.80 per 1M).

Which has the larger context window, Qwen3 235B A22B Thinking 2507 or GLM 4.5 FP8?

Qwen3 235B A22B Thinking 2507 has the larger context window (262K tokens).

Qwen3 235B A22B Thinking 2507 vs GLM 4.5 FP8

Qwen3 235B A22B Thinking 2507 is cheaper on output tokens, while Qwen3 235B A22B Thinking 2507 offers a larger context window. Choose Qwen3 235B A22B Thinking 2507 or GLM 4.5 FP8 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3 235B A22B Thinking 2507	GLM 4.5 FP8
Provider	submodel	submodel
Input / 1M tokens	$0.20	$0.20
Output / 1M tokens	$0.60	$0.80
Context window	262K	131K
Parameters	—	—
Open weights	Yes	Yes
Released	Aug 2025	Jul 2025

Qwen3 235B A22B Thinking 2507 details →GLM 4.5 FP8 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.