
LiveBench Leaderboard

reasoning/coding · 71 models · Updated June 2026

On the LiveBench benchmark, GPT-5.5 ranks #1 with a score of 81.3, while Qwen: Qwen3 235B A22B Thinking 2507 offers the best score-per-dollar at $0.10/1M output tokens. The full ranking, with cost per million tokens, is below.

💰 Best value
Qwen: Qwen3 235B A22B Thinking 2507 — score 52.9 at $0.10/1M output tokens
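The page does not state how "score-per-dollar" is computed. A minimal sketch, assuming it is simply the LiveBench score divided by the output price per 1M tokens, and that models with no listed price or a $0.00 price are excluded because the ratio is undefined for them. The function names `score_per_dollar` and `best_value` are hypothetical, and the rows are a small sample copied from the ranking.

```python
# Hypothetical sketch: the page does not publish its exact formula.
# Each row is (model, score, output price in USD per 1M tokens),
# copied from the leaderboard. None means no price is listed.
rows = [
    ("Qwen: Qwen3 235B A22B Thinking 2507", 52.9, 0.10),
    ("DeepSeek: DeepSeek V3.2", 63.1, 0.34),
    ("Gemma 4 31B IT", 62.4, 0.35),
    ("Z.ai: GLM 5.2", 76.2, 3.00),
    ("Qwen3.6 Plus", 70.8, 0.00),  # listed at $0.00, so the ratio is undefined
    ("GPT-5.5", 81.3, None),       # no price listed
]


def score_per_dollar(score, price):
    """Return score / price, or None when the price is missing or not positive."""
    if price is None or price <= 0:
        return None
    return score / price


def best_value(rows):
    """Return the (model, ratio) pair with the highest score-per-dollar."""
    ranked = [
        (model, score_per_dollar(score, price))
        for model, score, price in rows
    ]
    # Drop models whose ratio could not be computed.
    ranked = [(model, ratio) for model, ratio in ranked if ratio is not None]
    return max(ranked, key=lambda pair: pair[1])


model, ratio = best_value(rows)
print(f"{model}: about {ratio:.0f} score points per dollar")
```

Under these assumptions the sample reproduces the page's pick: the $0.10 price outweighs the higher scores of the more expensive models. A different treatment of free or unpriced models would change the result.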
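| # | Model | Provider | Score | Output / 1M | Context |
|---|---|---|---|---|---|
| 1 | GPT-5.5 | OpenAI | 81.3 | | |
| 2 | GPT-5.4 | OpenAI | 80.9 | | |
| 3 | Gemini 3.1 Pro | Google DeepMind | 80.7 | | |
| 4 | Claude Opus 4.8 Thinking | NanoGPT | 79.5 | $25.01 | 1M |
| 5 | Claude Fable 5 | Anthropic | 78.6 | | |
| 6 | Claude Opus 4.7 | Anthropic | 77.1 | | |
| 7 | Anthropic: Claude Opus 4.6 (Fast) | Anthropic | 76.8 | $150.00 | 1M |
| 8 | Z.ai: GLM 5.2 | Z.ai | 76.2 | $3.00 | 1M |
| 9 | Claude Opus 4.5 | Anthropic | 76.0 | | |
| 10 | Gemini 3.5 Flash | NanoGPT | 75.8 | $9.00 | 1M |
| 11 | Claude Sonnet 4.6 Thinking | NanoGPT | 75.7 | $14.99 | 1M |
| 12 | GPT-5.2 | OpenAI | 75.4 | | |
| 13 | Qwen3.7 Max Thinking | NanoGPT | 75.2 | $7.50 | 1M |
| 14 | DeepSeek-V4-Pro | DeepSeek | 74.4 | | |
| 15 | GPT-5.1-Codex-Max | OpenAI | 74.4 | | |
| 16 | GPT-5.2 Codex | OpenAI | 74.3 | | |
| 17 | Gemini 3 Pro | Google DeepMind | 73.5 | | |
| 18 | GPT-5.3 Codex | OpenAI | 73.2 | | |
| 19 | Gemini 3 Flash Thinking | NanoGPT | 73.0 | $3.00 | 1M |
| 20 | GPT-5.1 | Poe | 72.6 | $9.00 | 400K |
| 21 | Kimi K2.6 | Moonshot | 72.4 | | |
| 22 | MoonshotAI: Kimi K2.7 Code | MoonshotAI | 71.9 | $4.00 | 262K |
| 23 | GPT 5.4 Nano | NanoGPT | 71.3 | $1.25 | 400K |
| 24 | GPT-5 Pro | OpenAI | 71.3 | | |
| 25 | Qwen3.6 Plus | Alibaba Token Plan | 70.8 | $0.00 | 1M |
| 26 | GLM-5.1 | Z.ai (Zhipu AI) | 70.6 | | |
| 27 | MiniMax M3 Thinking | NanoGPT | 70.0 | $1.20 | 512K |
| 28 | Grok Build 0.1 | NanoGPT | 69.6 | $2.00 | 256K |
| 29 | GPT-5.1-Codex | OpenAI | 69.3 | | |
| 30 | Kimi K2.5 | Moonshot | 69.2 | | |
| 31 | Grok 4.20 (Reasoning) | xAI | 69.0 | $2.50 | 1M |
| 32 | GLM-5 | Z.ai (Zhipu AI) | 68.7 | | |
| 33 | Claude Sonnet 4.5 | Anthropic | 67.9 | | |
| 34 | GPT 5.4 Mini | NanoGPT | 67.7 | $4.50 | 400K |
| 35 | DeepSeek-V4-Flash | DeepSeek | 67.7 | | |
| 36 | Grok 4.3 | NanoGPT | 67.4 | $2.50 | 1M |
| 37 | GPT-5 mini | OpenAI | 66.6 | | |
| 38 | Qwen: Qwen3.6 27B | Qwen | 65.6 | $2.39 | 262K |
| 39 | MiniMax M2.7 | NanoGPT | 65.0 | $1.20 | 205K |
| 40 | DeepSeek: DeepSeek V3.2 | DeepSeek | 63.1 | $0.34 | 131K |
| 41 | Gemma 4 31B IT | Lilac | 62.4 | $0.35 | 262K |
| 42 | Kimi K2 Thinking | Moonshot | 62.3 | | |
| 43 | Gemini 3.1 Flash Lite | NanoGPT | 62.1 | $1.50 | 1M |
| 44 | grok-4-0709 | Jiekou.AI | 61.8 | $13.50 | 256K |
| 45 | Claude 4.1 Opus Thinking (32K) | NanoGPT | 61.4 | $75.00 | 200K |
| 46 | Claude Haiku 4.5 | Anthropic | 61.0 | | |
| 47 | GPT 5.1 Codex Mini | NanoGPT | 60.8 | $2.00 | 400K |
| 48 | Claude 4 Sonnet | NanoGPT | 60.6 | $14.99 | 200K |
| 49 | Qwen: Qwen3.6 Flash | Qwen | 60.5 | $1.13 | 1M |
| 50 | MiniMax M2.5 | NanoGPT | 60.3 | $1.20 | 205K |
| 51 | Grok 4.1 Fast | xAI | 60.1 | | |
| 52 | GPT-5.3-Instant | Poe | 59.8 | $13.00 | 128K |
| 53 | MiMo-V2-Pro | Xiaomi Corp | 58.4 | | |
| 54 | Google: Gemini 2.5 Pro Preview 06-05 | Google | 57.5 | $10.00 | 1M |
| 55 | GLM-4.7 | Z.ai (Zhipu AI) | 57.3 | | |
| 56 | GLM-4.6 | Z.ai (Zhipu AI), Tsinghua University | 54.7 | | |
| 57 | Qwen: Qwen3 235B A22B Thinking 2507 | Qwen | 52.9 | $0.10 | 262K |
| 58 | Gemini 2.5 Flash Preview | NanoGPT | 52.3 | $0.60 | 1M |
| 59 | Qwen: Qwen3 Next 80B A3B Thinking | Qwen | 51.0 | $0.78 | 262K |
| 60 | NVIDIA: Nemotron 3 Ultra | NVIDIA | 50.7 | $2.20 | 1M |
| 61 | GPT-5 nano | OpenAI | 48.8 | | |
| 62 | GLM 5V Turbo Thinking | NanoGPT | 48.8 | $4.00 | 203K |
| 63 | gpt-oss-120b | OpenAI | 46.4 | | |
| 64 | Qwen: Qwen3 32B | Qwen | 42.7 | $0.28 | 131K |
| 65 | Gemini 2.5 Flash Lite | NanoGPT | 41.5 | $0.40 | 1M |
| 66 | Z.ai: GLM 4.6V | Z.ai | 38.9 | $0.90 | 131K |
| 67 | Qwen: Qwen3 30B A3B | Qwen | 38.8 | $0.50 | 131K |
| 68 | Mistral: Devstral 2 2512 | Mistral | 38.8 | $2.00 | 262K |
| 69 | Grok 4.20 (Non-Reasoning) | xAI | 37.9 | $2.50 | 1M |
| 70 | Nemotron 3 Super 120B A12B | Synthetic | 32.0 | $1.00 | 262K |
| 71 | X-Ai/Grok 4.1 Fast Non Reasoning | Qiniu | 31.6 | | 2M |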


Pricing is indicative; confirm with the provider before production use.