Skip to content

LMArena (Chatbot Arena) Elo Leaderboard

human-preference192 models · Updated June 2026

On the LMArena (Chatbot Arena) Elo benchmark, Anthropic: Claude Opus 4.6 (Fast) ranks #1 with a score of 1500, while Llama 3.1 8B (decentralized) offers the best score-per-dollar at $0.03/1M output tokens. The full ranking, with cost per million tokens, is below.

💰 Best value
Llama 3.1 8B (decentralized) — score 1187 at $0.03/1M output tokens
#ModelScoreOutput / 1MContext
1Anthropic: Claude Opus 4.6 (Fast)Anthropic1500$150.001M
2Claude Fable 5Anthropic1494
3Claude Opus 4.7Anthropic1489
4Gemini 3.5 FlashNanoGPT1480$9.001M
5Gemini 3.1 ProGoogle DeepMind1480
6Gemini 3 ProGoogle DeepMind1479
7Qwen3.7 Max ThinkingNanoGPT1475$7.501M
8GPT-5.4OpenAI1470
9GLM-5.1Z.ai (Zhipu AI)1468
10GPT-5.5OpenAI1468
11ERNIE 5.1NanoGPT1467$3.00119K
12Gemini 3 Flash ThinkingNanoGPT1466$3.001M
13Z.ai: GLM 5.2Z.ai1465$3.001M
14Qwen3.7 Plus ThinkingNanoGPT1463$1.60984K
15Claude Opus 4.8 ThinkingNanoGPT1463$25.011M
16MiMo-V2.5-ProXiaomi Corp1462
17Gemini 2.5 Pro (Jun 2025)Google DeepMind1457
18Claude Sonnet 4.6 ThinkingNanoGPT1457$14.991M
19Grok 4.20 (Reasoning)xAI1455$2.501M
20Kimi K2.6Moonshot1455
21Grok 4.20 Multi-AgentxAI1450$2.501M
22Claude Opus 4.5Anthropic1450
23DeepSeek-V4-ProDeepSeek1449
24Qwen3.6 Max PreviewNanoGPT1446$7.80246K
25GLM-5Z.ai (Zhipu AI)1446
26Kimi K2.5Moonshot1445
27Gemma 4 31BNanoGPT1441$0.35262K
28GPT-5.1Poe1441$9.00400K
29GLM-4.6Z.ai (Zhipu AI),Tsinghua University1440
30MiniMax M3 ThinkingNanoGPT1440$1.20512K
31Qwen3.5 397B-A17BAlibaba1439
32Qwen3-Max-ThinkingAlibaba1439
33GPT-5.2OpenAI1438
34Claude Sonnet 4.5Anthropic1438
35Grok 4.1xAI1437
36Qwen3.6 PlusAlibaba Token Plan1437$0.001M
37GLM-4.7Z.ai (Zhipu AI)1436
38MiMo-V2-ProXiaomi Corp1436
39Gemma 4 26B A4B ThinkingNanoGPT1435$0.40262K
40DeepSeek-V4-FlashDeepSeek1431
41Mistral Large 3Vercel AI Gateway1430$1.50256K
42GLM-4.5Z.ai (Zhipu AI),Tsinghua University1429
43chatgpt-4o-latest302.AI1429$15.00128K
44DeepSeek: R1 0528DeepSeek1428$2.15164K
45MiMo V2.5NanoGPT1427$0.281M
46DeepSeek: DeepSeek V3.2DeepSeek1424$0.34131K
47MiMo V2 OmniNanoGPT1423$2.00262K
48LongCat-FlashMeituan Inc1422
49Qwen: Qwen3 VL 235B A22B ThinkingQwen1421$2.60131K
50Mistral Medium 3.5NanoGPT1420$7.50256K
51DeepSeek: DeepSeek V3.1 TerminusDeepSeek1420$0.95164K
52Qwen: Qwen3 235B A22B Thinking 2507Qwen1419$0.10262K
53DeepSeek: DeepSeek V3.1DeepSeek1419$0.79164K
54GPT-5.5 InstantZenMux1419$30.00400K
55Qwen: Qwen3 Next 80B A3B ThinkingQwen1419$0.78262K
56Claude Opus 4.1Anthropic1418
57Qwen3.5-122B-A10BAlibaba1418
58GPT-4.5OpenAI1417
59Gemini 2.5 Flash PreviewNanoGPT1417$0.601M
60Gemini 3.1 Flash LiteNanoGPT1415$1.501M
61Kimi K2 Thinking TurboMoonshot AI1414$8.00262K
62GPT 5.4 MiniNanoGPT1413$4.50400K
63MiMo V2 FlashNanoGPT1411$0.31256K
64grok-4-0709Jiekou.AI1410$13.50256K
65o3OpenAI1409
66Grok 4 FastRequesty1409$0.502M
67Qwen3.5 27BNanoGPT1409$2.16260K
68Grok 4.1 FastxAI1408
69GPT-5OpenAI1405
70MiniMax M2.7NanoGPT1404$1.20205K
71Step 3.5 FlashNanoGPT1404$0.50256K
72Grok 4.3NanoGPT1401$2.501M
73Hunyuan-T1Tencent Coding Plan (China)1400$0.00131K
74Qwen: Qwen3.5-FlashQwen1398$0.261M
75Qwen3.5 35B A3BNanoGPT1396$1.80260K
76Claude Haiku 4.5Anthropic1392
77MiniMax-M2.1MiniMax1391
78OpenAI: GPT-5.3 ChatOpenAI1388$14.00128K
79Qwen: Qwen3 30B A3B Thinking 2507Qwen1384$0.40131K
80Z.ai: GLM 4.5 AirZ.ai1383$0.85131K
81GPT 4.1NanoGPT1382$8.001M
82Kimi K2 0905NanoGPT1379$2.00256K
83Nemotron 3 Super 120B A12BSynthetic1378$1.00262K
84Hunyuan-TurboSTencent1376
85Claude Opus 4Anthropic1375
86DeepSeek: DeepSeek V3 0324DeepSeek1375$0.77164K
87Z.ai: GLM 4.6VZ.ai1375$0.90131K
88GPT 5.4 NanoNanoGPT1374$1.25400K
89GPT-5 miniOpenAI1374
90DeepSeek-R1 (May 2025)DeepSeek1373
91Kimi K2 0711NanoGPT1371$2.00128K
92Mistral Medium 2505routing.run1369$2.00128K
93gemini-2.5-flash-lite-preview-06-17Jiekou.AI1368$0.361M
94Grok 3 MiniGitHub Models1367$0.00128K
95Qwen2.5-Max-2025-01-25Qiniu1366128K
96o1OpenAI1366
97Qwen3-235B-A22B-Thinking (Jul 2025)Alibaba1366
98gpt-oss-120bOpenAI1365
99Amazon: Nova 2 LiteAmazon1363$2.501M
100MiniMax M2.5NanoGPT1359$1.20205K
101Gemma 3 27B ITNanoGPT1358$0.30128K
102Mercury 2NanoGPT1357$0.75128K
103Qwen3-Coder-480B-A35BAlibaba1356
104INTELLECT 3Cortecs1356$1.20128K
105Gemini 2.0 FlashQiniu13541M
106Z.ai: GLM 4.7 FlashZ.ai1353$0.40203K
107OpenAI o4-mini highNanoGPT1353$4.40200K
108Step-3ZenMux1350$0.5766K
109Claude Sonnet 4Anthropic1348
110MiniMax M1NanoGPT1342$1.331M
111MiniMax-M2MiniMax1342
112Trinity Large ThinkingNanoGPT1342$0.90262K
113GPT 4.1 MiniNanoGPT1340$1.601M
114Qwen: Qwen3 32BQwen1340$0.28131K
115NVIDIA: Llama 3.3 Nemotron Super 49B V1.5NVIDIA1338$0.40131K
116o3-miniOpenAI1337
117Gemma 3 12B ITNanoGPT1334$0.27128K
118GLM 4.5V ThinkingNanoGPT1333$1.8064K
119DeepSeek-V3 (Mar 2025)DeepSeek1333
120Cohere Command A (08/2025)NanoGPT1331$10.00256K
121GLM 4 Plus 0111NanoGPT1331$10.00128K
122QwQ-32BAlibaba1329
123GPT-5 nanoOpenAI1320
124Llama-3.1-Nemotron-Ultra-253B-v1Nebius Token Factory1319$1.80128K
125Gemini 1.5 ProGoogle DeepMind1319
126o1-miniOpenAI1317
127Qwen: Qwen3 30B A3BQwen1317$0.50131K
128Claude 3.7 SonnetAnthropic1314
129Google: Gemma 3n 4BGoogle1306$0.1233K
130Grok-2xAI1305
131Yi-Lightning01.AI1302
132GPT-4o (Mar 2025)OpenAI1301
133Olmo 3 32B ThinkNanoGPT1299$0.45128K
134Claude 3.5 SonnetAnthropic1298
135Granite 4.1 8BNanoGPT1293$0.10131K
136Gemma 3 4B ITNanoGPT1291$0.20128K
137GLM-4-PlusZ.ai (Zhipu AI)1290
138Hunyuan-LargeTencent1288
139Llama 4 Maverick 17B 128E InstructDigitalOcean1288$0.871M
140gpt-oss-20bOpenAI1288
141Gemini 1.5 FlashNanoGPT1287$0.312M
142GPT-4o miniOpenAI1287
143GPT 4.1 NanoNanoGPT1285$0.401M
144Llama-3.1-Nemotron-70B-InstructNVIDIA,Meta AI1283
145MercuryInception Labs1282
146Llama 4 Scout 17B 16E InstructCloudflare Workers AI1281$0.85131K
147Llama 3.3 70BMeta AI1275
148GPT-4 Turbo (Apr 2024)OpenAI1272
149DeepSeek-V2.5DeepSeek1271
150Qwen2.5-72BAlibaba1269
151Mistral Large 24071266$6.00131K
152Mistral Large 2411NanoGPT1265$6.00128K
153Claude 3 OpusAnthropic1262
154Meta: Llama 3.1 70B InstructMeta1261$0.40131K
155Claude 3.5 HaikuQiniu1255200K
156Reka CoreReka AI1248
157Jamba 1.5-LargeAI21 Labs1237
158Google: Gemma 2 27BGoogle1231$0.658K
159Qwen2.5 Coder 32B Instruct1230$1.00128K
160Cohere: Command R+ (08-2024)Cohere1229$10.00128K
161GLM-4 (0520)Z.ai (Zhipu AI)1226
162Nemotron-4 340BNVIDIA1225
163Aya Expanse 32BCohere1224128K
164Llama 3-70BMeta AI1221
165Claude 3 SonnetAnthropic1218
166Qwen FlashAlibaba (China)1218$0.221M
167Phi-4Azure1217$0.50128K
168Qwen2-72BAlibaba1203
169Anthropic: Claude 3 HaikuAnthropic1195$1.25200K
170Cohere: Command RNanoGPT1187$1.43128K
171AI21 Jamba 1.5 MiniGitHub Models1187$0.00256K
172Llama 3.1 8B (decentralized)NanoGPT1187$0.03128K
173Aya Expanse 8BCohere11858K
174Qwen1.5-72BAlibaba1166
175Meta: Llama 3 8B InstructMeta1166$0.148K
176Gemma 2 2b ItNvidia1156$0.00128K
177Mixtral 8x7B Instruct v0.1Cortecs1132$0.6832K
178Google Gemini Pro Latest1131$12.001M
179Yi-34B01.AI1129
180GPT-3.5 Turbo 0125Azure1125$1.5016K
181DBRXDatabricks1119
182Llama 2-70BMeta AI1115
183Phi-3-small instruct (128k)GitHub Models1110$0.00128K
184Llama 3.2 3b InstructNanoGPT1110$0.05131K
185GPT-3.5 Turbo 1106Azure1094$2.0016K
186Meta: Llama 3.2 1B InstructMeta1055$0.20131K
187Falcon-180BTechnology Innovation Institute1054
188Llama 2-7BMeta AI1053
189Phi-3-mini instruct (128k)GitHub Models1050$0.00128K
190PaLM 2Google1027
191Mistral 7BMistral1024$0.258K
192ChatGLM3-6BZ.ai (Zhipu AI)972

What does LMArena (Chatbot Arena) Elo test?

Overview This dataset contains ALL in-the-wild conversation crowdsourced from Search Arena between March 18, 2025 and May 8, 2025. It includes 24,069 multi-turn conversations with search-LLMs across diverse intents, languages, and topics—alongs...

Frequently asked questions

Pricing is indicative — confirm with the provider before production use. Updated June 2026.