LMArena (the project formerly known as LMSys Chatbot Arena) is a public benchmark where humans blind-vote between two anonymous model responses to the same prompt. Votes are aggregated into Elo ratings, the same statistical method used to rank chess players, so a model's rating rises only if it consistently beats its peers in side-by-side comparisons.

Who is currently #1 on LMArena in June 2026?

Our June 2026 snapshot of the LMArena text leaderboard puts Anthropic: Claude Fable 5 at the top with 1525 Elo. The full top 10 is below; the rankings re-shuffle weekly as votes accumulate.

How should I read Arena Elo as an enterprise buyer?

Arena Elo is a strong signal for general chat quality and human preference under short-conversation conditions. It does not predict your production accuracy on function-calling, long-context retrieval, or domain-specific tasks. Use it as a triage filter, not a procurement decision — then build an internal eval harness on your own workload.

Why are there separate LMArena leaderboards?

The Arena now publishes separate leaderboards for text chat, coding, hard prompts, math, multi-turn, vision, and several language-specific variants. A model can lead one and rank mid-pack on another. The split between text and coding leaders is the most cited illustration.

Is the LMArena gameable?

Partially. Style polish and short-answer formatting tend to win votes, which can favour models tuned for chat over models tuned for accuracy. The Arena team has shipped category-specific leaderboards and prompt-category filters to reduce this skew. It is still the best human-preference signal we have at scale.

Updated Jun 14, 2026

LMArena Leaderboard — June 2026

What the LMArena actually is, how to read an Arena Elo score, and the current top 10 for June 2026. The original human-preference benchmark that started as LMSys Chatbot Arena and now anchors most enterprise model selection conversations.

What is the LMArena, in one paragraph?

The LMArena is a public, blind side-by-side voting site for AI chat models. A user submits a prompt, two anonymous models reply, the user picks a winner, and the project aggregates millions of such votes into Elo ratings. It started in 2023 as the LMSys Chatbot Arena out of UC Berkeley and rebranded to LMArena.ai in 2024-25 as it spun out into an independent project. The current June 2026 top 10 is below: several models now sit above the historical 1500 Elo barrier on text, with the open-weights tier within striking distance of the closed-source frontier.
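The vote-to-rating loop described above can be sketched with the classic online Elo update. This is a minimal illustration, not the Arena's actual pipeline: the starting rating of 1000, the K-factor of 4, and the names `expected_score` and `apply_vote` are all assumptions made for the example, and the Arena's published methodology may fit ratings differently (for instance in batch rather than vote by vote).

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


def apply_vote(ratings: dict[str, float], winner: str, loser: str, k: float = 4.0) -> None:
    """Update both ratings in place after one blind vote.

    New models start at 1000 (an assumed default). The winner gains
    k * (1 - expected win probability); the loser gives up the same amount,
    so an upset moves ratings more than an expected result.
    """
    winner_rating = ratings.setdefault(winner, 1000.0)
    loser_rating = ratings.setdefault(loser, 1000.0)
    surprise = 1.0 - expected_score(winner_rating, loser_rating)
    ratings[winner] = winner_rating + k * surprise
    ratings[loser] = loser_rating - k * surprise


# Three hypothetical votes between placeholder models.
ratings: dict[str, float] = {}
votes = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
for winner, loser in votes:
    apply_vote(ratings, winner, loser)

for name, rating in sorted(ratings.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {rating:.1f}")
```

Because each vote transfers points from loser to winner, the total rating mass stays constant; a model climbs only by taking points from the models it beats.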

How to read an Arena Elo score

Reference table for Arena Elo bands (June 2026)

  1510+   Frontier #1      Claude Opus 4.8 (AAII 61.4, coding & overall #1)
  1500    Frontier         Gemini 3.1 Pro, Claude Opus 4.7, GPT-5.5 Pro
  1450    Frontier-adj.    DeepSeek V4 Pro, Qwen 3.7 Max
  1400    Strong tier      GPT-4.1, Claude Sonnet 4, Gemini 2.5 Pro
  1300    Capable tier     Llama 4 Maverick, Mistral Large 3
  1200    Solid daily      Gemma 4, Phi-4, Mistral Small 3
  1100    Light tasks      DeepSeek V4 Flash, GPT-4o Mini
   <1100  Legacy tier      Older 2023-24 model generations

A 100-Elo gap means the higher-rated model wins ~64% of head-to-heads.
A 200-Elo gap means it wins ~76%. Rating shifts under 25 points are noise.
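The win rates quoted above follow from the standard Elo expected-score formula. A minimal sketch, assuming the textbook 400-point logistic scale; the function names are illustrative and not part of any Arena API:

```python
import math


def win_probability(elo_gap: float) -> float:
    """Expected head-to-head win rate for the higher-rated model."""
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / 400.0))


def gap_for_win_rate(win_rate: float) -> float:
    """Inverse: the Elo gap implied by an observed win rate (0 < win_rate < 1)."""
    return 400.0 * math.log10(win_rate / (1.0 - win_rate))


for gap in (25, 50, 100, 200):
    print(f"{gap:>3}-point gap -> {win_probability(gap):.1%}")
# Prints:
#  25-point gap -> 53.6%
#  50-point gap -> 57.1%
# 100-point gap -> 64.0%
# 200-point gap -> 76.0%
```

The inverse is handy when you have your own pairwise win rate from an internal eval and want to express it on the same scale as the leaderboard.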

Live Leaderboard

357 models

#	Model	Quality	Arena ELO	Speed	Price	Context	Value	Released
1	Anthropic: Claude Fable 5 New Anthropic · Frontier agentic coding & knowledge work	100	1525	58 t/s	$10 / $50	1M	3.3	Jun 2026
2	Anthropic: Claude Opus 4.8 New Anthropic · Coding, agents & computer use	99	1512	72 t/s	$5 / $25	1M	6.6	May 2026
3	OpenAI: GPT-5.5 Pro OpenAI · Reasoning at any cost	98	1510	68 t/s	$30 / $180	1M	0.9	Apr 2026
4	OpenAI: GPT-5.5 OpenAI · Frontier general purpose	97	1506	70 t/s	$5 / $30	1M	5.5	Apr 2026
5	OpenAI: GPT-5.4 Pro OpenAI · Complex analysis	97	—	—	$30 / $180	1M	0.9	Mar 2026
6	OpenAI: GPT-5.2 Pro OpenAI · Complex analysis	97	—	—	$21 / $168	400K	1.0	Dec 2025
7	Anthropic: Claude Opus 4.7 (Fast) Anthropic · Complex analysis	97	—	—	$30 / $150	1M	1.1	May 2026
8	Anthropic: Claude Opus 4.7 Anthropic · Coding & agentic workflows	96	1505	68 t/s	$5 / $25	1M	6.4	Apr 2026
9	OpenAI: o3 Deep Research OpenAI · Deep research	96	—	—	$10 / $40	200K	3.8	Oct 2025
10	OpenAI: o4 Mini Deep Research OpenAI · Deep research	96	—	—	$2 / $8	200K	19.2	Oct 2025
11	OpenAI: o3 Pro OpenAI · Hard reasoning	96	—	—	$20 / $80	200K	1.9	Jun 2025
12	Google: Gemini 3.1 Pro Preview Custom Tools Google · Speed & cost	96	1505	—	$2 / $12	1M	13.7	Feb 2026
13	Google: Gemini 3.1 Pro Preview Google · Science & long-context	96	1505	131 t/s	$2 / $12	1M	13.7	Apr 2026
14	Anthropic: Claude Opus 4.6 Anthropic · General purpose	95	1490	—	$5 / $25	1M	6.3	Feb 2026
15	Anthropic: Claude Opus 4.5 Anthropic · General purpose	95	—	—	$5 / $25	200K	6.3	Nov 2025
16	Anthropic: Claude Opus 4.6 (Fast) Anthropic · Complex analysis	95	—	—	$30 / $150	1M	1.1	Apr 2026
17	Google: Nano Banana Pro (Gemini 3 Pro Image Preview) Google · Image generation	94	—	—	$2 / $12	66K	13.4	Nov 2025
18	Anthropic: Claude Opus 4.1 Anthropic · Multimodal	94	—	—	$15 / $75	200K	2.1	Aug 2025
19	OpenAI: o3 OpenAI · Hard reasoning	94	1370	68 t/s	$10 / $40	200K	3.8	Apr 2025
20	Qwen: Qwen3.7 Max Alibaba Cloud · Long autonomous agentic runs	94	1488	90 t/s	$2.5 / $7.5	1M	18.8	May 2026
21	xAI: Grok 4.3 xAI · Agentic tasks & real-time info	93	1496	83 t/s	$1.25 / $2.5	1M	49.6	May 2026
22	OpenAI: GPT-5.4 OpenAI · General purpose	93	1495	—	$2.5 / $15	1M	10.6	Mar 2026
23	OpenAI: GPT-5.3 Chat OpenAI · General purpose	93	—	—	$1.75 / $14	128K	11.8	Mar 2026
24	OpenAI: GPT-5.3-Codex OpenAI · Code generation	93	—	—	$1.75 / $14	400K	11.8	Feb 2026
25	OpenAI: GPT-5.2-Codex OpenAI · Code generation	93	—	—	$1.75 / $14	400K	11.8	Jan 2026
26	OpenAI: GPT-5.2 Chat OpenAI · General purpose	93	—	—	$1.75 / $14	128K	11.8	Dec 2025
27	OpenAI: GPT-5.2 OpenAI · General purpose	93	—	—	$1.75 / $14	400K	11.8	Dec 2025
28	OpenAI: GPT-5.1-Codex-Max OpenAI · Code generation	93	—	—	$1.25 / $10	400K	16.5	Dec 2025
29	OpenAI: GPT-5.1 OpenAI · General purpose	93	—	—	$1.25 / $10	400K	16.5	Nov 2025
30	OpenAI: GPT-5.1 Chat OpenAI · General purpose	93	—	—	$1.25 / $10	128K	16.5	Nov 2025
31	OpenAI: GPT-5.1-Codex OpenAI · Code generation	93	—	—	$1.25 / $10	400K	16.5	Nov 2025
32	OpenAI: o1-pro OpenAI · Hard reasoning	93	—	—	$150 / $600	200K	0.2	Mar 2025
33	OpenAI: GPT-4 (older v0314) OpenAI · Complex analysis	93	—	—	$30 / $60	8K	2.1	May 2023
34	OpenAI: GPT-4 OpenAI · Multimodal	93	—	—	$30 / $60	8K	2.1	May 2023
35	xAI: Grok 4.20 xAI · General purpose	93	1496	—	$1.25 / $2.5	2M	49.6	Mar 2026
36	OpenAI: GPT-5.4 Image 2 OpenAI · Complex analysis	93	—	—	$8 / $15	272K	8.1	Apr 2026
37	MoonshotAI: Kimi K2.6 Moonshot AI · Frontier quality at low cost	92	1466	48 t/s	$0.73 / $3.49	256K	43.6	Apr 2026
38	Google: Gemini 2.5 Pro Google · Multimodal + value	92	1345	87 t/s	$1.25 / $10	1M	16.4	Mar 2025
39	Anthropic: Claude Opus 4 Anthropic · Complex analysis	91	1360	52 t/s	$15 / $75	200K	2.0	May 2025
40	TNG: DeepSeek R1T2 Chimera OSS · Hard reasoning	91	—	—	$0.3 / $1.1	164K	130.0	Jul 2025
41	Google: Gemini 2.5 Pro Preview 06-05 Google · Speed & cost	91	—	—	$1.25 / $10	1M	16.2	Jun 2025
42	DeepSeek: R1 0528 OSS DeepSeek · Hard reasoning	91	—	—	$0.5 / $2.15	164K	68.7	May 2025
43	Google: Gemini 2.5 Pro Preview 05-06 Google · Speed & cost	91	—	—	$1.25 / $10	1M	16.2	May 2025
44	DeepSeek: R1 Distill Qwen 32B OSS DeepSeek · Hard reasoning	91	—	—	$0.29 / $0.29	33K	313.8	Jan 2025
45	DeepSeek: R1 Distill Llama 70B OSS DeepSeek · Hard reasoning	91	—	—	$0.7 / $0.8	131K	121.3	Jan 2025
46	DeepSeek: R1 OSS DeepSeek · Hard reasoning	91	—	—	$0.7 / $2.5	64K	56.9	Jan 2025
47	MoonshotAI: Kimi K2.7 Code New OSS Moonshot AI · Open-weight agentic coding	91	—	55 t/s	$0.73 / $3.49	256K	43.1	Jun 2026
48	Nex AGI: Nexus N2 Pro New OSS · Open-weight reasoning & tool use	91	—	50 t/s	$0.2 / $0.8	262K	182.0	Jun 2026
49	DeepSeek: DeepSeek V4 Pro OSS DeepSeek · Open-source value leader	90	1467	33 t/s	$1.74 / $3.48	1M	34.5	Apr 2026
50	Anthropic: Claude Sonnet 4.6 Anthropic · Coding & balance	90	1467	73 t/s	$3 / $15	1M	10.0	Feb 2026
51	OpenAI: GPT-5 OpenAI · General purpose	90	1455	—	$1.25 / $10	400K	16.0	Aug 2025
52	xAI: Grok 3 Beta xAI · General purpose	90	—	—	$3 / $15	131K	10.0	Apr 2025
53	Qwen: Qwen3.6 Max Preview OSS Alibaba Cloud · Open-source	90	—	—	$1.04 / $6.24	262K	24.7	Apr 2026
54	OpenAI: GPT-4.1 OpenAI · Long context	89	1310	120 t/s	$2 / $8	1M	17.8	Apr 2025
55	MoonshotAI: Kimi K2.5 Moonshot AI · Speed & cost	89	1452	—	$0.4 / $1.9	262K	77.4	Jan 2026
56	MiniMax: MiniMax M3 New OSS · Open-weight agentic coding	89	1455	80 t/s	$0.6 / $2.4	1M	59.3	Jun 2026
57	Z.ai: GLM 5.2 New OSS · Open-weight agentic coding (provisional)	89	—	—	$0.98 / $3.08	200K	43.8	Jun 2026
58	Z.ai: GLM 5.1 OSS · Open-weight agentic & tool use	88	1467	48 t/s	$0.98 / $3.08	200K	43.3	Apr 2026
59	OpenAI: GPT-5 Image OpenAI · Multimodal	88	—	—	$10 / $10	400K	8.8	Oct 2025
60	OpenAI: GPT-5 Pro OpenAI · Complex analysis	88	—	—	$15 / $120	400K	1.3	Oct 2025
61	Anthropic: Claude Sonnet 4.5 Anthropic · General purpose	88	—	—	$3 / $15	1M	9.8	Sep 2025
62	OpenAI: GPT-4o Audio OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	Aug 2025
63	OpenAI: GPT-4o Search Preview OpenAI · Search + citations	88	—	—	$2.5 / $10	128K	14.1	Mar 2025
64	OpenAI: o1 OpenAI · Hard reasoning	88	—	—	$15 / $60	200K	2.3	Dec 2024
65	OpenAI: GPT-4o (2024-11-20) OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	Nov 2024
66	OpenAI: GPT-4o OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	May 2024
67	OpenAI: GPT-4o (extended) OpenAI · Multimodal	88	—	—	$6 / $18	128K	7.3	May 2024
68	OpenAI: GPT-4o (2024-05-13) OpenAI · General purpose	88	—	—	$5 / $15	128K	8.8	May 2024
69	OpenAI: GPT-4 Turbo OpenAI · Multimodal	88	—	—	$10 / $30	128K	4.4	Apr 2024
70	OpenAI: GPT-4 Turbo Preview OpenAI · Complex analysis	88	—	—	$10 / $30	128K	4.4	Jan 2024
71	OpenAI: GPT-4 Turbo (older v1106) OpenAI · Multimodal	88	—	—	$10 / $30	128K	4.4	Nov 2023
72	Z.ai: GLM 5 OSS · Open-source	88	1450	—	$0.6 / $1.92	80K	69.8	Feb 2026
73	Anthropic: Claude Sonnet 4 Anthropic · Coding & balance	88	1320	95 t/s	$3 / $15	200K	9.8	May 2025
74	OpenAI: o3 Mini OpenAI · Reasoning & math	88	1305	155 t/s	$1.1 / $4.4	200K	32.0	Jan 2025
75	xAI: Grok 3 xAI · Real-time info	87	1330	82 t/s	$3 / $15	131K	9.7	Feb 2025
76	DeepSeek: DeepSeek V3.2 OSS DeepSeek · Open-source	87	1455	—	$0.252 / $0.378	164K	276.2	Dec 2025
77	Nex AGI: DeepSeek V3.1 Nex N1 OSS · Open-source	86	—	—	$0.135 / $0.5	131K	270.9	Dec 2025
78	DeepSeek: DeepSeek V3.2 Speciale OSS DeepSeek · Open-source	86	—	—	$0.287 / $0.431	164K	239.6	Dec 2025
79	DeepSeek: DeepSeek V3.2 Exp OSS DeepSeek · Open-source	86	—	—	$0.27 / $0.41	164K	252.9	Sep 2025
80	DeepSeek: DeepSeek V3.1 Terminus OSS DeepSeek · Open-source	86	—	—	$0.27 / $0.95	164K	141.0	Sep 2025
81	DeepSeek: DeepSeek V3.1 OSS DeepSeek · Open-source	86	—	—	$0.21 / $0.79	33K	172.0	Aug 2025
82	DeepSeek: DeepSeek V3 0324 OSS DeepSeek · Open-source	86	—	—	$0.2 / $0.77	164K	177.3	Mar 2025
83	Anthropic: Claude 3.7 Sonnet Anthropic · General purpose	86	—	—	$3 / $15	200K	9.6	Feb 2025
84	Anthropic: Claude 3.7 Sonnet (thinking) Anthropic · Hard reasoning	86	—	—	$3 / $15	200K	9.6	Feb 2025
85	DeepSeek: DeepSeek V3 OSS DeepSeek · Best open-source value	86	1310	62 t/s	$0.27 / $1.1	128K	125.5	Mar 2025
86	Qwen: Qwen3.6 Plus Alibaba Cloud · Multilingual & APAC	86	1448	124 t/s	$1.4 / $5.6	256K	24.6	Apr 2026
87	OpenAI: GPT-4o (2024-08-06) OpenAI · General purpose	85	1285	109 t/s	$2.5 / $10	128K	13.6	May 2024
88	Mistral: Mistral Large 3 2512 OSS Mistral AI · Open-source	85	—	—	$0.5 / $1.5	262K	85.0	Dec 2025
89	Mistral Large 2407 OSS Mistral AI · Open-source	85	—	—	$2 / $6	131K	21.3	Nov 2024
90	Mistral Large OSS Mistral AI · Open-source	85	—	—	$2 / $6	128K	21.3	Feb 2024
91	Google: Gemini 3.5 Flash Google · Speed & cost	84	—	—	$1.5 / $9	1M	16.0	May 2026
92	Nex AGI: Nexus N2 mini New OSS · Accessible open-weight agentics	84	—	110 t/s	$0.05 / $0.2	262K	672.0	Jun 2026
93	OpenAI: GPT-5.4 Mini OpenAI · Speed & cost	83	—	—	$0.75 / $4.5	400K	31.6	Mar 2026
94	OpenAI: GPT-5 Mini OpenAI · Speed & cost	83	—	—	$0.25 / $2	400K	73.8	Aug 2025
95	Qwen: Qwen3.5-9B OSS Alibaba Cloud · Open-source	82	—	—	$0.04 / $0.15	256K	863.2	Mar 2026
96	Qwen: Qwen3.5-35B-A3B OSS Alibaba Cloud · Open-source	82	—	—	$0.139 / $1	262K	144.0	Feb 2026
97	Qwen: Qwen3.5-27B OSS Alibaba Cloud · Open-source	82	—	—	$0.195 / $1.56	262K	93.4	Feb 2026
98	Qwen: Qwen3.5-122B-A10B OSS Alibaba Cloud · Open-source	82	—	—	$0.26 / $2.08	262K	70.1	Feb 2026
99	Qwen: Qwen3.5-Flash OSS Alibaba Cloud · Speed & cost	82	—	—	$0.065 / $0.26	1M	504.6	Feb 2026
100	Qwen: Qwen3.5 Plus 2026-02-15 OSS Alibaba Cloud · Open-source	82	—	—	$0.26 / $1.56	1M	90.1	Feb 2026

Page 1 of 4 · 1–100 of 357

Quality = composite benchmark (MMLU, HumanEval, MATH)
Arena ELO = LMSYS Chatbot Arena rating
Value = quality per dollar
Price = input / output per 1M tokens
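The legend defines Value only as "quality per dollar" and does not say which dollar figure is used. The rows shown are consistent with Quality divided by the mean of the input and output prices. That formula is inferred from the table, not stated by the source, and the helper name `value_score` is hypothetical. A short check against four rows from the table:

```python
def value_score(quality: float, input_price: float, output_price: float) -> float:
    """Assumed formula: Quality / mean(input price, output price), prices in USD per 1M tokens."""
    blended_price = (input_price + output_price) / 2.0
    return quality / blended_price


# (model, Quality, input price, output price, Value as published in the table)
rows = [
    ("Claude Fable 5", 100, 10.0, 50.0, 3.3),
    ("Claude Opus 4.8", 99, 5.0, 25.0, 6.6),
    ("Grok 4.3", 93, 1.25, 2.5, 49.6),
    ("Kimi K2.6", 92, 0.73, 3.49, 43.6),
]

computed_values = []
for name, quality, input_price, output_price, published in rows:
    computed = round(value_score(quality, input_price, output_price), 1)
    computed_values.append(computed)
    print(f"{name}: computed {computed}, published {published}")
```

If your traffic is heavily skewed toward input or output tokens, a weighted blend of the two prices will rank models differently from this equal-weight figure.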

What to do this quarter

Treat Arena Elo as a triage filter, not a decision. Use it to drop the bottom half of your candidate list, then run a real eval on the remainder.
Pick the right Arena board. Coding teams should read the coding Arena (Claude Opus 4.8 now leads at ~1582 Elo, ahead of Opus 4.7 at 1567). Long-context teams should read the hard-prompts Arena. The aggregate text leaderboard is the wrong signal for many enterprise workloads.
Discount short-conversation polish. The Arena rewards style. Models tuned for chat win at the margin against models tuned for accuracy. Build internal evals that reward what your business actually pays for.
Watch the gap, not the ranking. Sub-25 Elo shifts are within statistical noise. Anything under 50 Elo between two candidates is close to a coin flip on most workloads: a 50-point gap implies an expected head-to-head win rate of only about 57%.
Plan for the multi-way race. Claude Fable 5 holds a narrow lead on the text board, but Claude Opus 4.8, Gemini 3.1 Pro, Claude Opus 4.7, GPT-5.5 Pro, Qwen 3.7 Max, and DeepSeek V4 Pro are approximately interchangeable on quality at the top. Optimise your stack for switching cost, not for capability.
Capture vote-rate momentum. The fastest-rising models week-over-week are usually the next month's leaders. Subscribe to weekly Arena reports.
Pair Arena Elo with cost. A 50-Elo lead at 10x the price is rarely a good trade. See our model leaderboard for combined quality-cost rankings.
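Several of the steps above (drop the bottom half, treat gaps under 50 Elo as ties, then weigh cost) can be combined into one triage pass. The sketch below is one possible reading of that advice, not a prescribed procedure: the `Candidate` type, the `shortlist` function, the equal-weight blended price, and the choice of eight candidates are all assumptions for illustration. Elo and price figures are copied from the leaderboard table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    elo: int
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens

    @property
    def blended_price(self) -> float:
        """Equal-weight average of input and output price (an assumed blend)."""
        return (self.input_price + self.output_price) / 2.0


def shortlist(candidates: list[Candidate], tie_band: int = 50) -> list[Candidate]:
    """Keep the top half by Elo, keep those within tie_band of the leader, sort by price."""
    ranked = sorted(candidates, key=lambda c: c.elo, reverse=True)
    top_half = ranked[: (len(ranked) + 1) // 2]
    leader_elo = top_half[0].elo
    tied_with_leader = [c for c in top_half if leader_elo - c.elo < tie_band]
    return sorted(tied_with_leader, key=lambda c: c.blended_price)


candidates = [
    Candidate("Claude Fable 5", 1525, 10.0, 50.0),
    Candidate("Claude Opus 4.8", 1512, 5.0, 25.0),
    Candidate("GPT-5.5 Pro", 1510, 30.0, 180.0),
    Candidate("Gemini 3.1 Pro Preview", 1505, 2.0, 12.0),
    Candidate("Grok 4.3", 1496, 1.25, 2.5),
    Candidate("Kimi K2.6", 1466, 0.73, 3.49),
    Candidate("GPT-4.1", 1310, 2.0, 8.0),
    Candidate("o3 Mini", 1305, 1.1, 4.4),
]

picks = shortlist(candidates)
for candidate in picks:
    print(f"{candidate.name}: {candidate.elo} Elo, ${candidate.blended_price:.2f} blended")
```

The output is a list of models to run through your own eval, cheapest first. It is a starting order for testing, not a ranking of which model to buy.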
