LMArena Leaderboard — June 2026
What the LMArena actually is, how to read an Arena Elo score, and the current top 10 for June 2026. It is the original human-preference benchmark: it started as LMSys Chatbot Arena and now anchors most enterprise model-selection conversations.
What is the LMArena, in one paragraph?
The LMArena is a public, blind side-by-side voting site for AI chat models. A user submits a prompt, two anonymous models reply, the user picks a winner, and the project aggregates millions of such votes into Elo ratings. It started in 2023 as the LMSys Chatbot Arena out of UC Berkeley and rebranded to LMArena.ai in 2024-25 as it spun out into an independent project. The current June 2026 top 10 is below — three models now sit above the historical 1500 Elo barrier on text, with the open-weights tier within striking distance of the closed-source frontier.
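To make the vote-to-rating step concrete, here is a minimal sketch of a classic online Elo update over pairwise votes. It illustrates the general technique only: the starting rating, the `K` step size, the vote tuple format, and the model labels are all assumptions for this example, and the project's real fitting procedure may differ.

```python
from collections import defaultdict

K = 32          # update step size (assumed value, for illustration only)
BASE = 1000.0   # starting rating for an unseen model (assumed value)


def expected_score(r_a: float, r_b: float) -> float:
    """Probability that A beats B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def rate(votes):
    """Fold votes into ratings.

    votes: iterable of (model_a, model_b, winner), winner in {"a", "b", "tie"}.
    """
    ratings = defaultdict(lambda: BASE)
    outcome = {"a": 1.0, "b": 0.0, "tie": 0.5}
    for a, b, winner in votes:
        e_a = expected_score(ratings[a], ratings[b])
        delta = K * (outcome[winner] - e_a)
        ratings[a] += delta   # A gains what B loses, so the total is conserved
        ratings[b] -= delta
    return dict(ratings)


# Hypothetical votes between two placeholder models.
votes = [
    ("model-x", "model-y", "a"),
    ("model-x", "model-y", "a"),
    ("model-y", "model-x", "tie"),
]
ratings = rate(votes)
print(ratings)
```

Note that an online update like this depends on vote order; fitting all votes jointly avoids that, at the cost of a heavier computation.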
How to read an Arena Elo score
Reference table for Arena Elo bands (June 2026)

| Arena Elo | Tier | Example models |
|---|---|---|
| 1510+ | Frontier #1 | Claude Opus 4.8 (AAII 61.4, coding & overall #1) |
| 1500 | Frontier | Gemini 3.1 Pro, Claude Opus 4.7, GPT-5.5 Pro |
| 1450 | Frontier-adj. | DeepSeek V4 Pro, Qwen 3.7 Max |
| 1400 | Strong tier | GPT-4.1, Claude Sonnet 4, Gemini 2.5 Pro |
| 1300 | Capable tier | Llama 4 Maverick, Mistral Large 3 |
| 1200 | Solid daily | Gemma 4, Phi-4, Mistral Small 3 |
| 1100 | Light tasks | DeepSeek V4 Flash, GPT-4o Mini |
| <1100 | Legacy tier | Older 2023-24 model generations |

A 100-Elo gap means the higher-rated model wins ~64% of head-to-heads. A 200-Elo gap means it wins ~76%. Rating shifts under 25 points are noise.
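The win-rate figures for a given Elo gap follow from the standard Elo expected-score formula, P = 1 / (1 + 10^(-gap/400)). The sketch below converts in both directions; the function names are illustrative, not part of any Arena tooling.

```python
import math


def win_probability(elo_gap: float) -> float:
    """Expected head-to-head win rate of the higher-rated model."""
    return 1.0 / (1.0 + 10 ** (-elo_gap / 400.0))


def elo_gap_for(win_rate: float) -> float:
    """Inverse: the Elo gap implied by an observed win rate (0 < win_rate < 1)."""
    return 400.0 * math.log10(win_rate / (1.0 - win_rate))


for gap in (25, 50, 100, 200):
    print(f"{gap:>3}-Elo gap -> {win_probability(gap):.1%} expected win rate")
```

A 100-point gap comes out at roughly 64% and a 200-point gap at roughly 76%, while 25 points moves the win rate only a few points away from 50%.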
Live Leaderboard
| # | Model | Quality | Arena Elo | Speed | Price | Context | Value | Released |
|---|---|---|---|---|---|---|---|---|
| 1 | Anthropic · Frontier agentic coding & knowledge work | 100 | 1525 | 58 t/s | $10 / $50 | 1M | 3.3 | Jun 2026 |
| 2 | Anthropic · Coding, agents & computer use | 99 | 1512 | 72 t/s | $5 / $25 | 1M | 6.6 | May 2026 |
| 3 | OpenAI · Reasoning at any cost | 98 | 1510 | 68 t/s | $30 / $180 | 1M | 0.9 | Apr 2026 |
| 4 | OpenAI · Frontier general purpose | 97 | 1506 | 70 t/s | $5 / $30 | 1M | 5.5 | Apr 2026 |
| 5 | OpenAI · Complex analysis | 97 | — | — | $30 / $180 | 1M | 0.9 | Mar 2026 |
| 6 | OpenAI · Complex analysis | 97 | — | — | $21 / $168 | 400K | 1.0 | Dec 2025 |
| 7 | Anthropic · Complex analysis | 97 | — | — | $30 / $150 | 1M | 1.1 | May 2026 |
| 8 | Anthropic · Coding & agentic workflows | 96 | 1505 | 68 t/s | $5 / $25 | 1M | 6.4 | Apr 2026 |
| 9 | OpenAI · Deep research | 96 | — | — | $10 / $40 | 200K | 3.8 | Oct 2025 |
| 10 | OpenAI · Deep research | 96 | — | — | $2 / $8 | 200K | 19.2 | Oct 2025 |
| 11 | OpenAI · Hard reasoning | 96 | — | — | $20 / $80 | 200K | 1.9 | Jun 2025 |
| 12 | Google · Speed & cost | 96 | 1505 | — | $2 / $12 | 1M | 13.7 | Feb 2026 |
| 13 | Google · Science & long-context | 96 | 1505 | 131 t/s | $2 / $12 | 1M | 13.7 | Apr 2026 |
| 14 | Anthropic · General purpose | 95 | 1490 | — | $5 / $25 | 1M | 6.3 | Feb 2026 |
| 15 | Anthropic · General purpose | 95 | — | — | $5 / $25 | 200K | 6.3 | Nov 2025 |
| 16 | Anthropic · Complex analysis | 95 | — | — | $30 / $150 | 1M | 1.1 | Apr 2026 |
| 17 | Google · Image generation | 94 | — | — | $2 / $12 | 66K | 13.4 | Nov 2025 |
| 18 | Anthropic · Multimodal | 94 | — | — | $15 / $75 | 200K | 2.1 | Aug 2025 |
| 19 | OpenAI · Hard reasoning | 94 | 1370 | 68 t/s | $10 / $40 | 200K | 3.8 | Apr 2025 |
| 20 | Alibaba Cloud · Long autonomous agentic runs | 94 | 1488 | 90 t/s | $2.5 / $7.5 | 1M | 18.8 | May 2026 |
| 21 | xAI · Agentic tasks & real-time info | 93 | 1496 | 83 t/s | $1.25 / $2.5 | 1M | 49.6 | May 2026 |
| 22 | OpenAI · General purpose | 93 | 1495 | — | $2.5 / $15 | 1M | 10.6 | Mar 2026 |
| 23 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Mar 2026 |
| 24 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Feb 2026 |
| 25 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Jan 2026 |
| 26 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Dec 2025 |
| 27 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Dec 2025 |
| 28 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Dec 2025 |
| 29 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 30 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 128K | 16.5 | Nov 2025 |
| 31 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 32 | OpenAI · Hard reasoning | 93 | — | — | $150 / $600 | 200K | 0.2 | Mar 2025 |
| 33 | OpenAI · Complex analysis | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 34 | OpenAI · Multimodal | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 35 | xAI · General purpose | 93 | 1496 | — | $1.25 / $2.5 | 2M | 49.6 | Mar 2026 |
| 36 | OpenAI · Complex analysis | 93 | — | — | $8 / $15 | 272K | 8.1 | Apr 2026 |
| 37 | Moonshot AI · Frontier quality at low cost | 92 | 1466 | 48 t/s | $0.73 / $3.49 | 256K | 43.6 | Apr 2026 |
| 38 | Google · Multimodal + value | 92 | 1345 | 87 t/s | $1.25 / $10 | 1M | 16.4 | Mar 2025 |
| 39 | Anthropic · Complex analysis | 91 | 1360 | 52 t/s | $15 / $75 | 200K | 2.0 | May 2025 |
| 40 | · Hard reasoning | 91 | — | — | $0.3 / $1.1 | 164K | 130.0 | Jul 2025 |
| 41 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | Jun 2025 |
| 42 | DeepSeek · Hard reasoning | 91 | — | — | $0.5 / $2.15 | 164K | 68.7 | May 2025 |
| 43 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | May 2025 |
| 44 | DeepSeek · Hard reasoning | 91 | — | — | $0.29 / $0.29 | 33K | 313.8 | Jan 2025 |
| 45 | DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $0.8 | 131K | 121.3 | Jan 2025 |
| 46 | DeepSeek: R1 (OSS), DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $2.5 | 64K | 56.9 | Jan 2025 |
| 47 | Moonshot AI · Open-weight agentic coding | 91 | — | 55 t/s | $0.73 / $3.49 | 256K | 43.1 | Jun 2026 |
| 48 | · Open-weight reasoning & tool use | 91 | — | 50 t/s | $0.2 / $0.8 | 262K | 182.0 | Jun 2026 |
| 49 | DeepSeek · Open-source value leader | 90 | 1467 | 33 t/s | $1.74 / $3.48 | 1M | 34.5 | Apr 2026 |
| 50 | Anthropic · Coding & balance | 90 | 1467 | 73 t/s | $3 / $15 | 1M | 10.0 | Feb 2026 |
| 51 | OpenAI · General purpose | 90 | 1455 | — | $1.25 / $10 | 400K | 16.0 | Aug 2025 |
| 52 | xAI · General purpose | 90 | — | — | $3 / $15 | 131K | 10.0 | Apr 2025 |
| 53 | Alibaba Cloud · Open-source | 90 | — | — | $1.04 / $6.24 | 262K | 24.7 | Apr 2026 |
| 54 | OpenAI · Long context | 89 | 1310 | 120 t/s | $2 / $8 | 1M | 17.8 | Apr 2025 |
| 55 | Moonshot AI · Speed & cost | 89 | 1452 | — | $0.4 / $1.9 | 262K | 77.4 | Jan 2026 |
| 56 | · Open-weight agentic coding | 89 | 1455 | 80 t/s | $0.6 / $2.4 | 1M | 59.3 | Jun 2026 |
| 57 | · Open-weight agentic coding (provisional) | 89 | — | — | $0.98 / $3.08 | 200K | 43.8 | Jun 2026 |
| 58 | · Open-weight agentic & tool use | 88 | 1467 | 48 t/s | $0.98 / $3.08 | 200K | 43.3 | Apr 2026 |
| 59 | OpenAI · Multimodal | 88 | — | — | $10 / $10 | 400K | 8.8 | Oct 2025 |
| 60 | OpenAI · Complex analysis | 88 | — | — | $15 / $120 | 400K | 1.3 | Oct 2025 |
| 61 | Anthropic · General purpose | 88 | — | — | $3 / $15 | 1M | 9.8 | Sep 2025 |
| 62 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Aug 2025 |
| 63 | OpenAI · Search + citations | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Mar 2025 |
| 64 | OpenAI · Hard reasoning | 88 | — | — | $15 / $60 | 200K | 2.3 | Dec 2024 |
| 65 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Nov 2024 |
| 66 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | May 2024 |
| 67 | OpenAI · Multimodal | 88 | — | — | $6 / $18 | 128K | 7.3 | May 2024 |
| 68 | OpenAI · General purpose | 88 | — | — | $5 / $15 | 128K | 8.8 | May 2024 |
| 69 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Apr 2024 |
| 70 | OpenAI · Complex analysis | 88 | — | — | $10 / $30 | 128K | 4.4 | Jan 2024 |
| 71 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Nov 2023 |
| 72 | Z.ai: GLM 5 (OSS) · Open-source | 88 | 1450 | — | $0.6 / $1.92 | 80K | 69.8 | Feb 2026 |
| 73 | Anthropic · Coding & balance | 88 | 1320 | 95 t/s | $3 / $15 | 200K | 9.8 | May 2025 |
| 74 | OpenAI · Reasoning & math | 88 | 1305 | 155 t/s | $1.1 / $4.4 | 200K | 32.0 | Jan 2025 |
| 75 | xAI · Real-time info | 87 | 1330 | 82 t/s | $3 / $15 | 131K | 9.7 | Feb 2025 |
| 76 | DeepSeek · Open-source | 87 | 1455 | — | $0.252 / $0.378 | 164K | 276.2 | Dec 2025 |
| 77 | · Open-source | 86 | — | — | $0.135 / $0.5 | 131K | 270.9 | Dec 2025 |
| 78 | DeepSeek · Open-source | 86 | — | — | $0.287 / $0.431 | 164K | 239.6 | Dec 2025 |
| 79 | DeepSeek · Open-source | 86 | — | — | $0.27 / $0.41 | 164K | 252.9 | Sep 2025 |
| 80 | DeepSeek · Open-source | 86 | — | — | $0.27 / $0.95 | 164K | 141.0 | Sep 2025 |
| 81 | DeepSeek · Open-source | 86 | — | — | $0.21 / $0.79 | 33K | 172.0 | Aug 2025 |
| 82 | DeepSeek · Open-source | 86 | — | — | $0.2 / $0.77 | 164K | 177.3 | Mar 2025 |
| 83 | Anthropic · General purpose | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 84 | Anthropic · Hard reasoning | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 85 | DeepSeek · Best open-source value | 86 | 1310 | 62 t/s | $0.27 / $1.1 | 128K | 125.5 | Mar 2025 |
| 86 | Alibaba Cloud · Multilingual & APAC | 86 | 1448 | 124 t/s | $1.4 / $5.6 | 256K | 24.6 | Apr 2026 |
| 87 | OpenAI · General purpose | 85 | 1285 | 109 t/s | $2.5 / $10 | 128K | 13.6 | May 2024 |
| 88 | Mistral AI · Open-source | 85 | — | — | $0.5 / $1.5 | 262K | 85.0 | Dec 2025 |
| 89 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 131K | 21.3 | Nov 2024 |
| 90 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 128K | 21.3 | Feb 2024 |
| 91 | Google · Speed & cost | 84 | — | — | $1.5 / $9 | 1M | 16.0 | May 2026 |
| 92 | · Accessible open-weight agentics | 84 | — | 110 t/s | $0.05 / $0.2 | 262K | 672.0 | Jun 2026 |
| 93 | OpenAI · Speed & cost | 83 | — | — | $0.75 / $4.5 | 400K | 31.6 | Mar 2026 |
| 94 | OpenAI · Speed & cost | 83 | — | — | $0.25 / $2 | 400K | 73.8 | Aug 2025 |
| 95 | Alibaba Cloud · Open-source | 82 | — | — | $0.04 / $0.15 | 256K | 863.2 | Mar 2026 |
| 96 | Alibaba Cloud · Open-source | 82 | — | — | $0.139 / $1 | 262K | 144.0 | Feb 2026 |
| 97 | Alibaba Cloud · Open-source | 82 | — | — | $0.195 / $1.56 | 262K | 93.4 | Feb 2026 |
| 98 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $2.08 | 262K | 70.1 | Feb 2026 |
| 99 | Alibaba Cloud · Speed & cost | 82 | — | — | $0.065 / $0.26 | 1M | 504.6 | Feb 2026 |
| 100 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $1.56 | 1M | 90.1 | Feb 2026 |
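The table does not state how the Value column is computed. The listed numbers appear consistent with the Quality score divided by the mean of the two prices in the Price column; that is an inference from the rows, not a documented formula. A minimal sketch under that assumption, checked against a few rows (the function and parameter names are hypothetical):

```python
def value_score(quality: float, first_price: float, second_price: float) -> float:
    """Assumed formula: quality divided by the mean of the two listed prices."""
    blended = (first_price + second_price) / 2.0
    return round(quality / blended, 1)


# (rank, quality, first price, second price, listed Value) copied from the table.
rows = [
    (1, 100, 10.0, 50.0, 3.3),
    (2, 99, 5.0, 25.0, 6.6),
    (10, 96, 2.0, 8.0, 19.2),
    (21, 93, 1.25, 2.5, 49.6),
]

for rank, quality, p1, p2, listed in rows:
    computed = value_score(quality, p1, p2)
    print(f"row {rank}: computed {computed}, listed {listed}")
```

If the assumption holds, Value is very sensitive to price at the cheap end of the table: a small change in a sub-dollar price moves the score far more than a few points of Quality.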
What to do this quarter
- Treat Arena Elo as a triage filter, not a decision. Use it to drop the bottom half of your candidate list, then run a real eval on the remainder.
- Pick the right Arena board. Coding teams should read the coding Arena (Claude Opus 4.8 now leads at ~1582 Elo, ahead of Opus 4.7 at 1567). Long-context teams should read the hard-prompts Arena. The aggregate text leaderboard is the wrong signal for many enterprise workloads.
- Discount short-conversation polish. The Arena rewards style. Models tuned for chat win at the margin against models tuned for accuracy. Build internal evals that reward what your business actually pays for.
- Watch the gap, not the ranking. Sub-25 Elo shifts are within statistical noise. Anything under 50 Elo between two candidates is a coin flip on most workloads.
- Plan for the multi-way race. Claude Opus 4.8 holds a narrow lead, but Gemini 3.1 Pro, Claude Opus 4.7, GPT-5.5 Pro, Qwen 3.7 Max, and DeepSeek V4 Pro are approximately interchangeable on quality at the top. Optimise your stack for switching cost, not for capability.
- Capture vote-rate momentum. The fastest-rising models week-over-week are usually the next month's leaders. Subscribe to weekly Arena reports.
- Pair Arena Elo with cost. A 50-Elo lead at 10x the price is rarely a good trade. See our model leaderboard for combined quality-cost rankings.
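The triage advice above can be sketched as a small shortlist routine: drop the bottom half by Arena Elo, treat everything within 50 Elo of the leader as tied, then order the ties by price. The `Candidate` type, the `shortlist` function, and the use of the mean of the two listed prices as a single "blended" price are assumptions made for illustration; the Elo and price figures are copied from rows of the leaderboard table.

```python
from dataclasses import dataclass

NOISE_GAP = 50  # Elo gap below which two candidates are treated as tied


@dataclass(frozen=True)
class Candidate:
    label: str            # leaderboard row, used as a placeholder name
    elo: int
    blended_price: float  # mean of the two listed prices (assumed metric)


def shortlist(candidates):
    """Drop the bottom half by Elo, keep near-ties with the leader, sort by price."""
    ranked = sorted(candidates, key=lambda c: c.elo, reverse=True)
    top_half = ranked[: max(1, (len(ranked) + 1) // 2)]
    leader_elo = top_half[0].elo
    tied = [c for c in top_half if leader_elo - c.elo < NOISE_GAP]
    return sorted(tied, key=lambda c: c.blended_price)


candidates = [
    Candidate("row 1", 1525, 30.0),
    Candidate("row 2", 1512, 15.0),
    Candidate("row 13", 1505, 7.0),
    Candidate("row 21", 1496, 1.875),
    Candidate("row 20", 1488, 5.0),
    Candidate("row 49", 1467, 2.61),
    Candidate("row 19", 1370, 25.0),
    Candidate("row 54", 1310, 5.0),
]

result = shortlist(candidates)
for c in result:
    print(c.label, c.elo, c.blended_price)
```

The output is a list for a real evaluation, not a final pick: the cheapest near-tie goes first into your internal eval, and the pricier ones only need to be tested if it falls short.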
Related reading
- AI Model Leaderboard — full quality, speed, and pricing comparison
- LLM Leaderboard — same data, LLM-focused entry point
- LMSys Arena Leaderboard May 2026 — full deep-dive
- LMArena Elo Explained for Enterprise Buyers
For teams running multiple top-of-Arena models in production, Swfte Connect provides a single OpenAI-compatible endpoint that routes across providers and normalises Arena-tier quality without re-architecting your stack.