LMArena.ai — Top Models May 2026
LMArena.ai is the rebranded LMSys Chatbot Arena. Same blind pairwise-voting methodology, same Elo math, new home. Here is who leads each board this month and what the rebrand actually changed for buyers.
The LMSys to LMArena.ai story
The Chatbot Arena began in 2023 as a research project under LMSys, an academic group out of UC Berkeley. It quickly became the most-cited LLM benchmark because it measured something the capability-only benchmarks could not: actual human preference under blind side-by-side comparison. By 2024 the project had processed millions of votes, become a procurement input for Fortune 500 buyers, and outgrown its original academic scaffold. The 2024-25 transition to the lmarena.ai domain consolidated the project as an independent organisation while keeping the same Elo methodology and open vote pool.
For users the rebrand changed almost nothing: same prompts, same blind voting, same Elo math. For procurement teams the rebrand codified Arena Elo as a vendor-neutral signal independent of any single university. That is what made it sticky as a reference point in enterprise contracts.
The four-way race at the top
The May 2026 snapshot below shows three models above the historical 1500 Elo barrier on text. The top of LMArena.ai is now genuinely contested.
Top of LMArena.ai text leaderboard (May 2026) Gemini 3.1 Pro Preview 1500 ████████████████████ text leader Claude Opus 4.7 Thinking 1495 ███████████████████ coding #1 GPT-5.5 Pro 1488 ██████████████████ reasoning DeepSeek V4 Pro 1462 █████████████████ Apache 2.0 Qwen 3.6 Plus 1423 ███████████████ open weights Claude Sonnet 4 1402 ██████████████ workhorse tier GPT-4.1 1395 █████████████ legacy frontier Gemini 2.5 Pro 1388 █████████████ legacy frontier Llama 4 Maverick 1352 ███████████ open weights Mistral Large 3 1341 ███████████ open weights
Full Leaderboard
| # | Model | Quality | Arena ELO | Speed | Price | Context | Value | Released |
|---|---|---|---|---|---|---|---|---|
| 1 | OpenAI · Complex analysis | 97 | — | — | $30 / $180 | 1M | 0.9 | Mar 2026 |
| 2 | OpenAI · Complex analysis | 97 | — | — | $21 / $168 | 400K | 1.0 | Dec 2025 |
| 3 | OpenAI · Deep research | 96 | — | — | $10 / $40 | 200K | 3.8 | Oct 2025 |
| 4 | OpenAI · Deep research | 96 | — | — | $2 / $8 | 200K | 19.2 | Oct 2025 |
| 5 | OpenAI · Hard reasoning | 96 | — | — | $20 / $80 | 200K | 1.9 | Jun 2025 |
| 6 | Anthropic · General purpose | 95 | — | — | $5 / $25 | 1M | 6.3 | Feb 2026 |
| 7 | Anthropic · General purpose | 95 | — | — | $5 / $25 | 200K | 6.3 | Nov 2025 |
| 8 | Google · Speed & cost | 94 | — | — | $2 / $12 | 1M | 13.4 | Feb 2026 |
| 9 | Google · Speed & cost | 94 | — | — | $2 / $12 | 1M | 13.4 | Feb 2026 |
| 10 | Google · Image generation | 94 | — | — | $2 / $12 | 66K | 13.4 | Nov 2025 |
| 11 | Anthropic · Multimodal | 94 | — | — | $15 / $75 | 200K | 2.1 | Aug 2025 |
| 12 | Anthropic · Multimodal | 94 | — | — | $15 / $75 | 200K | 2.1 | May 2025 |
| 13 | OpenAI · General purpose | 93 | — | — | $2.5 / $15 | 1M | 10.6 | Mar 2026 |
| 14 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Mar 2026 |
| 15 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Feb 2026 |
| 16 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Jan 2026 |
| 17 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Dec 2025 |
| 18 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Dec 2025 |
| 19 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Dec 2025 |
| 20 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 21 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 128K | 16.5 | Nov 2025 |
| 22 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 23 | OpenAI · Hard reasoning | 93 | — | — | $150 / $600 | 200K | 0.2 | Mar 2025 |
| 24 | OpenAI · Complex analysis | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 25 | OpenAI · Multimodal | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 26 | OpenAI · General purpose | 92 | — | — | $1.25 / $10 | 400K | 16.4 | Aug 2025 |
| 27 | OpenAI · Hard reasoning | 92 | — | — | $2 / $8 | 200K | 18.4 | Apr 2025 |
| 28 | · Hard reasoning | 91 | — | — | $0.3 / $1.1 | 164K | 130.0 | Jul 2025 |
| 29 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | Jun 2025 |
| 30 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | Jun 2025 |
| 31 | DeepSeek · Hard reasoning | 91 | — | — | $0.45 / $2.15 | 164K | 70.0 | May 2025 |
| 32 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | May 2025 |
| 33 | DeepSeek · Hard reasoning | 91 | — | — | $0.29 / $0.29 | 33K | 313.8 | Jan 2025 |
| 34 | DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $0.8 | 131K | 121.3 | Jan 2025 |
| 35 | DeepSeek: R1OSS DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $2.5 | 64K | 56.9 | Jan 2025 |
| 36 | xAI · General purpose | 90 | — | — | $3 / $15 | 131K | 10.0 | Jun 2025 |
| 37 | xAI · General purpose | 90 | — | — | $3 / $15 | 131K | 10.0 | Apr 2025 |
| 38 | OpenAI · General purpose | 89 | — | — | $2 / $8 | 1M | 17.8 | Apr 2025 |
| 39 | Anthropic · General purpose | 88 | — | — | $3 / $15 | 1M | 9.8 | Feb 2026 |
| 40 | OpenAI · Multimodal | 88 | — | — | $10 / $10 | 400K | 8.8 | Oct 2025 |
| 41 | OpenAI · Complex analysis | 88 | — | — | $15 / $120 | 400K | 1.3 | Oct 2025 |
| 42 | Anthropic · General purpose | 88 | — | — | $3 / $15 | 1M | 9.8 | Sep 2025 |
| 43 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Aug 2025 |
| 44 | OpenAI · Search + citations | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Mar 2025 |
| 45 | OpenAI · Hard reasoning | 88 | — | — | $15 / $60 | 200K | 2.3 | Dec 2024 |
| 46 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Nov 2024 |
| 47 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Aug 2024 |
| 48 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | May 2024 |
| 49 | OpenAI · Multimodal | 88 | — | — | $6 / $18 | 128K | 7.3 | May 2024 |
| 50 | OpenAI · General purpose | 88 | — | — | $5 / $15 | 128K | 8.8 | May 2024 |
| 51 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Apr 2024 |
| 52 | OpenAI · Complex analysis | 88 | — | — | $10 / $30 | 128K | 4.4 | Jan 2024 |
| 53 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Nov 2023 |
| 54 | · Open-source | 86 | — | — | $0.135 / $0.5 | 131K | 270.9 | Dec 2025 |
| 55 | DeepSeek · Open-source | 86 | — | — | $0.4 / $1.2 | 164K | 107.5 | Dec 2025 |
| 56 | DeepSeek · Open-source | 86 | — | — | $0.26 / $0.38 | 164K | 268.8 | Dec 2025 |
| 57 | DeepSeek · Open-source | 86 | — | — | $0.27 / $0.41 | 164K | 252.9 | Sep 2025 |
| 58 | DeepSeek · Open-source | 86 | — | — | $0.21 / $0.79 | 164K | 172.0 | Sep 2025 |
| 59 | DeepSeek · Open-source | 86 | — | — | $0.15 / $0.75 | 33K | 191.1 | Aug 2025 |
| 60 | Anthropic · General purpose | 86 | — | — | $3 / $15 | 200K | 9.6 | May 2025 |
| 61 | DeepSeek · Open-source | 86 | — | — | $0.2 / $0.77 | 164K | 177.3 | Mar 2025 |
| 62 | Anthropic · General purpose | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 63 | Anthropic · Hard reasoning | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 64 | DeepSeek · Open-source | 86 | — | — | $0.32 / $0.89 | 164K | 142.1 | Dec 2024 |
| 65 | Mistral AI · Open-source | 85 | — | — | $0.5 / $1.5 | 262K | 85.0 | Dec 2025 |
| 66 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 131K | 21.3 | Nov 2024 |
| 67 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 131K | 21.3 | Nov 2024 |
| 68 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 128K | 21.3 | Feb 2024 |
| 69 | Cohere · Open-source | 84 | — | — | $2.5 / $10 | 128K | 13.4 | Aug 2024 |
| 70 | OpenAI · Speed & cost | 83 | — | — | $0.75 / $4.5 | 400K | 31.6 | Mar 2026 |
| 71 | OpenAI · Speed & cost | 83 | — | — | $0.25 / $2 | 400K | 73.8 | Aug 2025 |
| 72 | Alibaba Cloud · Open-source | 82 | — | — | $0.05 / $0.15 | 256K | 820.0 | Mar 2026 |
| 73 | Alibaba Cloud · Open-source | 82 | — | — | $0.1625 / $1.3 | 262K | 112.1 | Feb 2026 |
| 74 | Alibaba Cloud · Open-source | 82 | — | — | $0.195 / $1.56 | 262K | 93.4 | Feb 2026 |
| 75 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $2.08 | 262K | 70.1 | Feb 2026 |
| 76 | Alibaba Cloud · Speed & cost | 82 | — | — | $0.065 / $0.26 | 1M | 504.6 | Feb 2026 |
| 77 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $1.56 | 1M | 90.1 | Feb 2026 |
| 78 | Alibaba Cloud · Open-source | 82 | — | — | $0.39 / $2.34 | 262K | 60.1 | Feb 2026 |
| 79 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.78 / $3.9 | 262K | 35.0 | Feb 2026 |
| 80 | Alibaba Cloud · Code generation | 82 | — | — | $0.12 / $0.75 | 262K | 188.5 | Feb 2026 |
| 81 | Alibaba Cloud · Open-source | 82 | — | — | $0.104 / $0.416 | 131K | 315.4 | Oct 2025 |
| 82 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.117 / $1.365 | 131K | 110.7 | Oct 2025 |
| 83 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.5 | 131K | 282.8 | Oct 2025 |
| 84 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.13 / $1.56 | 131K | 97.0 | Oct 2025 |
| 85 | Alibaba Cloud · Open-source | 82 | — | — | $0.13 / $0.52 | 131K | 252.3 | Oct 2025 |
| 86 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.26 / $2.6 | 131K | 57.3 | Sep 2025 |
| 87 | Alibaba Cloud · Open-source | 82 | — | — | $0.2 / $0.88 | 262K | 151.9 | Sep 2025 |
| 88 | Alibaba Cloud · Open-source | 82 | — | — | $0.78 / $3.9 | 262K | 35.0 | Sep 2025 |
| 89 | Alibaba Cloud · Code generation | 82 | — | — | $0.65 / $3.25 | 1M | 42.1 | Sep 2025 |
| 90 | Alibaba Cloud · Code generation | 82 | — | — | $0.195 / $0.975 | 1M | 140.2 | Sep 2025 |
| 91 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.0975 / $0.78 | 131K | 186.9 | Sep 2025 |
| 92 | Alibaba Cloud · Open-source | 82 | — | — | $0.09 / $1.1 | 262K | 137.8 | Sep 2025 |
| 93 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.08 / $0.4 | 131K | 341.7 | Aug 2025 |
| 94 | Alibaba Cloud · Code generation | 82 | — | — | $0.07 / $0.27 | 160K | 482.4 | Jul 2025 |
| 95 | Alibaba Cloud · Open-source | 82 | — | — | $0.09 / $0.3 | 262K | 420.5 | Jul 2025 |
| 96 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.1495 / $1.495 | 131K | 99.7 | Jul 2025 |
| 97 | Alibaba Cloud · Code generation | 82 | — | — | $0.22 / $1 | 262K | 134.4 | Jul 2025 |
| 98 | Alibaba Cloud · Open-source | 82 | — | — | $0.071 / $0.1 | 262K | 959.1 | Jul 2025 |
| 99 | xAI · Speed & cost | 82 | — | — | $0.3 / $0.5 | 131K | 205.0 | Jun 2025 |
| 100 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.28 | 41K | 455.6 | Apr 2025 |
| 101 | Alibaba Cloud · Open-source | 82 | — | — | $0.05 / $0.4 | 41K | 364.4 | Apr 2025 |
| 102 | Alibaba Cloud · Open-source | 82 | — | — | $0.06 / $0.24 | 41K | 546.7 | Apr 2025 |
| 103 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.24 | 41K | 512.5 | Apr 2025 |
| 104 | Alibaba Cloud · Open-source | 82 | — | — | $0.455 / $1.82 | 131K | 72.1 | Apr 2025 |
| 105 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Apr 2025 |
| 106 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Apr 2025 |
| 107 | xAI · Speed & cost | 82 | — | — | $0.3 / $0.5 | 131K | 205.0 | Apr 2025 |
| 108 | Meta · Open-source | 82 | — | — | $0.15 / $0.6 | 1M | 218.7 | Apr 2025 |
| 109 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Feb 2025 |
| 110 | · General purpose | 82 | — | — | $4 / $8 | 131K | 13.7 | Feb 2025 |
| 111 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Jan 2025 |
| 112 | Alibaba Cloud · Open-source | 82 | — | — | $0.12 / $0.39 | 33K | 321.6 | Sep 2024 |
| 113 | · General purpose | 82 | — | — | $3.75 / $7.5 | 6K | 14.6 | Nov 2023 |
| 114 | Google · Speed & cost | 80 | — | — | $0.5 / $3 | 1M | 45.7 | Dec 2025 |
| 115 | Google · Image generation | 80 | — | — | $0.3 / $2.5 | 33K | 57.1 | Oct 2025 |
| 116 | Google · Speed & cost | 80 | — | — | $0.1 / $0.4 | 1M | 320.0 | Sep 2025 |
| 117 | Google · Speed & cost | 80 | — | — | $0.1 / $0.4 | 1M | 320.0 | Jul 2025 |
| 118 | Google · Speed & cost | 80 | — | — | $0.3 / $2.5 | 1M | 57.1 | Jun 2025 |
| 119 | OpenAI · Speed & cost | 80 | — | — | $0.4 / $1.6 | 1M | 80.0 | Apr 2025 |
| 120 | OpenAI · Search + citations | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Mar 2025 |
| 121 | OpenAI · Speed & cost | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Jul 2024 |
| 122 | OpenAI · Speed & cost | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Jul 2024 |
| 123 | Mistral AI · Code generation | 78 | — | — | $0.3 / $0.9 | 256K | 130.0 | Aug 2025 |
| 124 | Google · Open-source | 76 | — | — | $0.13 / $0.4 | 262K | 286.8 | Apr 2026 |
| 125 | Google · Open-source | 76 | — | — | $0.14 / $0.4 | 262K | 281.5 | Apr 2026 |
| 126 | Anthropic · Speed & cost | 76 | — | — | $1 / $5 | 200K | 25.3 | Oct 2025 |
| 127 | Anthropic · Speed & cost | 76 | — | — | $0.8 / $4 | 200K | 31.7 | Nov 2024 |
| 128 | · Open-source | 74 | — | — | $1.2 / $4 | 203K | 28.5 | Apr 2026 |
| 129 | xAI · General purpose | 74 | — | — | $2 / $6 | 2M | 18.5 | Mar 2026 |
| 130 | xAI · General purpose | 74 | — | — | $2 / $6 | 2M | 18.5 | Mar 2026 |
| 131 | · Open-source | 74 | — | — | $1.2 / $4 | 203K | 28.5 | Mar 2026 |
| 132 | OpenAI · General purpose | 74 | — | — | $2.5 / $10 | 128K | 11.8 | Jan 2026 |
| 133 | · General purpose | 74 | — | — | $1.25 / $1.25 | 128K | 59.2 | Nov 2025 |
| 134 | Amazon · General purpose | 74 | — | — | $2.5 / $12.5 | 1M | 9.9 | Oct 2025 |
| 135 | Perplexity · Search + citations | 74 | — | — | $3 / $15 | 200K | 8.2 | Oct 2025 |
| 136 | OpenAI · Image generation | 74 | — | — | $2.5 / $2 | 400K | 32.9 | Oct 2025 |
| 137 | · Open-source | 74 | — | — | $0.1 / $0.4 | 131K | 296.0 | Oct 2025 |
| 138 | OpenAI · Code generation | 74 | — | — | $1.25 / $10 | 400K | 13.2 | Sep 2025 |
| 139 | · General purpose | 74 | — | — | $2 / $8 | 256K | 14.8 | Aug 2025 |
| 140 | OpenAI · General purpose | 74 | — | — | $1.25 / $10 | 128K | 13.2 | Aug 2025 |
| 141 | xAI · General purpose | 74 | — | — | $3 / $15 | 256K | 8.2 | Jul 2025 |
| 142 | Meta · Speed & cost | 74 | — | — | $0.08 / $0.3 | 328K | 389.5 | Apr 2025 |
| 143 | Google · Open-source | 74 | — | — | $0.04 / $0.13 | 131K | 870.6 | Mar 2025 |
| 144 | Cohere · General purpose | 74 | — | — | $2.5 / $10 | 256K | 11.8 | Mar 2025 |
| 145 | Google · Open-source | 74 | — | — | $0.08 / $0.16 | 131K | 616.7 | Mar 2025 |
| 146 | Perplexity · Search + citations | 74 | — | — | $2 / $8 | 128K | 14.8 | Mar 2025 |
| 147 | Perplexity · Search + citations | 74 | — | — | $3 / $15 | 200K | 8.2 | Mar 2025 |
| 148 | Perplexity · Deep research | 74 | — | — | $2 / $8 | 128K | 14.8 | Mar 2025 |
| 149 | Qwen: Qwen-Max OSS Alibaba Cloud · Open-source | 74 | — | — | $1.04 / $4.16 | 33K | 28.5 | Feb 2025 |
| 150 | · Hard reasoning | 74 | — | — | $3 / $3 | 16K | 24.7 | Jan 2025 |
| 151 | Meta · Open-source | 74 | — | — | $0.1 / $0.32 | 131K | 352.4 | Dec 2024 |
| 152 | Mistral AI · Open-source | 74 | — | — | $2 / $6 | 131K | 18.5 | Nov 2024 |
| 153 | · General purpose | 74 | — | — | $3 / $5 | 16K | 18.5 | Oct 2024 |
| 154 | · Open-source | 74 | — | — | $1.2 / $1.2 | 131K | 61.7 | Oct 2024 |
| 155 | · General purpose | 74 | — | — | $2.5 / $10 | 8K | 11.8 | Oct 2024 |
| 156 | · General purpose | 74 | — | — | $2.5 / $10 | 8K | 11.8 | Oct 2024 |
| 157 | · Search + citations | 74 | — | — | $0.3 / $0.3 | 131K | 246.7 | Aug 2024 |
| 158 | Meta · Open-source | 74 | — | — | $0.4 / $0.4 | 131K | 185.0 | Jul 2024 |
| 159 | · Hard reasoning | 74 | — | — | $1.48 / $1.48 | 8K | 50.0 | Jun 2024 |
| 160 | OpenAI · General purpose | 74 | — | — | $1.5 / $2 | 4K | 42.3 | Sep 2023 |
| 161 | OpenAI · General purpose | 74 | — | — | $3 / $4 | 16K | 21.1 | Aug 2023 |
| 162 | Google · Speed & cost | 73 | — | — | $0.075 / $0.3 | 1M | 389.3 | Feb 2025 |
| 163 | Google · Speed & cost | 73 | — | — | $0.1 / $0.4 | 1M | 292.0 | Feb 2025 |
| 164 | OpenAI · Speed & cost | 72 | — | — | $0.2 / $1.25 | 400K | 99.3 | Mar 2026 |
| 165 | Mistral AI · Open-source | 72 | — | — | $0.15 / $0.6 | 262K | 192.0 | Mar 2026 |
| 166 | Mistral AI · Open-source | 72 | — | — | $0.1 / $0.3 | 33K | 360.0 | Dec 2025 |
| 167 | OpenAI · Speed & cost | 72 | — | — | $0.05 / $0.4 | 400K | 320.0 | Aug 2025 |
| 168 | Mistral AI · Open-source | 72 | — | — | $0.075 / $0.2 | 128K | 523.6 | Jun 2025 |
| 169 | OpenAI · Speed & cost | 72 | — | — | $0.1 / $0.4 | 1M | 288.0 | Apr 2025 |
| 170 | Mistral AI · Open-source | 72 | — | — | $0.03 / $0.11 | 131K | 1028.6 | Mar 2025 |
| 171 | Mistral AI · Open-source | 72 | — | — | $0.05 / $0.08 | 33K | 1107.7 | Jan 2025 |
| 172 | Mistral AI · Open-source | 72 | — | — | $0.02 / $0.04 | 131K | 2400.0 | Jul 2024 |
| 173 | Mistral AI · Open-source | 72 | — | — | $2 / $6 | 66K | 18.0 | Apr 2024 |
| 174 | Anthropic · Speed & cost | 72 | — | — | $0.25 / $1.25 | 200K | 96.0 | Mar 2024 |
| 175 | Mistral AI · Open-source | 72 | — | — | $0.54 / $0.54 | 33K | 133.3 | Dec 2023 |
| 176 | · Speed & cost | 66 | — | — | $0.4 / $2 | 262K | 55.0 | Mar 2026 |
| 177 | · Speed & cost | 66 | — | — | $1 / $3 | 1M | 33.0 | Mar 2026 |
| 178 | Google · Image generation | 66 | — | — | $0.5 / $3 | 66K | 37.7 | Feb 2026 |
| 179 | · Speed & cost | 66 | — | — | $0.8 / $1.6 | 131K | 55.0 | Feb 2026 |
| 180 | Z.ai: GLM 5OSS · Open-source | 66 | — | — | $0.72 / $2.3 | 80K | 43.7 | Feb 2026 |
| 181 | · Speed & cost | 66 | — | — | $0.3827 / $1.72 | 262K | 62.8 | Jan 2026 |
| 182 | · Speed & cost | 66 | — | — | $0.6 / $6 | 1M | 20.0 | Jan 2026 |
| 183 | OpenAI · Speed & cost | 66 | — | — | $0.6 / $2.4 | 128K | 44.0 | Jan 2026 |
| 184 | · Open-source | 66 | — | — | $0.39 / $1.75 | 203K | 61.7 | Dec 2025 |
| 185 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 262K | 55.0 | Dec 2025 |
| 186 | · Search + citations | 66 | — | — | $1 / $3 | 256K | 33.0 | Dec 2025 |
| 187 | · Hard reasoning | 66 | — | — | $0.47 / $2 | 131K | 53.4 | Nov 2025 |
| 188 | · Open-source | 66 | — | — | $0.39 / $1.9 | 205K | 57.6 | Sep 2025 |
| 189 | · Speed & cost | 66 | — | — | $0.85 / $1.25 | 256K | 62.9 | Sep 2025 |
| 190 | · Speed & cost | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Sep 2025 |
| 191 | · Search + citations | 66 | — | — | $1 / $3 | 131K | 33.0 | Aug 2025 |
| 192 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Aug 2025 |
| 193 | · Open-source | 66 | — | — | $0.6 / $1.8 | 66K | 55.0 | Aug 2025 |
| 194 | · Open-source | 66 | — | — | $0.6 / $2.2 | 131K | 47.1 | Jul 2025 |
| 195 | · Speed & cost | 66 | — | — | $0.85 / $3.4 | 131K | 31.1 | Jul 2025 |
| 196 | · Speed & cost | 66 | — | — | $0.57 / $2.3 | 131K | 46.0 | Jul 2025 |
| 197 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Jul 2025 |
| 198 | · Speed & cost | 66 | — | — | $0.9 / $1.9 | 262K | 47.1 | Jul 2025 |
| 199 | · Speed & cost | 66 | — | — | $0.8 / $1.2 | 82K | 66.0 | Jul 2025 |
| 200 | · Speed & cost | 66 | — | — | $0.42 / $1.25 | 123K | 79.0 | Jun 2025 |
| 201 | · Speed & cost | 66 | — | — | $0.4 / $2.2 | 1M | 50.8 | Jun 2025 |
| 202 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | May 2025 |
| 203 | · Speed & cost | 66 | — | — | $0.9 / $3.3 | 131K | 31.4 | May 2025 |
| 204 | · Speed & cost | 66 | — | — | $0.75 / $1.2 | 131K | 67.7 | May 2025 |
| 205 | · Code generation | 66 | — | — | $0.5 / $0.8 | 33K | 101.5 | May 2025 |
| 206 | · Speed & cost | 66 | — | — | $0.8 / $1.2 | 4K | 66.0 | Apr 2025 |
| 207 | · Open-source | 66 | — | — | $0.8 / $1.2 | 4K | 66.0 | Apr 2025 |
| 208 | · Open-source | 66 | — | — | $0.6 / $1.8 | 131K | 55.0 | Apr 2025 |
| 209 | · Speed & cost | 66 | — | — | $0.55 / $0.8 | 33K | 97.8 | Mar 2025 |
| 210 | · Speed & cost | 66 | — | — | $0.7 / $1.4 | 131K | 62.9 | Feb 2025 |
| 211 | Alibaba Cloud · Open-source | 66 | — | — | $0.52 / $2.08 | 131K | 50.8 | Feb 2025 |
| 212 | Alibaba Cloud · Open-source | 66 | — | — | $0.8 / $0.8 | 33K | 82.5 | Feb 2025 |
| 213 | Perplexity · Search + citations | 66 | — | — | $1 / $1 | 127K | 66.0 | Jan 2025 |
| 214 | · Hard reasoning | 66 | — | — | $0.65 / $0.75 | 131K | 94.3 | Dec 2024 |
| 215 | Amazon · Speed & cost | 66 | — | — | $0.8 / $3.2 | 300K | 33.0 | Dec 2024 |
| 216 | Alibaba Cloud · Code generation | 66 | — | — | $0.66 / $1 | 33K | 79.5 | Nov 2024 |
| 217 | · Speed & cost | 66 | — | — | $0.4 / $0.4 | 33K | 165.0 | Nov 2024 |
| 218 | · Hard reasoning | 66 | — | — | $0.85 / $0.85 | 131K | 77.6 | Aug 2024 |
| 219 | · Search + citations | 66 | — | — | $1 / $1 | 131K | 66.0 | Aug 2024 |
| 220 | Meta · Open-source | 66 | — | — | $0.51 / $0.74 | 8K | 105.6 | Apr 2024 |
| 221 | OpenAI · Speed & cost | 66 | — | — | $1 / $2 | 4K | 44.0 | Jan 2024 |
| 222 | · Speed & cost | 66 | — | — | $0.75 / $1 | 8K | 75.4 | Aug 2023 |
| 223 | · Speed & cost | 66 | — | — | $0.45 / $0.65 | 6K | 120.0 | Jul 2023 |
| 224 | OpenAI · Speed & cost | 66 | — | — | $0.5 / $1.5 | 16K | 66.0 | May 2023 |
| 225 | Google · Open-source | 65 | — | — | $0.04 / $0.08 | 131K | 1083.3 | Mar 2025 |
| 226 | · Open-source | 65 | — | — | $0.8 / $1.6 | 33K | 54.2 | Feb 2025 |
| 227 | · Speed & cost | 65 | — | — | $0.065 / $0.14 | 16K | 634.1 | Jan 2025 |
| 228 | Meta · Open-source | 65 | — | — | $0.02 / $0.05 | 16K | 1857.1 | Jul 2024 |
| 229 | Google · Open-source | 65 | — | — | $0.65 / $0.65 | 8K | 100.0 | Jul 2024 |
| 230 | Google · Open-source | 65 | — | — | $0.03 / $0.09 | 8K | 1083.3 | Jun 2024 |
| 231 | · Search + citations | 65 | — | — | $0.14 / $0.14 | 8K | 464.3 | May 2024 |
| 232 | Meta · Open-source | 65 | — | — | $0.03 / $0.04 | 8K | 1857.1 | Apr 2024 |
| 233 | Google · Speed & cost | 62 | — | — | $0.25 / $1.5 | 1M | 70.9 | Mar 2026 |
| 234 | · Speed & cost | 62 | — | — | $0.05 / $0.2 | 262K | 496.0 | Dec 2025 |
| 235 | · Speed & cost | 62 | — | — | $0.2 / $0.6 | 131K | 155.0 | Oct 2025 |
| 236 | · Speed & cost | 62 | — | — | $0.017 / $0.11 | 131K | 976.4 | Oct 2025 |
| 237 | · Speed & cost | 62 | — | — | $0.04 / $0.16 | 131K | 620.0 | Sep 2025 |
| 238 | Amazon · Speed & cost | 62 | — | — | $0.035 / $0.14 | 128K | 708.6 | Dec 2024 |
| 239 | · Speed & cost | 62 | — | — | $0.62 / $0.62 | 66K | 100.0 | Apr 2024 |
| 240 | · Hard reasoning | 58 | — | — | $0.22 / $0.85 | 262K | 108.4 | Apr 2026 |
| 241 | · Code generation | 58 | — | — | $0.3 / $1.2 | 256K | 77.3 | Mar 2026 |
| 242 | · Speed & cost | 58 | — | — | $0.1 / $0.1 | 16K | 580.0 | Mar 2026 |
| 243 | · Speed & cost | 58 | — | — | $0.3 / $1.2 | 205K | 77.3 | Mar 2026 |
| 244 | · Speed & cost | 58 | — | — | $0.1 / $0.5 | 262K | 193.3 | Mar 2026 |
| 245 | · Speed & cost | 58 | — | — | $0.25 / $2 | 262K | 51.6 | Mar 2026 |
| 246 | · Speed & cost | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Mar 2026 |
| 247 | · Speed & cost | 58 | — | — | $0.1 / $0.4 | 262K | 232.0 | Feb 2026 |
| 248 | · Speed & cost | 58 | — | — | $0.118 / $0.99 | 197K | 104.7 | Feb 2026 |
| 249 | · Speed & cost | 58 | — | — | $0.1 / $0.3 | 262K | 290.0 | Jan 2026 |
| 250 | · Speed & cost | 58 | — | — | $0.15 / $0.6 | 128K | 154.7 | Jan 2026 |
| 251 | · Speed & cost | 58 | — | — | $0.3 / $1.2 | 66K | 77.3 | Jan 2026 |
| 252 | · Speed & cost | 58 | — | — | $0.06 / $0.4 | 203K | 252.2 | Jan 2026 |
| 253 | · Open-source | 58 | — | — | $0.2 / $0.6 | 66K | 145.0 | Jan 2026 |
| 254 | · Speed & cost | 58 | — | — | $0.075 / $0.3 | 262K | 309.3 | Dec 2025 |
| 255 | · Speed & cost | 58 | — | — | $0.25 / $2 | 262K | 51.6 | Dec 2025 |
| 256 | · Speed & cost | 58 | — | — | $0.27 / $0.95 | 197K | 95.1 | Dec 2025 |
| 257 | · Speed & cost | 58 | — | — | $0.09 / $0.29 | 262K | 305.3 | Dec 2025 |
| 258 | · Open-source | 58 | — | — | $0.3 / $0.9 | 131K | 96.7 | Dec 2025 |
| 259 | · Speed & cost | 58 | — | — | $0.15 / $0.15 | 33K | 386.7 | Dec 2025 |
| 260 | Amazon · Speed & cost | 58 | — | — | $0.3 / $2.5 | 1M | 41.4 | Dec 2025 |
| 261 | Mistral AI · Speed & cost | 58 | — | — | $0.2 / $0.2 | 262K | 290.0 | Dec 2025 |
| 262 | Mistral AI · Speed & cost | 58 | — | — | $0.15 / $0.15 | 262K | 386.7 | Dec 2025 |
| 263 | Mistral AI · Speed & cost | 58 | — | — | $0.1 / $0.1 | 131K | 580.0 | Dec 2025 |
| 264 | · Speed & cost | 58 | — | — | $0.2 / $1.1 | 131K | 89.2 | Nov 2025 |
| 265 | · Hard reasoning | 58 | — | — | $0.15 / $0.5 | 66K | 178.5 | Nov 2025 |
| 266 | xAI · Speed & cost | 58 | — | — | $0.2 / $0.5 | 2M | 165.7 | Nov 2025 |
| 267 | OpenAI · Code generation | 58 | — | — | $0.25 / $2 | 400K | 51.6 | Nov 2025 |
| 268 | Mistral AI · Open-source | 58 | — | — | $0.1 / $0.3 | 32K | 290.0 | Oct 2025 |
| 269 | OpenAI · Speed & cost | 58 | — | — | $0.075 / $0.3 | 131K | 309.3 | Oct 2025 |
| 270 | · Speed & cost | 58 | — | — | $0.255 / $1 | 197K | 92.4 | Oct 2025 |
| 271 | · Hard reasoning | 58 | — | — | $0.07 / $0.28 | 131K | 331.4 | Oct 2025 |
| 272 | · Speed & cost | 58 | — | — | $0.3 / $0.5 | 131K | 145.0 | Sep 2025 |
| 273 | xAI · Speed & cost | 58 | — | — | $0.2 / $0.5 | 2M | 165.7 | Sep 2025 |
| 274 | Alibaba Cloud · Search + citations | 58 | — | — | $0.09 / $0.45 | 131K | 214.8 | Sep 2025 |
| 275 | · Speed & cost | 58 | — | — | $0.2 / $0.8 | 131K | 116.0 | Sep 2025 |
| 276 | Alibaba Cloud · Hard reasoning | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Sep 2025 |
| 277 | Alibaba Cloud · Open-source | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Sep 2025 |
| 278 | xAI · Speed & cost | 58 | — | — | $0.2 / $1.5 | 256K | 68.2 | Aug 2025 |
| 279 | · Search + citations | 58 | — | — | $0.13 / $0.4 | 131K | 218.9 | Aug 2025 |
| 280 | · Speed & cost | 58 | — | — | $0.07 / $0.28 | 120K | 331.4 | Aug 2025 |
| 281 | · Speed & cost | 58 | — | — | $0.14 / $0.56 | 30K | 165.7 | Aug 2025 |
| 282 | · Open-source | 58 | — | — | $0.13 / $0.85 | 131K | 118.4 | Jul 2025 |
| 283 | Z.ai: GLM 4 32B OSS · Open-source | 58 | — | — | $0.1 / $0.1 | 128K | 580.0 | Jul 2025 |
| 284 | · Speed & cost | 58 | — | — | $0.1 / $0.2 | 128K | 386.7 | Jul 2025 |
| 285 | Mistral AI · Open-source | 58 | — | — | $0.1 / $0.3 | 131K | 290.0 | Jul 2025 |
| 286 | · Speed & cost | 58 | — | — | $0.14 / $0.57 | 131K | 163.4 | Jul 2025 |
| 287 | · Speed & cost | 58 | — | — | $0.28 / $1.1 | 123K | 84.1 | Jun 2025 |
| 288 | · Speed & cost | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Jun 2025 |
| 289 | · Speed & cost | 58 | — | — | $0.18 / $0.18 | 131K | 322.2 | May 2025 |
| 290 | · Code generation | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Apr 2025 |
| 291 | Meta · Open-source | 58 | — | — | $0.18 / $0.18 | 164K | 322.2 | Apr 2025 |
| 292 | Alibaba Cloud · Open-source | 58 | — | — | $0.2 / $0.6 | 128K | 145.0 | Mar 2025 |
| 293 | · Speed & cost | 58 | — | — | $0.1 / $0.2 | 66K | 386.7 | Mar 2025 |
| 294 | Alibaba Cloud · Hard reasoning | 58 | — | — | $0.15 / $0.58 | 131K | 158.9 | Mar 2025 |
| 295 | Mistral AI · Open-source | 58 | — | — | $0.2 / $0.6 | 33K | 145.0 | Feb 2025 |
| 296 | Alibaba Cloud · Open-source | 58 | — | — | $0.1365 / $0.4095 | 131K | 212.5 | Feb 2025 |
| 297 | Alibaba Cloud · Open-source | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Feb 2025 |
| 298 | · Speed & cost | 58 | — | — | $0.2 / $1.1 | 1M | 89.2 | Jan 2025 |
| 299 | Amazon · Speed & cost | 58 | — | — | $0.06 / $0.24 | 300K | 386.7 | Dec 2024 |
| 300 | · Speed & cost | 58 | — | — | $0.17 / $0.43 | 33K | 193.3 | Sep 2024 |
| 301 | Meta · Open-source | 58 | — | — | $0.051 / $0.34 | 80K | 296.7 | Sep 2024 |
| 302 | Cohere · Open-source | 58 | — | — | $0.15 / $0.6 | 128K | 154.7 | Aug 2024 |
| 303 | Mistral AI · Open-source | 58 | — | — | $0.11 / $0.19 | 3K | 386.7 | Sep 2023 |
| 304 | · Speed & cost | 58 | — | — | $0.06 / $0.06 | 4K | 966.7 | Jul 2023 |
| 305 | · Speed & cost | 50 | — | — | $0.03 / $0.12 | 33K | 666.7 | Feb 2026 |
| 306 | · Speed & cost | 50 | — | — | $0.045 / $0.15 | 131K | 512.8 | Dec 2025 |
| 307 | OpenAI · Speed & cost | 50 | — | — | $0.039 / $0.19 | 131K | 436.7 | Aug 2025 |
| 308 | OpenAI · Speed & cost | 50 | — | — | $0.03 / $0.11 | 131K | 714.3 | Aug 2025 |
| 309 | Google · Open-source | 50 | — | — | $0.02 / $0.04 | 33K | 1666.7 | May 2025 |
| 310 | Alibaba Cloud · Code generation | 50 | — | — | $0.03 / $0.09 | 33K | 833.3 | Apr 2025 |
| 311 | · Open-source | 50 | — | — | $0.05 / $0.2 | 128K | 400.0 | Mar 2025 |
| 312 | Meta · Open-source | 50 | — | — | $0.02 / $0.06 | 131K | 1250.0 | Feb 2025 |
| 313 | Alibaba Cloud · Open-source | 50 | — | — | $0.0325 / $0.13 | 131K | 615.4 | Feb 2025 |
| 314 | Cohere · Open-source | 50 | — | — | $0.0375 / $0.15 | 128K | 533.3 | Dec 2024 |
| 315 | Alibaba Cloud · Open-source | 50 | — | — | $0.04 / $0.1 | 33K | 714.3 | Oct 2024 |
| 316 | Meta · Open-source | 50 | — | — | $0.027 / $0.2 | 60K | 440.5 | Sep 2024 |
| 317 | Meta · Open-source | 50 | — | — | $0.049 / $0.049 | 131K | 1020.4 | Sep 2024 |
| 318 | · Hard reasoning | 50 | — | — | $0.04 / $0.05 | 8K | 1111.1 | Aug 2024 |
What to do this quarter
- Update bookmarks and citations. Internal eval-spec docs and procurement RFPs that reference "lmsys.org" should be updated to lmarena.ai. The data continues at the new domain.
- Pull from the right board. Coding teams should cite the coding Arena Elo (Claude Opus 4.7 leads at 1567). Generic chat teams should cite the text leaderboard (Gemini 3.1 Pro Preview leads at ~1500).
- Build dual-vendor capability. The top four models are within 40 Elo of each other. Treat them as interchangeable on capability and optimise for switching cost.
- Pair Arena scores with workload-specific evals. Arena rewards short-conversation polish. Long-context, tool-use, and domain-specific tasks need their own measurement.
- Track the open-weight gap. DeepSeek V4 Pro under Apache 2.0 sits at 1462 Elo, within 38 points of the text leader. The gap is the smallest it has ever been.
- Watch GPT-5.5 Pro pricing. At $30/$180 per 1M tokens, paying for the top of LMArena.ai now costs 200x more per token than the cheapest tier. The cost curve is steepening.
- Re-baseline at every model launch. Tokenizer changes (Claude Opus 4.7 ships ~35% more tokens per input than 4.6) shift effective cost without shifting list price.
Related reading
- LMArena Explained — what LMArena is, how to read Arena Elo
- AI Model Leaderboard — full quality, speed, pricing comparison
- LLM Leaderboard
- LMSys Arena Leaderboard May 2026
- LMArena Elo Explained for Enterprise Buyers
Teams running side-by-side evals against multiple LMArena.ai leaders typically expose them through Swfte Connect as a single endpoint, then run their own internal Elo on production prompts. That is the only way to verify whether public Arena rank translates to your workload.