AI Model Leaderboard — May 2026
Every major AI model ranked by quality, speed, pricing, and value. Filter by category, sort by any metric, and find the right model for your use case. Live data refreshed monthly with LMSys Arena Elo, official provider pricing, and Artificial Analysis benchmarks.
Gold
OpenAI: GPT-5.5 Pro
99
Quality Index
Silver
OpenAI: GPT-5.5
98
Quality Index
Bronze
Anthropic: Claude Opus 4.7
97
Quality Index
Stop reading — start ranking
Three ways to put this leaderboard to work. Pick any one — they all start with a free Swfte account, no card required.
Run OpenAI: GPT-5.5 Pro free
The model topping this page is in your hands in 30 seconds. No card, no trial timer — sign in and prompt.
Start free
Get pinged on rank changes
Email the moment a model takes #1, drops below your price ceiling, or beats a benchmark you care about. One-click subscribe.
Set alerts
The Model-Hopper Challenge
Run the same prompt across 3+ models in the Swfte Playground. Spot something surprising — a sleeper win, a 10× price gap, a weird failure. Best entry each month: 50% off for 6 months.
Submit a finding
One winner picked monthly · discount applies to your first paid plan · see challenge rules
May 2026: Top Models, Best Value, Fastest Inference
The May 2026 ranking covers 326 models across LMSys Arena Elo, MMLU Pro, HumanEval, MATH, pricing, and inference speed. Top of the table: OpenAI: GPT-5.5 Pro at 99/100 quality. The full table below is sortable by any metric. Live data is refreshed hourly from official provider pricing pages and the public Arena.
Top 5 by Quality Index
- OpenAI: GPT-5.5 Pro — 99/100
- OpenAI: GPT-5.5 — 98/100
- Anthropic: Claude Opus 4.7 — 97/100
- OpenAI: GPT-5.4 Pro — 97/100
- OpenAI: GPT-5.2 Pro — 97/100
Best Price-to-Quality
- Mistral: Mistral Nemo — $0.04/1M out
- Mistral: Mistral Small 3 — $0.08/1M out
- Qwen: Qwen3 235B A22B Instruct 2507 — $0.1/1M out
- Mistral: Mistral Small 3.1 24B — $0.11/1M out
- Google: Gemma 3 12B — $0.13/1M out
See our LMSys Arena deep dive and the monthly release roundup.
| # | Model | Quality | Arena Elo | Speed | Price (input / output, $ per 1M tokens) | Context (tokens) | Value | Released |
|---|---|---|---|---|---|---|---|---|
| 1 | OpenAI: GPT-5.5 Pro · OpenAI · Reasoning at any cost | 99 | 1510 | 68 t/s | $30 / $180 | 1M | 0.9 | Apr 2026 |
| 2 | OpenAI: GPT-5.5 (New) · OpenAI · Frontier general purpose | 98 | 1506 | 70 t/s | $5 / $30 | 1M | 5.6 | Apr 2026 |
| 3 | Anthropic: Claude Opus 4.7 · Anthropic · Coding & agentic workflows | 97 | 1505 | 68 t/s | $5 / $25 | 1M | 6.5 | Apr 2026 |
| 4 | OpenAI: GPT-5.4 Pro · OpenAI · Complex analysis | 97 | — | — | $30 / $180 | 1M | 0.9 | Mar 2026 |
| 5 | OpenAI: GPT-5.2 Pro · OpenAI · Complex analysis | 97 | — | — | $21 / $168 | 400K | 1.0 | Dec 2025 |
| 6 | OpenAI · Deep research | 96 | — | — | $10 / $40 | 200K | 3.8 | Oct 2025 |
| 7 | OpenAI · Deep research | 96 | — | — | $2 / $8 | 200K | 19.2 | Oct 2025 |
| 8 | OpenAI · Hard reasoning | 96 | — | — | $20 / $80 | 200K | 1.9 | Jun 2025 |
| 9 | Google · Speed & cost | 96 | 1505 | — | $2 / $12 | 1M | 13.7 | Feb 2026 |
| 10 | Google · Speed & cost | 96 | 1505 | — | $2 / $12 | 1M | 13.7 | Feb 2026 |
| 11 | Anthropic · General purpose | 95 | 1490 | — | $5 / $25 | 1M | 6.3 | Feb 2026 |
| 12 | Anthropic · General purpose | 95 | — | — | $5 / $25 | 200K | 6.3 | Nov 2025 |
| 13 | xAI: Grok 4.3 (New) · xAI · Agentic tasks & real-time info | 94 | 1498 | 83 t/s | $1.25 / $2.5 | 1M | 50.1 | May 2026 |
| 14 | Google · Image generation | 94 | — | — | $2 / $12 | 66K | 13.4 | Nov 2025 |
| 15 | Anthropic · Multimodal | 94 | — | — | $15 / $75 | 200K | 2.1 | Aug 2025 |
| 16 | Anthropic · Multimodal | 94 | — | — | $15 / $75 | 200K | 2.1 | May 2025 |
| 17 | Moonshot AI · Frontier quality at low cost | 93 | 1466 | 48 t/s | $0.95 / $4 | 256K | 37.6 | Apr 2026 |
| 18 | OpenAI · General purpose | 93 | 1495 | — | $2.5 / $15 | 1M | 10.6 | Mar 2026 |
| 19 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Mar 2026 |
| 20 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Feb 2026 |
| 21 | OpenAI · Code generation | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Jan 2026 |
| 22 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 128K | 11.8 | Dec 2025 |
| 23 | OpenAI · General purpose | 93 | — | — | $1.75 / $14 | 400K | 11.8 | Dec 2025 |
| 24 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Dec 2025 |
| 25 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 26 | OpenAI · General purpose | 93 | — | — | $1.25 / $10 | 128K | 16.5 | Nov 2025 |
| 27 | OpenAI · Code generation | 93 | — | — | $1.25 / $10 | 400K | 16.5 | Nov 2025 |
| 28 | OpenAI · Hard reasoning | 93 | — | — | $150 / $600 | 200K | 0.2 | Mar 2025 |
| 29 | OpenAI · Complex analysis | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 30 | OpenAI · Multimodal | 93 | — | — | $30 / $60 | 8K | 2.1 | May 2023 |
| 31 | xAI · General purpose | 93 | 1496 | — | $2 / $6 | 2M | 23.3 | Mar 2026 |
| 32 | DeepSeek · Open-source value leader | 92 | 1467 | 33 t/s | $1.74 / $3.48 | 1M | 35.2 | Apr 2026 |
| 33 | OpenAI · Hard reasoning | 92 | — | — | $2 / $8 | 200K | 18.4 | Apr 2025 |
| 34 | · Hard reasoning | 91 | — | — | $0.3 / $1.1 | 164K | 130.0 | Jul 2025 |
| 35 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | Jun 2025 |
| 36 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | Jun 2025 |
| 37 | DeepSeek · Hard reasoning | 91 | — | — | $0.45 / $2.15 | 164K | 70.0 | May 2025 |
| 38 | Google · Speed & cost | 91 | — | — | $1.25 / $10 | 1M | 16.2 | May 2025 |
| 39 | DeepSeek · Hard reasoning | 91 | — | — | $0.29 / $0.29 | 33K | 313.8 | Jan 2025 |
| 40 | DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $0.8 | 131K | 121.3 | Jan 2025 |
| 41 | DeepSeek: R1 (OSS) · DeepSeek · Hard reasoning | 91 | — | — | $0.7 / $2.5 | 64K | 56.9 | Jan 2025 |
| 42 | Anthropic · General purpose | 91 | 1467 | — | $3 / $15 | 1M | 10.1 | Feb 2026 |
| 43 | · Open-weight agentic & tool use | 90 | 1467 | 48 t/s | $1.55 / $4.65 | 200K | 29.0 | Apr 2026 |
| 44 | OpenAI · General purpose | 90 | 1455 | — | $1.25 / $10 | 400K | 16.0 | Aug 2025 |
| 45 | xAI · General purpose | 90 | — | — | $3 / $15 | 131K | 10.0 | Jun 2025 |
| 46 | xAI · General purpose | 90 | — | — | $3 / $15 | 131K | 10.0 | Apr 2025 |
| 47 | OpenAI · General purpose | 89 | — | — | $2 / $8 | 1M | 17.8 | Apr 2025 |
| 48 | Moonshot AI · Speed & cost | 89 | 1452 | — | $0.3827 / $1.72 | 262K | 84.7 | Jan 2026 |
| 49 | OpenAI · Multimodal | 88 | — | — | $10 / $10 | 400K | 8.8 | Oct 2025 |
| 50 | OpenAI · Complex analysis | 88 | — | — | $15 / $120 | 400K | 1.3 | Oct 2025 |
| 51 | Anthropic · General purpose | 88 | — | — | $3 / $15 | 1M | 9.8 | Sep 2025 |
| 52 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Aug 2025 |
| 53 | OpenAI · Search + citations | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Mar 2025 |
| 54 | OpenAI · Hard reasoning | 88 | — | — | $15 / $60 | 200K | 2.3 | Dec 2024 |
| 55 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Nov 2024 |
| 56 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | Aug 2024 |
| 57 | OpenAI · General purpose | 88 | — | — | $2.5 / $10 | 128K | 14.1 | May 2024 |
| 58 | OpenAI · Multimodal | 88 | — | — | $6 / $18 | 128K | 7.3 | May 2024 |
| 59 | OpenAI · General purpose | 88 | — | — | $5 / $15 | 128K | 8.8 | May 2024 |
| 60 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Apr 2024 |
| 61 | OpenAI · Complex analysis | 88 | — | — | $10 / $30 | 128K | 4.4 | Jan 2024 |
| 62 | OpenAI · Multimodal | 88 | — | — | $10 / $30 | 128K | 4.4 | Nov 2023 |
| 63 | Z.ai: GLM 5 (OSS) · Open-source | 88 | 1450 | — | $0.72 / $2.3 | 80K | 58.3 | Feb 2026 |
| 64 | DeepSeek · Open-source | 87 | 1455 | — | $0.26 / $0.38 | 164K | 271.9 | Dec 2025 |
| 65 | · Open-source | 86 | — | — | $0.135 / $0.5 | 131K | 270.9 | Dec 2025 |
| 66 | DeepSeek · Open-source | 86 | — | — | $0.4 / $1.2 | 164K | 107.5 | Dec 2025 |
| 67 | DeepSeek · Open-source | 86 | — | — | $0.27 / $0.41 | 164K | 252.9 | Sep 2025 |
| 68 | DeepSeek · Open-source | 86 | — | — | $0.21 / $0.79 | 164K | 172.0 | Sep 2025 |
| 69 | DeepSeek · Open-source | 86 | — | — | $0.15 / $0.75 | 33K | 191.1 | Aug 2025 |
| 70 | Anthropic · General purpose | 86 | — | — | $3 / $15 | 200K | 9.6 | May 2025 |
| 71 | DeepSeek · Open-source | 86 | — | — | $0.2 / $0.77 | 164K | 177.3 | Mar 2025 |
| 72 | Anthropic · General purpose | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 73 | Anthropic · Hard reasoning | 86 | — | — | $3 / $15 | 200K | 9.6 | Feb 2025 |
| 74 | DeepSeek · Open-source | 86 | — | — | $0.32 / $0.89 | 164K | 142.1 | Dec 2024 |
| 75 | DeepSeek · Cheap-and-fast cascade tier | 85 | 1410 | 105 t/s | $0.14 / $0.28 | 1M | 404.8 | Apr 2026 |
| 76 | Mistral AI · Open-source | 85 | — | — | $0.5 / $1.5 | 262K | 85.0 | Dec 2025 |
| 77 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 131K | 21.3 | Nov 2024 |
| 78 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 131K | 21.3 | Nov 2024 |
| 79 | Mistral AI · Open-source | 85 | — | — | $2 / $6 | 128K | 21.3 | Feb 2024 |
| 80 | Cohere · Open-source | 84 | — | — | $2.5 / $10 | 128K | 13.4 | Aug 2024 |
| 81 | OpenAI · Speed & cost | 83 | — | — | $0.75 / $4.5 | 400K | 31.6 | Mar 2026 |
| 82 | OpenAI · Speed & cost | 83 | — | — | $0.25 / $2 | 400K | 73.8 | Aug 2025 |
| 83 | Alibaba Cloud · Open-source | 82 | — | — | $0.05 / $0.15 | 256K | 820.0 | Mar 2026 |
| 84 | Alibaba Cloud · Open-source | 82 | — | — | $0.1625 / $1.3 | 262K | 112.1 | Feb 2026 |
| 85 | Alibaba Cloud · Open-source | 82 | — | — | $0.195 / $1.56 | 262K | 93.4 | Feb 2026 |
| 86 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $2.08 | 262K | 70.1 | Feb 2026 |
| 87 | Alibaba Cloud · Speed & cost | 82 | — | — | $0.065 / $0.26 | 1M | 504.6 | Feb 2026 |
| 88 | Alibaba Cloud · Open-source | 82 | — | — | $0.26 / $1.56 | 1M | 90.1 | Feb 2026 |
| 89 | Alibaba Cloud · Open-source | 82 | — | — | $0.39 / $2.34 | 262K | 60.1 | Feb 2026 |
| 90 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.78 / $3.9 | 262K | 35.0 | Feb 2026 |
| 91 | Alibaba Cloud · Code generation | 82 | — | — | $0.12 / $0.75 | 262K | 188.5 | Feb 2026 |
| 92 | Alibaba Cloud · Open-source | 82 | — | — | $0.104 / $0.416 | 131K | 315.4 | Oct 2025 |
| 93 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.117 / $1.365 | 131K | 110.7 | Oct 2025 |
| 94 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.5 | 131K | 282.8 | Oct 2025 |
| 95 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.13 / $1.56 | 131K | 97.0 | Oct 2025 |
| 96 | Alibaba Cloud · Open-source | 82 | — | — | $0.13 / $0.52 | 131K | 252.3 | Oct 2025 |
| 97 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.26 / $2.6 | 131K | 57.3 | Sep 2025 |
| 98 | Alibaba Cloud · Open-source | 82 | — | — | $0.2 / $0.88 | 262K | 151.9 | Sep 2025 |
| 99 | Alibaba Cloud · Open-source | 82 | — | — | $0.78 / $3.9 | 262K | 35.0 | Sep 2025 |
| 100 | Alibaba Cloud · Code generation | 82 | — | — | $0.65 / $3.25 | 1M | 42.1 | Sep 2025 |
| 101 | Alibaba Cloud · Code generation | 82 | — | — | $0.195 / $0.975 | 1M | 140.2 | Sep 2025 |
| 102 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.0975 / $0.78 | 131K | 186.9 | Sep 2025 |
| 103 | Alibaba Cloud · Open-source | 82 | — | — | $0.09 / $1.1 | 262K | 137.8 | Sep 2025 |
| 104 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.08 / $0.4 | 131K | 341.7 | Aug 2025 |
| 105 | Alibaba Cloud · Code generation | 82 | — | — | $0.07 / $0.27 | 160K | 482.4 | Jul 2025 |
| 106 | Alibaba Cloud · Open-source | 82 | — | — | $0.09 / $0.3 | 262K | 420.5 | Jul 2025 |
| 107 | Alibaba Cloud · Hard reasoning | 82 | — | — | $0.1495 / $1.495 | 131K | 99.7 | Jul 2025 |
| 108 | Alibaba Cloud · Code generation | 82 | — | — | $0.22 / $1 | 262K | 134.4 | Jul 2025 |
| 109 | Alibaba Cloud · Open-source | 82 | — | — | $0.071 / $0.1 | 262K | 959.1 | Jul 2025 |
| 110 | xAI · Speed & cost | 82 | — | — | $0.3 / $0.5 | 131K | 205.0 | Jun 2025 |
| 111 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.28 | 41K | 455.6 | Apr 2025 |
| 112 | Alibaba Cloud · Open-source | 82 | — | — | $0.05 / $0.4 | 41K | 364.4 | Apr 2025 |
| 113 | Alibaba Cloud · Open-source | 82 | — | — | $0.06 / $0.24 | 41K | 546.7 | Apr 2025 |
| 114 | Alibaba Cloud · Open-source | 82 | — | — | $0.08 / $0.24 | 41K | 512.5 | Apr 2025 |
| 115 | Alibaba Cloud · Open-source | 82 | — | — | $0.455 / $1.82 | 131K | 72.1 | Apr 2025 |
| 116 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Apr 2025 |
| 117 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Apr 2025 |
| 118 | xAI · Speed & cost | 82 | — | — | $0.3 / $0.5 | 131K | 205.0 | Apr 2025 |
| 119 | Meta · Open-source | 82 | — | — | $0.15 / $0.6 | 1M | 218.7 | Apr 2025 |
| 120 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Feb 2025 |
| 121 | · General purpose | 82 | — | — | $4 / $8 | 131K | 13.7 | Feb 2025 |
| 122 | OpenAI · Hard reasoning | 82 | — | — | $1.1 / $4.4 | 200K | 29.8 | Jan 2025 |
| 123 | Alibaba Cloud · Open-source | 82 | — | — | $0.12 / $0.39 | 33K | 321.6 | Sep 2024 |
| 124 | · General purpose | 82 | — | — | $3.75 / $7.5 | 6K | 14.6 | Nov 2023 |
| 125 | Google · Speed & cost | 80 | — | — | $0.5 / $3 | 1M | 45.7 | Dec 2025 |
| 126 | Google · Image generation | 80 | — | — | $0.3 / $2.5 | 33K | 57.1 | Oct 2025 |
| 127 | Google · Speed & cost | 80 | — | — | $0.1 / $0.4 | 1M | 320.0 | Sep 2025 |
| 128 | Google · Speed & cost | 80 | — | — | $0.1 / $0.4 | 1M | 320.0 | Jul 2025 |
| 129 | Google · Speed & cost | 80 | — | — | $0.3 / $2.5 | 1M | 57.1 | Jun 2025 |
| 130 | OpenAI · Speed & cost | 80 | — | — | $0.4 / $1.6 | 1M | 80.0 | Apr 2025 |
| 131 | OpenAI · Search + citations | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Mar 2025 |
| 132 | OpenAI · Speed & cost | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Jul 2024 |
| 133 | OpenAI · Speed & cost | 80 | — | — | $0.15 / $0.6 | 128K | 213.3 | Jul 2024 |
| 134 | Mistral AI · Code generation | 78 | — | — | $0.3 / $0.9 | 256K | 130.0 | Aug 2025 |
| 135 | Google · Open-source | 76 | — | — | $0.13 / $0.4 | 262K | 286.8 | Apr 2026 |
| 136 | Google · Open-source | 76 | — | — | $0.14 / $0.4 | 262K | 281.5 | Apr 2026 |
| 137 | Anthropic · Speed & cost | 76 | — | — | $1 / $5 | 200K | 25.3 | Oct 2025 |
| 138 | Anthropic · Speed & cost | 76 | — | — | $0.8 / $4 | 200K | 31.7 | Nov 2024 |
| 139 | · Open-source | 74 | — | — | $1.2 / $4 | 203K | 28.5 | Apr 2026 |
| 140 | xAI · General purpose | 74 | — | — | $2 / $6 | 2M | 18.5 | Mar 2026 |
| 141 | · Open-source | 74 | — | — | $1.2 / $4 | 203K | 28.5 | Mar 2026 |
| 142 | OpenAI · General purpose | 74 | — | — | $2.5 / $10 | 128K | 11.8 | Jan 2026 |
| 143 | · General purpose | 74 | — | — | $1.25 / $1.25 | 128K | 59.2 | Nov 2025 |
| 144 | Amazon · General purpose | 74 | — | — | $2.5 / $12.5 | 1M | 9.9 | Oct 2025 |
| 145 | Perplexity · Search + citations | 74 | — | — | $3 / $15 | 200K | 8.2 | Oct 2025 |
| 146 | OpenAI · Image generation | 74 | — | — | $2.5 / $2 | 400K | 32.9 | Oct 2025 |
| 147 | · Open-source | 74 | — | — | $0.1 / $0.4 | 131K | 296.0 | Oct 2025 |
| 148 | OpenAI · Code generation | 74 | — | — | $1.25 / $10 | 400K | 13.2 | Sep 2025 |
| 149 | · General purpose | 74 | — | — | $2 / $8 | 256K | 14.8 | Aug 2025 |
| 150 | OpenAI · General purpose | 74 | — | — | $1.25 / $10 | 128K | 13.2 | Aug 2025 |
| 151 | xAI · General purpose | 74 | — | — | $3 / $15 | 256K | 8.2 | Jul 2025 |
| 152 | Meta · Speed & cost | 74 | — | — | $0.08 / $0.3 | 328K | 389.5 | Apr 2025 |
| 153 | Google · Open-source | 74 | — | — | $0.04 / $0.13 | 131K | 870.6 | Mar 2025 |
| 154 | Cohere · General purpose | 74 | — | — | $2.5 / $10 | 256K | 11.8 | Mar 2025 |
| 155 | Google · Open-source | 74 | — | — | $0.08 / $0.16 | 131K | 616.7 | Mar 2025 |
| 156 | Perplexity · Search + citations | 74 | — | — | $2 / $8 | 128K | 14.8 | Mar 2025 |
| 157 | Perplexity · Search + citations | 74 | — | — | $3 / $15 | 200K | 8.2 | Mar 2025 |
| 158 | Perplexity · Deep research | 74 | — | — | $2 / $8 | 128K | 14.8 | Mar 2025 |
| 159 | Qwen: Qwen-Max (OSS) · Alibaba Cloud · Open-source | 74 | — | — | $1.04 / $4.16 | 33K | 28.5 | Feb 2025 |
| 160 | · Hard reasoning | 74 | — | — | $3 / $3 | 16K | 24.7 | Jan 2025 |
| 161 | Meta · Open-source | 74 | — | — | $0.1 / $0.32 | 131K | 352.4 | Dec 2024 |
| 162 | Mistral AI · Open-source | 74 | — | — | $2 / $6 | 131K | 18.5 | Nov 2024 |
| 163 | · General purpose | 74 | — | — | $3 / $5 | 16K | 18.5 | Oct 2024 |
| 164 | · Open-source | 74 | — | — | $1.2 / $1.2 | 131K | 61.7 | Oct 2024 |
| 165 | · General purpose | 74 | — | — | $2.5 / $10 | 8K | 11.8 | Oct 2024 |
| 166 | · General purpose | 74 | — | — | $2.5 / $10 | 8K | 11.8 | Oct 2024 |
| 167 | · Search + citations | 74 | — | — | $0.3 / $0.3 | 131K | 246.7 | Aug 2024 |
| 168 | Meta · Open-source | 74 | — | — | $0.4 / $0.4 | 131K | 185.0 | Jul 2024 |
| 169 | · Hard reasoning | 74 | — | — | $1.48 / $1.48 | 8K | 50.0 | Jun 2024 |
| 170 | OpenAI · General purpose | 74 | — | — | $1.5 / $2 | 4K | 42.3 | Sep 2023 |
| 171 | OpenAI · General purpose | 74 | — | — | $3 / $4 | 16K | 21.1 | Aug 2023 |
| 172 | Google · Speed & cost | 73 | — | — | $0.075 / $0.3 | 1M | 389.3 | Feb 2025 |
| 173 | Google · Speed & cost | 73 | — | — | $0.1 / $0.4 | 1M | 292.0 | Feb 2025 |
| 174 | OpenAI · Speed & cost | 72 | — | — | $0.2 / $1.25 | 400K | 99.3 | Mar 2026 |
| 175 | Mistral AI · Open-source | 72 | — | — | $0.15 / $0.6 | 262K | 192.0 | Mar 2026 |
| 176 | Mistral AI · Open-source | 72 | — | — | $0.1 / $0.3 | 33K | 360.0 | Dec 2025 |
| 177 | OpenAI · Speed & cost | 72 | — | — | $0.05 / $0.4 | 400K | 320.0 | Aug 2025 |
| 178 | Mistral AI · Open-source | 72 | — | — | $0.075 / $0.2 | 128K | 523.6 | Jun 2025 |
| 179 | OpenAI · Speed & cost | 72 | — | — | $0.1 / $0.4 | 1M | 288.0 | Apr 2025 |
| 180 | Mistral AI · Open-source | 72 | — | — | $0.03 / $0.11 | 131K | 1028.6 | Mar 2025 |
| 181 | Mistral AI · Open-source | 72 | — | — | $0.05 / $0.08 | 33K | 1107.7 | Jan 2025 |
| 182 | Mistral AI · Open-source | 72 | — | — | $0.02 / $0.04 | 131K | 2400.0 | Jul 2024 |
| 183 | Mistral AI · Open-source | 72 | — | — | $2 / $6 | 66K | 18.0 | Apr 2024 |
| 184 | Anthropic · Speed & cost | 72 | — | — | $0.25 / $1.25 | 200K | 96.0 | Mar 2024 |
| 185 | Mistral AI · Open-source | 72 | — | — | $0.54 / $0.54 | 33K | 133.3 | Dec 2023 |
| 186 | · Speed & cost | 66 | — | — | $0.4 / $2 | 262K | 55.0 | Mar 2026 |
| 187 | · Speed & cost | 66 | — | — | $1 / $3 | 1M | 33.0 | Mar 2026 |
| 188 | Google · Image generation | 66 | — | — | $0.5 / $3 | 66K | 37.7 | Feb 2026 |
| 189 | · Speed & cost | 66 | — | — | $0.8 / $1.6 | 131K | 55.0 | Feb 2026 |
| 190 | · Speed & cost | 66 | — | — | $0.6 / $6 | 1M | 20.0 | Jan 2026 |
| 191 | OpenAI · Speed & cost | 66 | — | — | $0.6 / $2.4 | 128K | 44.0 | Jan 2026 |
| 192 | · Open-source | 66 | — | — | $0.39 / $1.75 | 203K | 61.7 | Dec 2025 |
| 193 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 262K | 55.0 | Dec 2025 |
| 194 | · Search + citations | 66 | — | — | $1 / $3 | 256K | 33.0 | Dec 2025 |
| 195 | Moonshot AI · Hard reasoning | 66 | — | — | $0.47 / $2 | 131K | 53.4 | Nov 2025 |
| 196 | · Open-source | 66 | — | — | $0.39 / $1.9 | 205K | 57.6 | Sep 2025 |
| 197 | · Speed & cost | 66 | — | — | $0.85 / $1.25 | 256K | 62.9 | Sep 2025 |
| 198 | Moonshot AI · Speed & cost | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Sep 2025 |
| 199 | · Search + citations | 66 | — | — | $1 / $3 | 131K | 33.0 | Aug 2025 |
| 200 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Aug 2025 |
| 201 | · Open-source | 66 | — | — | $0.6 / $1.8 | 66K | 55.0 | Aug 2025 |
| 202 | · Open-source | 66 | — | — | $0.6 / $2.2 | 131K | 47.1 | Jul 2025 |
| 203 | · Speed & cost | 66 | — | — | $0.85 / $3.4 | 131K | 31.1 | Jul 2025 |
| 204 | Moonshot AI · Speed & cost | 66 | — | — | $0.57 / $2.3 | 131K | 46.0 | Jul 2025 |
| 205 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | Jul 2025 |
| 206 | · Speed & cost | 66 | — | — | $0.9 / $1.9 | 262K | 47.1 | Jul 2025 |
| 207 | · Speed & cost | 66 | — | — | $0.8 / $1.2 | 82K | 66.0 | Jul 2025 |
| 208 | · Speed & cost | 66 | — | — | $0.42 / $1.25 | 123K | 79.0 | Jun 2025 |
| 209 | · Speed & cost | 66 | — | — | $0.4 / $2.2 | 1M | 50.8 | Jun 2025 |
| 210 | Mistral AI · Open-source | 66 | — | — | $0.4 / $2 | 131K | 55.0 | May 2025 |
| 211 | · Speed & cost | 66 | — | — | $0.9 / $3.3 | 131K | 31.4 | May 2025 |
| 212 | · Speed & cost | 66 | — | — | $0.75 / $1.2 | 131K | 67.7 | May 2025 |
| 213 | · Code generation | 66 | — | — | $0.5 / $0.8 | 33K | 101.5 | May 2025 |
| 214 | · Speed & cost | 66 | — | — | $0.8 / $1.2 | 4K | 66.0 | Apr 2025 |
| 215 | · Open-source | 66 | — | — | $0.8 / $1.2 | 4K | 66.0 | Apr 2025 |
| 216 | · Open-source | 66 | — | — | $0.6 / $1.8 | 131K | 55.0 | Apr 2025 |
| 217 | · Speed & cost | 66 | — | — | $0.55 / $0.8 | 33K | 97.8 | Mar 2025 |
| 218 | · Speed & cost | 66 | — | — | $0.7 / $1.4 | 131K | 62.9 | Feb 2025 |
| 219 | Alibaba Cloud · Open-source | 66 | — | — | $0.52 / $2.08 | 131K | 50.8 | Feb 2025 |
| 220 | Alibaba Cloud · Open-source | 66 | — | — | $0.8 / $0.8 | 33K | 82.5 | Feb 2025 |
| 221 | Perplexity · Search + citations | 66 | — | — | $1 / $1 | 127K | 66.0 | Jan 2025 |
| 222 | · Hard reasoning | 66 | — | — | $0.65 / $0.75 | 131K | 94.3 | Dec 2024 |
| 223 | Amazon · Speed & cost | 66 | — | — | $0.8 / $3.2 | 300K | 33.0 | Dec 2024 |
| 224 | Alibaba Cloud · Code generation | 66 | — | — | $0.66 / $1 | 33K | 79.5 | Nov 2024 |
| 225 | · Speed & cost | 66 | — | — | $0.4 / $0.4 | 33K | 165.0 | Nov 2024 |
| 226 | · Hard reasoning | 66 | — | — | $0.85 / $0.85 | 131K | 77.6 | Aug 2024 |
| 227 | · Search + citations | 66 | — | — | $1 / $1 | 131K | 66.0 | Aug 2024 |
| 228 | Meta · Open-source | 66 | — | — | $0.51 / $0.74 | 8K | 105.6 | Apr 2024 |
| 229 | OpenAI · Speed & cost | 66 | — | — | $1 / $2 | 4K | 44.0 | Jan 2024 |
| 230 | · Speed & cost | 66 | — | — | $0.75 / $1 | 8K | 75.4 | Aug 2023 |
| 231 | · Speed & cost | 66 | — | — | $0.45 / $0.65 | 6K | 120.0 | Jul 2023 |
| 232 | OpenAI · Speed & cost | 66 | — | — | $0.5 / $1.5 | 16K | 66.0 | May 2023 |
| 233 | Google · Open-source | 65 | — | — | $0.04 / $0.08 | 131K | 1083.3 | Mar 2025 |
| 234 | · Open-source | 65 | — | — | $0.8 / $1.6 | 33K | 54.2 | Feb 2025 |
| 235 | · Speed & cost | 65 | — | — | $0.065 / $0.14 | 16K | 634.1 | Jan 2025 |
| 236 | Meta · Open-source | 65 | — | — | $0.02 / $0.05 | 16K | 1857.1 | Jul 2024 |
| 237 | Google · Open-source | 65 | — | — | $0.65 / $0.65 | 8K | 100.0 | Jul 2024 |
| 238 | Google · Open-source | 65 | — | — | $0.03 / $0.09 | 8K | 1083.3 | Jun 2024 |
| 239 | · Search + citations | 65 | — | — | $0.14 / $0.14 | 8K | 464.3 | May 2024 |
| 240 | Meta · Open-source | 65 | — | — | $0.03 / $0.04 | 8K | 1857.1 | Apr 2024 |
| 241 | Google · Speed & cost | 62 | — | — | $0.25 / $1.5 | 1M | 70.9 | Mar 2026 |
| 242 | · Speed & cost | 62 | — | — | $0.05 / $0.2 | 262K | 496.0 | Dec 2025 |
| 243 | · Speed & cost | 62 | — | — | $0.2 / $0.6 | 131K | 155.0 | Oct 2025 |
| 244 | · Speed & cost | 62 | — | — | $0.017 / $0.11 | 131K | 976.4 | Oct 2025 |
| 245 | · Speed & cost | 62 | — | — | $0.04 / $0.16 | 131K | 620.0 | Sep 2025 |
| 246 | Amazon · Speed & cost | 62 | — | — | $0.035 / $0.14 | 128K | 708.6 | Dec 2024 |
| 247 | · Speed & cost | 62 | — | — | $0.62 / $0.62 | 66K | 100.0 | Apr 2024 |
| 248 | · Hard reasoning | 58 | — | — | $0.22 / $0.85 | 262K | 108.4 | Apr 2026 |
| 249 | · Code generation | 58 | — | — | $0.3 / $1.2 | 256K | 77.3 | Mar 2026 |
| 250 | · Speed & cost | 58 | — | — | $0.1 / $0.1 | 16K | 580.0 | Mar 2026 |
| 251 | · Speed & cost | 58 | — | — | $0.3 / $1.2 | 205K | 77.3 | Mar 2026 |
| 252 | · Speed & cost | 58 | — | — | $0.1 / $0.5 | 262K | 193.3 | Mar 2026 |
| 253 | · Speed & cost | 58 | — | — | $0.25 / $2 | 262K | 51.6 | Mar 2026 |
| 254 | · Speed & cost | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Mar 2026 |
| 255 | · Speed & cost | 58 | — | — | $0.1 / $0.4 | 262K | 232.0 | Feb 2026 |
| 256 | · Speed & cost | 58 | — | — | $0.118 / $0.99 | 197K | 104.7 | Feb 2026 |
| 257 | · Speed & cost | 58 | — | — | $0.1 / $0.3 | 262K | 290.0 | Jan 2026 |
| 258 | · Speed & cost | 58 | — | — | $0.15 / $0.6 | 128K | 154.7 | Jan 2026 |
| 259 | · Speed & cost | 58 | — | — | $0.3 / $1.2 | 66K | 77.3 | Jan 2026 |
| 260 | · Speed & cost | 58 | — | — | $0.06 / $0.4 | 203K | 252.2 | Jan 2026 |
| 261 | · Open-source | 58 | — | — | $0.2 / $0.6 | 66K | 145.0 | Jan 2026 |
| 262 | · Speed & cost | 58 | — | — | $0.075 / $0.3 | 262K | 309.3 | Dec 2025 |
| 263 | · Speed & cost | 58 | — | — | $0.25 / $2 | 262K | 51.6 | Dec 2025 |
| 264 | · Speed & cost | 58 | — | — | $0.27 / $0.95 | 197K | 95.1 | Dec 2025 |
| 265 | · Speed & cost | 58 | — | — | $0.09 / $0.29 | 262K | 305.3 | Dec 2025 |
| 266 | · Open-source | 58 | — | — | $0.3 / $0.9 | 131K | 96.7 | Dec 2025 |
| 267 | · Speed & cost | 58 | — | — | $0.15 / $0.15 | 33K | 386.7 | Dec 2025 |
| 268 | Amazon · Speed & cost | 58 | — | — | $0.3 / $2.5 | 1M | 41.4 | Dec 2025 |
| 269 | Mistral AI · Speed & cost | 58 | — | — | $0.2 / $0.2 | 262K | 290.0 | Dec 2025 |
| 270 | Mistral AI · Speed & cost | 58 | — | — | $0.15 / $0.15 | 262K | 386.7 | Dec 2025 |
| 271 | Mistral AI · Speed & cost | 58 | — | — | $0.1 / $0.1 | 131K | 580.0 | Dec 2025 |
| 272 | · Speed & cost | 58 | — | — | $0.2 / $1.1 | 131K | 89.2 | Nov 2025 |
| 273 | · Hard reasoning | 58 | — | — | $0.15 / $0.5 | 66K | 178.5 | Nov 2025 |
| 274 | xAI · Speed & cost | 58 | — | — | $0.2 / $0.5 | 2M | 165.7 | Nov 2025 |
| 275 | OpenAI · Code generation | 58 | — | — | $0.25 / $2 | 400K | 51.6 | Nov 2025 |
| 276 | Mistral AI · Open-source | 58 | — | — | $0.1 / $0.3 | 32K | 290.0 | Oct 2025 |
| 277 | OpenAI · Speed & cost | 58 | — | — | $0.075 / $0.3 | 131K | 309.3 | Oct 2025 |
| 278 | · Speed & cost | 58 | — | — | $0.255 / $1 | 197K | 92.4 | Oct 2025 |
| 279 | · Hard reasoning | 58 | — | — | $0.07 / $0.28 | 131K | 331.4 | Oct 2025 |
| 280 | · Speed & cost | 58 | — | — | $0.3 / $0.5 | 131K | 145.0 | Sep 2025 |
| 281 | xAI · Speed & cost | 58 | — | — | $0.2 / $0.5 | 2M | 165.7 | Sep 2025 |
| 282 | Alibaba Cloud · Search + citations | 58 | — | — | $0.09 / $0.45 | 131K | 214.8 | Sep 2025 |
| 283 | · Speed & cost | 58 | — | — | $0.2 / $0.8 | 131K | 116.0 | Sep 2025 |
| 284 | Alibaba Cloud · Hard reasoning | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Sep 2025 |
| 285 | Alibaba Cloud · Open-source | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Sep 2025 |
| 286 | xAI · Speed & cost | 58 | — | — | $0.2 / $1.5 | 256K | 68.2 | Aug 2025 |
| 287 | · Search + citations | 58 | — | — | $0.13 / $0.4 | 131K | 218.9 | Aug 2025 |
| 288 | · Speed & cost | 58 | — | — | $0.07 / $0.28 | 120K | 331.4 | Aug 2025 |
| 289 | · Speed & cost | 58 | — | — | $0.14 / $0.56 | 30K | 165.7 | Aug 2025 |
| 290 | · Open-source | 58 | — | — | $0.13 / $0.85 | 131K | 118.4 | Jul 2025 |
| 291 | Z.ai: GLM 4 32B (OSS) · Open-source | 58 | — | — | $0.1 / $0.1 | 128K | 580.0 | Jul 2025 |
| 292 | · Speed & cost | 58 | — | — | $0.1 / $0.2 | 128K | 386.7 | Jul 2025 |
| 293 | Mistral AI · Open-source | 58 | — | — | $0.1 / $0.3 | 131K | 290.0 | Jul 2025 |
| 294 | · Speed & cost | 58 | — | — | $0.14 / $0.57 | 131K | 163.4 | Jul 2025 |
| 295 | · Speed & cost | 58 | — | — | $0.28 / $1.1 | 123K | 84.1 | Jun 2025 |
| 296 | · Speed & cost | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Jun 2025 |
| 297 | · Speed & cost | 58 | — | — | $0.18 / $0.18 | 131K | 322.2 | May 2025 |
| 298 | · Code generation | 58 | — | — | $0.25 / $0.75 | 128K | 116.0 | Apr 2025 |
| 299 | Meta · Open-source | 58 | — | — | $0.18 / $0.18 | 164K | 322.2 | Apr 2025 |
| 300 | Alibaba Cloud · Open-source | 58 | — | — | $0.2 / $0.6 | 128K | 145.0 | Mar 2025 |
| 301 | · Speed & cost | 58 | — | — | $0.1 / $0.2 | 66K | 386.7 | Mar 2025 |
| 302 | Alibaba Cloud · Hard reasoning | 58 | — | — | $0.15 / $0.58 | 131K | 158.9 | Mar 2025 |
| 303 | Mistral AI · Open-source | 58 | — | — | $0.2 / $0.6 | 33K | 145.0 | Feb 2025 |
| 304 | Alibaba Cloud · Open-source | 58 | — | — | $0.1365 / $0.4095 | 131K | 212.5 | Feb 2025 |
| 305 | Alibaba Cloud · Open-source | 58 | — | — | $0.26 / $0.78 | 1M | 111.5 | Feb 2025 |
| 306 | · Speed & cost | 58 | — | — | $0.2 / $1.1 | 1M | 89.2 | Jan 2025 |
| 307 | Amazon · Speed & cost | 58 | — | — | $0.06 / $0.24 | 300K | 386.7 | Dec 2024 |
| 308 | · Speed & cost | 58 | — | — | $0.17 / $0.43 | 33K | 193.3 | Sep 2024 |
| 309 | Meta · Open-source | 58 | — | — | $0.051 / $0.34 | 80K | 296.7 | Sep 2024 |
| 310 | Cohere · Open-source | 58 | — | — | $0.15 / $0.6 | 128K | 154.7 | Aug 2024 |
| 311 | Mistral AI · Open-source | 58 | — | — | $0.11 / $0.19 | 3K | 386.7 | Sep 2023 |
| 312 | · Speed & cost | 58 | — | — | $0.06 / $0.06 | 4K | 966.7 | Jul 2023 |
| 313 | · Speed & cost | 50 | — | — | $0.03 / $0.12 | 33K | 666.7 | Feb 2026 |
| 314 | · Speed & cost | 50 | — | — | $0.045 / $0.15 | 131K | 512.8 | Dec 2025 |
| 315 | OpenAI · Speed & cost | 50 | — | — | $0.039 / $0.19 | 131K | 436.7 | Aug 2025 |
| 316 | OpenAI · Speed & cost | 50 | — | — | $0.03 / $0.11 | 131K | 714.3 | Aug 2025 |
| 317 | Google · Open-source | 50 | — | — | $0.02 / $0.04 | 33K | 1666.7 | May 2025 |
| 318 | Alibaba Cloud · Code generation | 50 | — | — | $0.03 / $0.09 | 33K | 833.3 | Apr 2025 |
| 319 | · Open-source | 50 | — | — | $0.05 / $0.2 | 128K | 400.0 | Mar 2025 |
| 320 | Meta · Open-source | 50 | — | — | $0.02 / $0.06 | 131K | 1250.0 | Feb 2025 |
| 321 | Alibaba Cloud · Open-source | 50 | — | — | $0.0325 / $0.13 | 131K | 615.4 | Feb 2025 |
| 322 | Cohere · Open-source | 50 | — | — | $0.0375 / $0.15 | 128K | 533.3 | Dec 2024 |
| 323 | Alibaba Cloud · Open-source | 50 | — | — | $0.04 / $0.1 | 33K | 714.3 | Oct 2024 |
| 324 | Meta · Open-source | 50 | — | — | $0.027 / $0.2 | 60K | 440.5 | Sep 2024 |
| 325 | Meta · Open-source | 50 | — | — | $0.049 / $0.049 | 131K | 1020.4 | Sep 2024 |
| 326 | · Hard reasoning | 50 | — | — | $0.04 / $0.05 | 8K | 1111.1 | Aug 2024 |
LLM Leaderboard May 2026
Large language models ranked by LMSys Arena Elo, MMLU, HumanEval, MATH, pricing, and tokens-per-second. Text-only view.
LM Leaderboard May 2026
Language model rankings: LMArena Elo, price-to-Elo ratio, and open-weight vs closed-source comparison.
LMSys Arena Leaderboard May 2026
LMArena (formerly LMSys Chatbot Arena) tracker — pairwise human preference Elo scores, refreshed as the public arena publishes.
Image Model Leaderboard 2026
Generative AI image and video models — Imagen 4, Flux 2, DALL-E 4, Stable Diffusion 4 Ultra, Sora 2 ranked by quality and cost.
Coding Model Leaderboard 2026
AI coding assistants ranked: Claude Opus, GPT-5.5, Gemini 3.1 Pro, DeepSeek V4, plus HumanEval and SWE-Bench scores.
Vendor Lock-in Leaderboard 2026
AI vendors ranked by portability — license, weight availability, fine-tuning openness, and exit cost score.
How We Rank AI Models
Our leaderboard uses a composite quality index that combines three benchmarks: MMLU Pro (knowledge and reasoning across 14 subject areas), HumanEval (code generation), and MATH (mathematical problem-solving). Scores are normalized to a 0-100 scale and cross-referenced against LMSYS Chatbot Arena Elo ratings for real-world validation.
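As a rough illustration of how a composite index like this could be computed, here is a minimal Python sketch. The exact weighting and normalization are not published on this page, so the equal weights, the `BenchmarkScores` type, and the `quality_index` function are all assumptions made for the example, not the leaderboard's actual method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkScores:
    """Benchmark accuracies as percentages in the range 0-100."""
    mmlu_pro: float
    humaneval: float
    math: float


# Hypothetical equal weights: the page does not state its real weighting.
WEIGHTS = {"mmlu_pro": 1 / 3, "humaneval": 1 / 3, "math": 1 / 3}


def quality_index(scores: BenchmarkScores, weights: dict = WEIGHTS) -> float:
    """Weighted mean of the three benchmark scores, clamped to 0-100."""
    total_weight = sum(weights.values())
    weighted_sum = sum(getattr(scores, name) * w for name, w in weights.items())
    return min(100.0, max(0.0, weighted_sum / total_weight))


# Made-up scores for illustration only (not taken from the table).
example = BenchmarkScores(mmlu_pro=80.0, humaneval=90.0, math=100.0)
print(f"{quality_index(example):.1f}")
```

Changing `WEIGHTS` is the simplest way to see how sensitive a ranking is to the choice of weighting.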
We track speed (tokens per second), time-to-first-token (TTFT), pricing, and context window size to give you a complete picture. The Value Score divides quality by cost, showing you which models deliver the most capability per dollar.
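The page does not spell out which cost figure goes into the Value Score. The published Value column is, however, consistent with quality divided by the mean of the input and output prices per 1M tokens. The sketch below uses that inferred formula; treat it as a reading of the table rather than a confirmed definition. The function name `value_score` is hypothetical.

```python
def value_score(quality: float, input_price: float, output_price: float) -> float:
    """Quality per dollar, using the mean of input and output price per 1M tokens.

    Inferred from the table's Value column; not an officially stated formula.
    """
    blended_price = (input_price + output_price) / 2
    if blended_price <= 0:
        raise ValueError("prices must be positive")
    return round(quality / blended_price, 1)


# Rows taken from the table above: (quality, input $, output $, published value)
rows = [
    (98, 5.0, 30.0, 5.6),    # row 2
    (97, 5.0, 25.0, 6.5),    # row 3
    (94, 1.25, 2.5, 50.1),   # row 13
]
for quality, price_in, price_out, published in rows:
    print(value_score(quality, price_in, price_out), published)
```

Because the score weights input and output prices equally, workloads that are heavily skewed toward output tokens may want to recompute it with their own token mix.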
Key Trends in AI Model Performance
- Open-source catching up: DeepSeek R1 and V3 now compete with top closed-source models on reasoning and coding benchmarks
- Reasoning specialization: Models like o3 and R1 trade speed for dramatically better performance on complex tasks
- Context windows expanding: 1M+ tokens is now standard for flagship models, with Llama 4 Scout supporting 10M
- Speed improving: Flash-tier models now exceed 200 tokens/second while maintaining strong quality
Choosing the Right Model
There is no single "best" model — it depends on your use case. For most applications, a model routing approach works best: route simple queries to fast, cheap models and complex queries to frontier models. This gives you the best of both worlds — low cost and high quality.
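The routing idea can be sketched in a few lines of Python. This is a deliberately naive illustration: the tier names, prices, keyword list, and word-count threshold are all hypothetical placeholders, and a production router would more likely use a classifier or a confidence signal than keyword matching.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str                   # placeholder identifier, not a real model ID
    output_price_per_1m: float  # illustrative price in dollars


# Hypothetical tiers for the example.
CHEAP = ModelTier("cheap-fast-model", 0.28)
FRONTIER = ModelTier("frontier-model", 30.0)

# Assumed heuristic: words that tend to signal a harder request.
COMPLEX_HINTS = ("prove", "refactor", "analyze", "step by step", "debug")


def route(prompt: str, max_simple_words: int = 60) -> ModelTier:
    """Send long or complex-looking prompts to the frontier tier, the rest to the cheap tier."""
    text = prompt.lower()
    is_long = len(text.split()) > max_simple_words
    has_hint = any(hint in text for hint in COMPLEX_HINTS)
    return FRONTIER if (is_long or has_hint) else CHEAP


print(route("What is the capital of France?").name)
print(route("Debug this race condition and refactor the module.").name)
```

The first prompt goes to the cheap tier and the second to the frontier tier. Tuning the threshold and hint list against real traffic is what determines how much cost the router actually saves.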