Updated Apr 6, 2026

AI Model Leaderboard

Every major AI model ranked by quality, speed, pricing, and value. Filter by category, sort by any metric, and find the right model for your use case.

Gold: o3 (Quality Index 96)
Silver: Claude Opus 4 (Quality Index 95)
Bronze: Gemini 2.5 Pro (Quality Index 92)

23 models

| # | Model | Quality | Arena ELO | Speed | Price (in / out, per 1M tokens) | Context | Value | Released |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | o3 · OpenAI · Hard reasoning | 96 | 1370 | 68 t/s | $10 / $40 | 200K | 3.8 | Apr 2025 |
| 2 | Claude Opus 4 · Anthropic · Complex analysis | 95 | 1360 | 52 t/s | $15 / $75 | 200K | 2.1 | May 2025 |
| 3 | Gemini 2.5 Pro · Google · Multimodal + value | 92 | 1345 | 87 t/s | $1.25 / $10 | 1M | 16.4 | Mar 2025 |
| 4 | DeepSeek R1 · DeepSeek · Cheap reasoning | 91 | 1350 | 35 t/s | $0.55 / $2.19 | 128K | 66.4 | Jan 2025 |
| 5 | OpenAI · Long context | 89 | 1310 | 120 t/s | $2 / $8 | 1M | 17.8 | Apr 2025 |
| 6 | OpenAI · Reasoning & math | 88 | 1305 | 155 t/s | $1.1 / $4.4 | 200K | 32.0 | Jan 2025 |
| 7 | Anthropic · Coding & balance | 88 | 1320 | 95 t/s | $3 / $15 | 200K | 9.8 | May 2025 |
| 8 | xAI · Real-time info | 87 | 1330 | 82 t/s | $3 / $15 | 131K | 9.7 | Feb 2025 |
| 9 | DeepSeek V3 · DeepSeek · Best open-source value | 86 | 1310 | 62 t/s | $0.27 / $1.1 | 128K | 125.5 | Mar 2025 |
| 10 | OpenAI · General purpose | 85 | 1285 | 109 t/s | $2.5 / $10 | 128K | 13.6 | May 2024 |
| 11 | Meta · Open-source value | 80 | 1260 | 135 t/s | $0.2 / $0.6 | 1M | 200.0 | Apr 2025 |
| 12 | Alibaba Cloud · Open-source flagship | 80 | 1255 | 85 t/s | $0.3 / $0.9 | 131K | 133.3 | Sep 2024 |
| 13 | Mistral AI · Multilingual | 79 | 1250 | 78 t/s | $2 / $6 | 128K | 19.8 | Nov 2024 |
| 14 | xAI · Budget reasoning | 78 | 1275 | 165 t/s | $0.3 / $0.5 | 131K | 195.0 | Feb 2025 |
| 15 | Perplexity · Search + citations | 78 | — | 65 t/s | $3 / $15 | 200K | 8.7 | Feb 2025 |
| 16 | Mistral AI · Code generation | 76 | — | 195 t/s | $0.3 / $0.9 | 256K | 126.7 | Jan 2025 |
| 17 | Anthropic · Speed & cost | 75 | 1230 | 172 t/s | $0.8 / $4 | 200K | 31.3 | Oct 2024 |
| 18 | Google · Fastest + cheapest | 74 | 1240 | 244 t/s | $0.1 / $0.4 | 1M | 296.0 | Feb 2025 |
| 19 | Alibaba Cloud · Open-source coding | 74 | — | 125 t/s | $0.15 / $0.45 | 131K | 246.7 | Nov 2024 |
| 20 | OpenAI · High throughput | 72 | 1216 | 183 t/s | $0.15 / $0.6 | 128K | 192.0 | Jul 2024 |
| 21 | Llama 4 Scout · Meta · Longest context | 71 | 1195 | 198 t/s | $0.15 / $0.4 | 10M | 258.2 | Apr 2025 |
| 22 | Amazon · AWS ecosystem | 70 | — | 110 t/s | $0.8 / $3.2 | 300K | 35.0 | Dec 2024 |
| 23 | Cohere · Enterprise RAG | 68 | 1170 | 72 t/s | $2.5 / $10 | 128K | 10.9 | Aug 2024 |
Quality = composite benchmark score (MMLU, HumanEval, MATH)
Arena ELO = LMSYS Chatbot Arena rating
Price = input / output cost per 1M tokens
Value = quality per dollar: the Quality Index divided by the average of the input and output prices

Try Any Model Instantly

Compare models side-by-side in our AI Playground. Send the same prompt to two models and see the difference in quality and speed.
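If you'd rather script the comparison, here is a minimal sketch of the same idea against any OpenAI-compatible API. The model IDs are placeholders (swap in whatever pair you want to compare), and it measures wall-clock latency rather than tokens per second:

```python
# Send one prompt to two models and compare output and latency.
# Assumes the `openai` Python package (v1+) and an OPENAI_API_KEY
# in the environment; model IDs below are placeholder choices.
import time
from openai import OpenAI

client = OpenAI()

def race(prompt: str, models: list[str]) -> None:
    for model in models:
        start = time.perf_counter()
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        elapsed = time.perf_counter() - start
        text = resp.choices[0].message.content
        print(f"--- {model} ({elapsed:.1f}s) ---\n{text}\n")

race("Explain vector clocks in two sentences.", ["gpt-4o", "gpt-4o-mini"])
```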

How We Rank AI Models

Our leaderboard uses a composite quality index that combines three key benchmarks: MMLU (measuring knowledge and reasoning across 57 subjects), HumanEval (measuring code generation ability), and MATH (measuring mathematical problem-solving). Scores are normalized to a 0-100 scale and cross-referenced against LMSYS Chatbot Arena ELO ratings for real-world validation.
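As an illustration, a composite index of this shape can be computed as in the sketch below. The equal weighting and min-max normalization are assumptions for the example; the page does not publish the exact recipe:

```python
# Illustrative composite quality index over the three benchmarks named
# above. Equal weights and min-max normalization are assumptions.
BENCH_RANGES = {              # (worst, best) raw scores for normalization
    "MMLU": (25.0, 100.0),    # 4-way multiple choice, so 25% = chance
    "HumanEval": (0.0, 100.0),
    "MATH": (0.0, 100.0),
}

def normalize(bench: str, raw: float) -> float:
    """Map a raw benchmark score onto a 0-100 scale."""
    lo, hi = BENCH_RANGES[bench]
    return max(0.0, min(100.0, 100.0 * (raw - lo) / (hi - lo)))

def quality_index(raw_scores: dict[str, float]) -> float:
    """Equal-weight mean of normalized benchmark scores (0-100)."""
    normed = [normalize(b, s) for b, s in raw_scores.items()]
    return sum(normed) / len(normed)

# Hypothetical raw scores, just to show the shape of the computation.
print(round(quality_index({"MMLU": 92.0, "HumanEval": 98.0, "MATH": 96.5})))
```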

We track speed (tokens per second), time-to-first-token (TTFT), pricing, and context window size to give you a complete picture. The Value Score divides quality by cost, showing you which models deliver the most capability per dollar.
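The value arithmetic is easy to check against the table: o3 scores 96 with a $10 / $40 price, and 96 ÷ ((10 + 40) / 2) = 3.84, which appears above as 3.8. A minimal sketch:

```python
# Value score as the table computes it: Quality Index divided by the
# average of input and output price per 1M tokens.
def value_score(quality: float, price_in: float, price_out: float) -> float:
    blended = (price_in + price_out) / 2  # $ per 1M tokens
    return quality / blended

print(round(value_score(96, 10, 40), 1))    # rank 1  -> 3.8
print(round(value_score(74, 0.1, 0.4), 1))  # rank 18 -> 296.0
```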

Key Trends in AI Model Performance

  • Open-source catching up: DeepSeek R1 and V3 now compete with top closed-source models on reasoning and coding benchmarks
  • Reasoning specialization: Models like o3 and R1 trade speed for dramatically better performance on complex tasks
  • Context windows expanding: 1M+ tokens is now standard for flagship models, with Llama 4 Scout supporting 10M
  • Speed improving: Flash-tier models now exceed 200 tokens/second while maintaining strong quality

Choosing the Right Model

There is no single "best" model; the right choice depends on your use case. For most applications, a model-routing approach works best: route simple queries to fast, cheap models and complex queries to frontier models. You get low cost on easy traffic and high quality where it matters, as in the sketch below.
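A minimal router sketch. The keyword heuristic and the two model choices are purely illustrative; production routers often use a cheap classifier model instead of hand-written rules:

```python
# Route each prompt to a cheap or frontier tier based on a simple
# complexity heuristic. Model IDs are placeholder choices.
CHEAP_MODEL = "gpt-4o-mini"   # fast, low-cost tier
FRONTIER_MODEL = "o3"         # expensive reasoning tier

HARD_HINTS = ("prove", "debug", "optimize", "step by step", "analyze")

def pick_model(prompt: str) -> str:
    """Send long or reasoning-heavy prompts to the frontier model."""
    text = prompt.lower()
    if len(prompt) > 2000 or any(hint in text for hint in HARD_HINTS):
        return FRONTIER_MODEL
    return CHEAP_MODEL

print(pick_model("What's the capital of France?"))      # gpt-4o-mini
print(pick_model("Prove that sqrt(2) is irrational."))  # o3
```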