Updated May 14, 2026

AI Model Pricing Index

Compare API pricing across every major AI provider. Includes a sortable table, historical trends, and an interactive cost calculator for estimating your monthly spend.

Models tracked: 326
Providers: 52
Cheapest input: $0.02 / 1M
Price range: 8824x

Full Pricing Table

Model | Provider | Input / 1M | Output / 1M | Blended | Quality | Value | Context

Open-source

Mistral AI$0.02$0.04$0.03
72
2400.0131K

Open-source

Google$0.02$0.04$0.03
50
1666.733K

Open-source

Meta$0.02$0.05$0.04
65
1857.116K

Open-source

Meta$0.03$0.04$0.04
65
1857.18K

Open-source

Meta$0.02$0.06$0.04
50
1250.0131K

Hard reasoning

sao10k$0.04$0.05$0.04
50
1111.18K

Open-source

Meta$0.05$0.05$0.05
50
1020.4131K

Open-source

Google$0.04$0.08$0.06
65
1083.3131K

Open-source

Google$0.03$0.09$0.06
65
1083.38K

Speed & cost

gryphe$0.06$0.06$0.06
58
966.74K

Code generation

Alibaba Cloud$0.03$0.09$0.06
50
833.333K

Speed & cost

ibm-granite$0.02$0.11$0.06
62
976.4131K

Open-source

Mistral AI$0.05$0.08$0.07
72
1107.733K

Open-source

Mistral AI$0.03$0.11$0.07
72
1028.6131K

Speed & cost

OpenAI$0.03$0.11$0.07
50
714.3131K

Open-source

Alibaba Cloud$0.04$0.10$0.07
50
714.333K

Speed & cost

liquid$0.03$0.12$0.07
50
666.733K

Open-source

Alibaba Cloud$0.03$0.13$0.08
50
615.4131K

Open-source

Google$0.04$0.13$0.09
74
870.6131K

Open-source

Alibaba Cloud$0.07$0.10$0.09
82
959.1262K

Speed & cost

Amazon$0.04$0.14$0.09
62
708.6128K

Open-source

Cohere$0.04$0.15$0.09
50
533.3128K

Speed & cost

arcee$0.04$0.15$0.10
50
512.8131K

Open-source

Alibaba Cloud$0.05$0.15$0.10
82
820.0256K

Speed & cost

nvidia$0.04$0.16$0.10
62
620.0131K

Speed & cost

rekaai$0.10$0.10$0.10
58
580.016K

Speed & cost

Mistral AI$0.10$0.10$0.10
58
580.0131K

Open-source

z-ai$0.10$0.10$0.10
58
580.0128K

Speed & cost

microsoft$0.07$0.14$0.10
65
634.116K

Open-source

Meta$0.03$0.20$0.11
50
440.560K

Speed & cost

OpenAI$0.04$0.19$0.11
50
436.7131K

Open-source

Google$0.08$0.16$0.12
74
616.7131K

Speed & cost

nvidia$0.05$0.20$0.13
62
496.0262K

Open-source

allenai$0.05$0.20$0.13
50
400.0128K

Open-source

Mistral AI$0.07$0.20$0.14
72
523.6128K

Search + citations

nousresearch$0.14$0.14$0.14
65
464.38K

Open-source

Alibaba Cloud$0.06$0.24$0.15
82
546.741K

Speed & cost

essentialai$0.15$0.15$0.15
58
386.733K

Speed & cost

Mistral AI$0.15$0.15$0.15
58
386.7262K

Speed & cost

Amazon$0.06$0.24$0.15
58
386.7300K

Open-source

Mistral AI$0.11$0.19$0.15
58
386.73K

Speed & cost

bytedance$0.10$0.20$0.15
58
386.7128K

Speed & cost

rekaai$0.10$0.20$0.15
58
386.766K

Open-source

Alibaba Cloud$0.08$0.24$0.16
82
512.541K

Speed & cost

Alibaba Cloud$0.07$0.26$0.16
82
504.61M

Code generation

Alibaba Cloud$0.07$0.27$0.17
82
482.4160K

Hard reasoning

baidu$0.07$0.28$0.18
58
331.4131K

Speed & cost

baidu$0.07$0.28$0.18
58
331.4120K

Speed & cost

arcee$0.18$0.18$0.18
58
322.2131K

Open-source

Meta$0.18$0.18$0.18
58
322.2164K

Open-source

Alibaba Cloud$0.08$0.28$0.18
82
455.641K

Speed & cost

Google$0.07$0.30$0.19
73
389.31M

Speed & cost

bytedance$0.07$0.30$0.19
58
309.3262K

Speed & cost

OpenAI$0.07$0.30$0.19
58
309.3131K

Speed & cost

Meta$0.08$0.30$0.19
74
389.5328K

Speed & cost

xiaomi$0.09$0.29$0.19
58
305.3262K

Open-source

Alibaba Cloud$0.09$0.30$0.20
82
420.5262K

Open-source

Meta$0.05$0.34$0.20
58
296.780K

Open-source

Mistral AI$0.10$0.30$0.20
72
360.033K

Speed & cost

stepfun$0.10$0.30$0.20
58
290.0262K

Speed & cost

Mistral AI$0.20$0.20$0.20
58
290.0262K

Open-source

Mistral AI$0.10$0.30$0.20
58
290.032K

Open-source

Mistral AI$0.10$0.30$0.20
58
290.0131K

Cheap-and-fast cascade tier

DeepSeek$0.14$0.28$0.21
85
404.81M

Open-source

Meta$0.10$0.32$0.21
74
352.4131K

Open-source

Alibaba Cloud$0.05$0.40$0.23
82
364.441K

Speed & cost

OpenAI$0.05$0.40$0.23
72
320.0400K

Speed & cost

z-ai$0.06$0.40$0.23
58
252.2203K

Hard reasoning

Alibaba Cloud$0.08$0.40$0.24
82
341.7131K

Speed & cost

Google$0.10$0.40$0.25
80
320.01M

Speed & cost

Google$0.10$0.40$0.25
80
320.01M

Open-source

nvidia$0.10$0.40$0.25
74
296.0131K

Speed & cost

Google$0.10$0.40$0.25
73
292.01M

Speed & cost

OpenAI$0.10$0.40$0.25
72
288.01M

Speed & cost

bytedance$0.10$0.40$0.25
58
232.0262K

Open-source

Alibaba Cloud$0.12$0.39$0.26
82
321.633K

Open-source

Alibaba Cloud$0.10$0.42$0.26
82
315.4131K

Open-source

Google$0.13$0.40$0.27
76
286.8262K

Search + citations

nousresearch$0.13$0.40$0.27
58
218.9131K

Open-source

Google$0.14$0.40$0.27
76
281.5262K

Search + citations

Alibaba Cloud$0.09$0.45$0.27
58
214.8131K

Open-source

Alibaba Cloud$0.14$0.41$0.27
58
212.5131K

Hard reasoning

DeepSeek$0.29$0.29$0.29
91
313.833K

Open-source

Alibaba Cloud$0.08$0.50$0.29
82
282.8131K

Search + citations

nousresearch$0.30$0.30$0.30
74
246.7131K

Speed & cost

nvidia$0.10$0.50$0.30
58
193.3262K

Speed & cost

thedrummer$0.17$0.43$0.30
58
193.333K

Open-source

nex-agi$0.14$0.50$0.32
86
270.9131K

Open-source

DeepSeek$0.26$0.38$0.32
87
271.9164K

Open-source

Alibaba Cloud$0.13$0.52$0.33
82
252.3131K

Hard reasoning

allenai$0.15$0.50$0.33
58
178.566K

Open-source

DeepSeek$0.27$0.41$0.34
86
252.9164K

Speed & cost

xAI$0.20$0.50$0.35
58
165.72M

Speed & cost

xAI$0.20$0.50$0.35
58
165.72M

Speed & cost

baidu$0.14$0.56$0.35
58
165.730K

Speed & cost

tencent$0.14$0.57$0.35
58
163.4131K

Hard reasoning

Alibaba Cloud$0.15$0.58$0.36
58
158.9131K

Open-source

Meta$0.15$0.60$0.38
82
218.71M

Search + citations

OpenAI$0.15$0.60$0.38
80
213.3128K

Speed & cost

OpenAI$0.15$0.60$0.38
80
213.3128K

Speed & cost

OpenAI$0.15$0.60$0.38
80
213.3128K

Open-source

Mistral AI$0.15$0.60$0.38
72
192.0262K

Speed & cost

upstage$0.15$0.60$0.38
58
154.7128K

Open-source

Cohere$0.15$0.60$0.38
58
154.7128K

Speed & cost

xAI$0.30$0.50$0.40
82
205.0131K

Speed & cost

xAI$0.30$0.50$0.40
82
205.0131K

Open-source

Meta$0.40$0.40$0.40
74
185.0131K

Speed & cost

thedrummer$0.40$0.40$0.40
66
165.033K

Speed & cost

nvidia$0.20$0.60$0.40
62
155.0131K

Open-source

allenai$0.20$0.60$0.40
58
145.066K

Speed & cost

thedrummer$0.30$0.50$0.40
58
145.0131K

Open-source

Alibaba Cloud$0.20$0.60$0.40
58
145.0128K

Open-source

Mistral AI$0.20$0.60$0.40
58
145.033K

Code generation

Alibaba Cloud$0.12$0.75$0.43
82
188.5262K

Hard reasoning

Alibaba Cloud$0.10$0.78$0.44
82
186.9131K

Open-source

DeepSeek$0.15$0.75$0.45
86
191.133K

Open-source

DeepSeek$0.20$0.77$0.48
86
177.3164K

Open-source

z-ai$0.13$0.85$0.49
58
118.4131K

Open-source

DeepSeek$0.21$0.79$0.50
86
172.0164K

Speed & cost

inception$0.25$0.75$0.50
58
116.0128K

Speed & cost

meituan$0.20$0.80$0.50
58
116.0131K

Speed & cost

inception$0.25$0.75$0.50
58
116.0128K

Code generation

inception$0.25$0.75$0.50
58
116.0128K

Hard reasoning

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Open-source

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Open-source

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Hard reasoning

arcee$0.22$0.85$0.54
58
108.4262K

Open-source

Alibaba Cloud$0.20$0.88$0.54
82
151.9262K

Open-source

Mistral AI$0.54$0.54$0.54
72
133.333K

Speed & cost

undi95$0.45$0.65$0.55
66
120.06K

Speed & cost

minimax$0.12$0.99$0.55
58
104.7197K

Code generation

Alibaba Cloud$0.20$0.97$0.58
82
140.21M

Open-source

Alibaba Cloud$0.09$1.10$0.60
82
137.8262K

Code generation

Mistral AI$0.30$0.90$0.60
78
130.0256K

Open-source

z-ai$0.30$0.90$0.60
58
96.7131K

Open-source

DeepSeek$0.32$0.89$0.60
86
142.1164K

Code generation

Alibaba Cloud$0.22$1.00$0.61
82
134.4262K

Speed & cost

minimax$0.27$0.95$0.61
58
95.1197K

Speed & cost

microsoft$0.62$0.62$0.62
62
100.066K

Open-source

Meta$0.51$0.74$0.63
66
105.68K

Speed & cost

minimax$0.26$1.00$0.63
58
92.4197K

Code generation

arcee$0.50$0.80$0.65
66
101.533K

Open-source

Google$0.65$0.65$0.65
65
100.08K

Speed & cost

prime-intellect$0.20$1.10$0.65
58
89.2131K

Speed & cost

minimax$0.20$1.10$0.65
58
89.21M

Speed & cost

thedrummer$0.55$0.80$0.68
66
97.833K

Speed & cost

baidu$0.28$1.10$0.69
58
84.1123K

Hard reasoning

sao10k$0.65$0.75$0.70
66
94.3131K

Hard reasoning

tngtech$0.30$1.10$0.70
91
130.0164K

Speed & cost

OpenAI$0.20$1.25$0.72
72
99.3400K

Open-source

Alibaba Cloud$0.16$1.30$0.73
82
112.1262K

Hard reasoning

Alibaba Cloud$0.12$1.36$0.74
82
110.7131K

Hard reasoning

DeepSeek$0.70$0.80$0.75
91
121.3131K

Speed & cost

Anthropic$0.25$1.25$0.75
72
96.0200K

Code generation

kwaipilot$0.30$1.20$0.75
58
77.3256K

Speed & cost

minimax$0.30$1.20$0.75
58
77.3205K

Speed & cost

minimax$0.30$1.20$0.75
58
77.366K

Open-source

DeepSeek$0.40$1.20$0.80
86
107.5164K

Open-source

Alibaba Cloud$0.80$0.80$0.80
66
82.533K

Hard reasoning

Alibaba Cloud$0.15$1.50$0.82
82
99.7131K

Code generation

Alibaba Cloud$0.66$1.00$0.83
66
79.533K

Speed & cost

baidu$0.42$1.25$0.83
66
79.0123K

Hard reasoning

Alibaba Cloud$0.13$1.56$0.84
82
97.0131K

Hard reasoning

sao10k$0.85$0.85$0.85
66
77.6131K

Speed & cost

xAI$0.20$1.50$0.85
58
68.2256K

Speed & cost

mancer$0.75$1.00$0.88
66
75.48K

Speed & cost

Google$0.25$1.50$0.88
62
70.91M

Open-source

Alibaba Cloud$0.20$1.56$0.88
82
93.4262K

Open-source

Alibaba Cloud$0.26$1.56$0.91
82
90.11M

Speed & cost

arcee$0.75$1.20$0.97
66
67.7131K

Open-source

Mistral AI$0.50$1.50$1.00
85
85.0262K

Speed & cost

OpenAI$0.40$1.60$1.00
80
80.01M

Speed & cost

morph$0.80$1.20$1.00
66
66.082K

Speed & cost

eleutherai$0.80$1.20$1.00
66
66.04K

Open-source

alfredpros$0.80$1.20$1.00
66
66.04K

Search + citations

Perplexity$1.00$1.00$1.00
66
66.0127K

Search + citations

nousresearch$1.00$1.00$1.00
66
66.0131K

Speed & cost

OpenAI$0.50$1.50$1.00
66
66.016K

Speed & cost

aion$0.70$1.40$1.05
66
62.9131K

Speed & cost

relace$0.85$1.25$1.05
66
62.9256K

Speed & cost

Moonshot AI$0.38$1.72$1.05
89
84.7262K

Open-source

z-ai$0.39$1.75$1.07
66
61.7203K

Speed & cost

OpenAI$0.25$2.00$1.13
83
73.8400K

Speed & cost

bytedance$0.25$2.00$1.13
58
51.6262K

Speed & cost

bytedance$0.25$2.00$1.13
58
51.6262K

Code generation

OpenAI$0.25$2.00$1.13
58
51.6400K

Open-source

Alibaba Cloud$0.46$1.82$1.14
82
72.1131K

Open-source

z-ai$0.39$1.90$1.15
66
57.6205K

Open-source

Alibaba Cloud$0.26$2.08$1.17
82
70.1262K

Open-source

nvidia$1.20$1.20$1.20
74
61.7131K

Speed & cost

xiaomi$0.40$2.00$1.20
66
55.0262K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0262K

Speed & cost

Moonshot AI$0.40$2.00$1.20
66
55.0131K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

z-ai$0.60$1.80$1.20
66
55.066K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

nvidia$0.60$1.80$1.20
66
55.0131K

Speed & cost

aion$0.80$1.60$1.20
66
55.0131K

Open-source

aion$0.80$1.60$1.20
65
54.233K

Hard reasoning

Moonshot AI$0.47$2.00$1.23
66
53.4131K

General purpose

deepcogito$1.25$1.25$1.25
74
59.2128K

Hard reasoning

DeepSeek$0.45$2.15$1.30
91
70.0164K

Speed & cost

minimax$0.40$2.20$1.30
66
50.81M

Open-source

Alibaba Cloud$0.52$2.08$1.30
66
50.8131K

Open-source

Alibaba Cloud$0.39$2.34$1.36
82
60.1262K

Image generation

Google$0.30$2.50$1.40
80
57.133K

Speed & cost

Google$0.30$2.50$1.40
80
57.11M

Speed & cost

morph$0.90$1.90$1.40
66
47.1262K

Speed & cost

Amazon$0.30$2.50$1.40
58
41.41M

Open-source

z-ai$0.60$2.20$1.40
66
47.1131K

Hard reasoning

Alibaba Cloud$0.26$2.60$1.43
82
57.3131K

Speed & cost

Moonshot AI$0.57$2.30$1.43
66
46.0131K

Hard reasoning

sao10k$1.48$1.48$1.48
74
50.08K

Speed & cost

OpenAI$0.60$2.40$1.50
66
44.0128K

Speed & cost

OpenAI$1.00$2.00$1.50
66
44.04K

Open-source

z-ai$0.72$2.30$1.51
88
58.380K

Hard reasoning

DeepSeek$0.70$2.50$1.60
91
56.964K

Speed & cost

Google$0.50$3.00$1.75
80
45.71M

General purpose

OpenAI$1.50$2.00$1.75
74
42.34K

Image generation

Google$0.50$3.00$1.75
66
37.766K

Agentic tasks & real-time info

xAI$1.25$2.50$1.88
94
50.11M

Code generation

Alibaba Cloud$0.65$3.25$1.95
82
42.11M

Speed & cost

xiaomi$1.00$3.00$2.00
66
33.01M

Search + citations

relace$1.00$3.00$2.00
66
33.0256K

Search + citations

nousresearch$1.00$3.00$2.00
66
33.0131K

Speed & cost

Amazon$0.80$3.20$2.00
66
33.0300K

Speed & cost

arcee$0.90$3.30$2.10
66
31.4131K

Speed & cost

switchpoint$0.85$3.40$2.13
66
31.1131K

Image generation

OpenAI$2.50$2.00$2.25
74
32.9400K

Hard reasoning

Alibaba Cloud$0.78$3.90$2.34
82
35.0262K

Open-source

Alibaba Cloud$0.78$3.90$2.34
82
35.0262K

Speed & cost

Anthropic$0.80$4.00$2.40
76
31.7200K

Frontier quality at low cost

Moonshot AI$0.95$4.00$2.48
93
37.6256K

Open-source

z-ai$1.20$4.00$2.60
74
28.5203K

Open-source

z-ai$1.20$4.00$2.60
74
28.5203K

Open-source

Alibaba Cloud$1.04$4.16$2.60
74
28.533K

Open-source value leader

DeepSeek$1.74$3.48$2.61
92
35.21M

Speed & cost

OpenAI$0.75$4.50$2.63
83
31.6400K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Speed & cost

Anthropic$1.00$5.00$3.00
76
25.3200K

Hard reasoning

sao10k$3.00$3.00$3.00
74
24.716K

Open-weight agentic & tool use

z-ai$1.55$4.65$3.10
90
29.0200K

Speed & cost

writer$0.60$6.00$3.30
66
20.01M

General purpose

OpenAI$3.00$4.00$3.50
74
21.116K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3131K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3131K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3128K

General purpose

xAI$2.00$6.00$4.00
74
18.52M

General purpose

xAI$2.00$6.00$4.00
93
23.32M

Open-source

Mistral AI$2.00$6.00$4.00
74
18.5131K

General purpose

anthracite-org$3.00$5.00$4.00
74
18.516K

Open-source

Mistral AI$2.00$6.00$4.00
72
18.066K

Deep research

OpenAI$2.00$8.00$5.00
96
19.2200K

Hard reasoning

OpenAI$2.00$8.00$5.00
92
18.4200K

General purpose

OpenAI$2.00$8.00$5.00
89
17.81M

General purpose

ai21$2.00$8.00$5.00
74
14.8256K

Search + citations

Perplexity$2.00$8.00$5.00
74
14.8128K

Deep research

Perplexity$2.00$8.00$5.00
74
14.8128K

Code generation

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
93
16.5128K

Code generation

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
90
16.0400K

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

General purpose

alpindale$3.75$7.50$5.63
82
14.66K

Code generation

OpenAI$1.25$10.00$5.63
74
13.2400K

General purpose

OpenAI$1.25$10.00$5.63
74
13.2128K

General purpose

aion$4.00$8.00$6.00
82
13.7131K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

Search + citations

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

Open-source

Cohere$2.50$10.00$6.25
84
13.4128K

General purpose

OpenAI$2.50$10.00$6.25
74
11.8128K

General purpose

Cohere$2.50$10.00$6.25
74
11.8256K

General purpose

inflection$2.50$10.00$6.25
74
11.88K

General purpose

inflection$2.50$10.00$6.25
74
11.88K

Speed & cost

Google$2.00$12.00$7.00
96
13.71M

Speed & cost

Google$2.00$12.00$7.00
96
13.71M

Image generation

Google$2.00$12.00$7.00
94
13.466K

General purpose

Amazon$2.50$12.50$7.50
74
9.91M

General purpose

OpenAI$1.75$14.00$7.88
93
11.8128K

Code generation

OpenAI$1.75$14.00$7.88
93
11.8400K

Code generation

OpenAI$1.75$14.00$7.88
93
11.8400K

General purpose

OpenAI$1.75$14.00$7.88
93
11.8128K

General purpose

OpenAI$1.75$14.00$7.88
93
11.8400K

General purpose

OpenAI$2.50$15.00$8.75
93
10.61M

General purpose

xAI$3.00$15.00$9.00
90
10.0131K

General purpose

xAI$3.00$15.00$9.00
90
10.0131K

General purpose

Anthropic$3.00$15.00$9.00
91
10.11M

General purpose

Anthropic$3.00$15.00$9.00
88
9.81M

General purpose

Anthropic$3.00$15.00$9.00
86
9.6200K

General purpose

Anthropic$3.00$15.00$9.00
86
9.6200K

Hard reasoning

Anthropic$3.00$15.00$9.00
86
9.6200K

Search + citations

Perplexity$3.00$15.00$9.00
74
8.2200K

General purpose

xAI$3.00$15.00$9.00
74
8.2256K

Search + citations

Perplexity$3.00$15.00$9.00
74
8.2200K

Multimodal

OpenAI$10.00$10.00$10.00
88
8.8400K

General purpose

OpenAI$5.00$15.00$10.00
88
8.8128K

Multimodal

OpenAI$6.00$18.00$12.00
88
7.3128K

Coding & agentic workflows

Anthropic$5.00$25.00$15.00
97
6.51M

General purpose

Anthropic$5.00$25.00$15.00
95
6.31M

General purpose

Anthropic$5.00$25.00$15.00
95
6.3200K

Frontier general purpose

OpenAI$5.00$30.00$17.50
98
5.61M

Multimodal

OpenAI$10.00$30.00$20.00
88
4.4128K

Complex analysis

OpenAI$10.00$30.00$20.00
88
4.4128K

Multimodal

OpenAI$10.00$30.00$20.00
88
4.4128K

Deep research

OpenAI$10.00$40.00$25.00
96
3.8200K

Hard reasoning

OpenAI$15.00$60.00$37.50
88
2.3200K

Multimodal

Anthropic$15.00$75.00$45.00
94
2.1200K

Multimodal

Anthropic$15.00$75.00$45.00
94
2.1200K

Complex analysis

OpenAI$30.00$60.00$45.00
93
2.18K

Multimodal

OpenAI$30.00$60.00$45.00
93
2.18K

Hard reasoning

OpenAI$20.00$80.00$50.00
96
1.9200K

Complex analysis

OpenAI$15.00$120.00$67.50
88
1.3400K

Complex analysis

OpenAI$21.00$168.00$94.50
97
1.0400K

Reasoning at any cost

OpenAI$30.00$180.00$105.00
99
0.91M

Complex analysis

OpenAI$30.00$180.00$105.00
97
0.91M

Hard reasoning

OpenAI$150.00$600.00$375.00
93
0.2200K
Blended = average of input and output price per 1M tokens
Quality = composite benchmark score (0-100)
Value = quality per blended dollar (higher is better)
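The two derived columns can be reproduced from the other three. The sketch below is an illustration, not the site's own code: the function names are hypothetical, and it assumes Value is Quality divided by the blended price before rounding. That assumption matches the rows checked here, including the ibm-granite row, whose $0.02 input price appears in the per-model breakdown as $0.017.

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Simple average of the input and output price per 1M tokens."""
    return (input_per_m + output_per_m) / 2


def value_score(quality: float, input_per_m: float, output_per_m: float) -> float:
    """Quality points per blended dollar (higher is better)."""
    return quality / blended_price(input_per_m, output_per_m)


# First table row: $0.02 in, $0.04 out, quality 72.
print(round(blended_price(0.02, 0.04), 2))    # 0.03
print(round(value_score(72, 0.02, 0.04), 1))  # 2400.0

# ibm-granite row, using the unrounded $0.017 input price.
print(round(value_score(62, 0.017, 0.11), 1))  # 976.4
```

Because the table rounds prices to two decimals, recomputing Value from the displayed prices can differ slightly from the listed figure.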

Estimate Your Monthly Cost

The costs below are projected over one month for the request shape shown, using current public list-price API rates.

Per month: 100K requests · 50.0M input tokens · 30.0M output tokens. Excludes prompt caching, batch discounts, retries, and fees.
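The projection is plain arithmetic on the request shape. In the minimal sketch below, the 500 input and 300 output tokens per request are derived from the totals above (50.0M / 100K and 30.0M / 100K). The function and field names are hypothetical, and like the page's estimate it ignores caching, batch discounts, retries, and fees.

```python
from dataclasses import dataclass


@dataclass
class MonthlyCost:
    per_request: float
    input_cost: float
    output_cost: float
    total: float


def monthly_cost(requests: int,
                 input_tokens_per_request: int,
                 output_tokens_per_request: int,
                 input_per_m: float,
                 output_per_m: float) -> MonthlyCost:
    """Project list-price spend for one month of traffic."""
    if requests <= 0:
        raise ValueError("requests must be positive")
    # Prices are quoted per 1M tokens, so convert token counts to millions.
    input_cost = requests * input_tokens_per_request / 1_000_000 * input_per_m
    output_cost = requests * output_tokens_per_request / 1_000_000 * output_per_m
    total = input_cost + output_cost
    return MonthlyCost(total / requests, input_cost, output_cost, total)


# Mistral Nemo at $0.02 in / $0.04 out per 1M tokens.
est = monthly_cost(100_000, 500, 300, 0.02, 0.04)
print(f"${est.total:.2f} per month, ${est.per_request:.6f} per request")
# $2.20 per month, $0.000022 per request
```

The same call with $150.00 in / $600.00 out per 1M gives the $25,500 figure quoted for the most expensive model.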

Cheapest

Mistral: Mistral Nemo

$2.20

per month at this volume

Best value (quality ≥ 80)

Qwen: Qwen3 235B A22B Instruct 2507 · Q 82

$6.55

per month at this volume
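The page does not say how the "best value" pick is made. One plausible reading, sketched below, is the cheapest monthly total among models that meet the quality floor. The `Model` type and `best_value` function are hypothetical. The quality scores for Mistral Nemo and o1-pro are inferred by matching their prices to table rows; only the Qwen score (82) is stated on the page.

```python
from typing import NamedTuple, Optional, Sequence


class Model(NamedTuple):
    name: str
    quality: int
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


def monthly_total(m: Model, input_m: float = 50.0, output_m: float = 30.0) -> float:
    """Monthly cost for a volume given in millions of tokens."""
    return input_m * m.input_per_m + output_m * m.output_per_m


def best_value(models: Sequence[Model], min_quality: int = 80) -> Optional[Model]:
    """Cheapest model at or above the quality floor (an assumed rule)."""
    eligible = [m for m in models if m.quality >= min_quality]
    return min(eligible, key=monthly_total) if eligible else None


MODELS = [
    Model("Mistral: Mistral Nemo", 72, 0.02, 0.04),        # quality inferred
    Model("Qwen: Qwen3 235B A22B Instruct 2507", 82, 0.071, 0.10),
    Model("OpenAI: o1-pro", 93, 150.00, 600.00),           # quality inferred
]

pick = best_value(MODELS)
print(pick.name, f"${monthly_total(pick):.2f}")
# Qwen: Qwen3 235B A22B Instruct 2507 $6.55
```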

Most expensive

OpenAI: o1-pro

$25,500

per month at this volume

Save 30-60% with Mixture-of-Routers

Most production traffic is mixed-difficulty. Send the easy 60% to a cheap model and the hard 10% to a frontier model, for the same quality at a fraction of the cost.
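A rough sketch of the routing arithmetic follows, with several assumptions stated up front. The 60% and 10% shares come from the text above, and the remaining 30% is assumed to go to a mid-priced model. The three price pairs are illustrative values that appear in the table, not a recommendation. Every tier is assumed to have the same request shape (50M input and 30M output tokens per month in total), which real traffic may not follow.

```python
def monthly_cost(input_m: float, output_m: float,
                 input_per_m: float, output_per_m: float) -> float:
    """Monthly cost for a volume given in millions of tokens."""
    return input_m * input_per_m + output_m * output_per_m


def routed_cost(input_m: float, output_m: float, tiers) -> float:
    """tiers: list of (traffic_share, input $/1M, output $/1M)."""
    total_share = sum(share for share, _, _ in tiers)
    if abs(total_share - 1.0) > 1e-9:
        raise ValueError("traffic shares must sum to 1")
    return sum(share * monthly_cost(input_m, output_m, p_in, p_out)
               for share, p_in, p_out in tiers)


TIERS = [
    (0.60, 0.10, 0.40),    # easy traffic -> cheap model
    (0.30, 1.25, 10.00),   # ASSUMPTION: middle traffic -> mid-priced model
    (0.10, 5.00, 25.00),   # hard traffic -> frontier model
]

baseline = monthly_cost(50, 30, 5.00, 25.00)  # everything on the frontier model
routed = routed_cost(50, 30, TIERS)
savings = 1 - routed / baseline
print(f"all-frontier ${baseline:.2f}, routed ${routed:.2f}, savings {savings:.0%}")
# all-frontier $1000.00, routed $218.95, savings 78%
```

The savings figure depends heavily on the baseline and the prices chosen. This example compares against sending everything to the frontier model, so it is not meant to reproduce the 30-60% range in the heading.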

Full breakdown by model

Sorted cheapest to most expensive

Model | Cost / request | Input cost / mo | Output cost / mo | Total / mo

Mistral: Mistral Nemo

$0.02 in / $0.04 out per 1M

$0.000022$1.00$1.20$2.20

Google: Gemma 3n 4B

$0.02 in / $0.04 out per 1M

$0.000022$1.00$1.20$2.20

Meta: Llama 3.1 8B Instruct

$0.02 in / $0.05 out per 1M

$0.000025$1.00$1.50$2.50

Meta: Llama 3 8B Instruct

$0.03 in / $0.04 out per 1M

$0.000027$1.50$1.20$2.70

Llama Guard 3 8B

$0.02 in / $0.06 out per 1M

$0.000028$1.00$1.80$2.80

Sao10K: Llama 3 8B Lunaris

$0.04 in / $0.05 out per 1M

$0.000035$2.00$1.50$3.50

Meta: Llama 3.2 11B Vision Instruct

$0.049 in / $0.049 out per 1M

$0.000039$2.45$1.47$3.92

IBM: Granite 4.0 Micro

$0.017 in / $0.11 out per 1M

$0.000042$0.85$3.30$4.15

Google: Gemma 2 9B

$0.03 in / $0.09 out per 1M

$0.000042$1.50$2.70$4.20

Qwen: Qwen2.5 Coder 7B Instruct

$0.03 in / $0.09 out per 1M

$0.000042$1.50$2.70$4.20

Google: Gemma 3 4B

$0.04 in / $0.08 out per 1M

$0.000044$2.00$2.40$4.40

Mistral: Mistral Small 3.1 24B

$0.03 in / $0.11 out per 1M

$0.000048$1.50$3.30$4.80

MythoMax 13B

$0.06 in / $0.06 out per 1M

$0.000048$3.00$1.80$4.80

OpenAI: gpt-oss-20b

$0.03 in / $0.11 out per 1M

$0.000048$1.50$3.30$4.80

Mistral: Mistral Small 3

$0.05 in / $0.08 out per 1M

$0.000049$2.50$2.40$4.90

Qwen: Qwen2.5 7B Instruct

$0.04 in / $0.1 out per 1M

$0.000050$2.00$3.00$5.00

LiquidAI: LFM2-24B-A2B

$0.03 in / $0.12 out per 1M

$0.000051$1.50$3.60$5.10

Qwen: Qwen-Turbo

$0.0325 in / $0.13 out per 1M

$0.000055$1.63$3.90$5.53

Google: Gemma 3 12B

$0.04 in / $0.13 out per 1M

$0.000059$2.00$3.90$5.90

Amazon: Nova Micro 1.0

$0.035 in / $0.14 out per 1M

$0.000060$1.75$4.20$5.95

Cohere: Command R7B (12-2024)

$0.0375 in / $0.15 out per 1M

$0.000064$1.88$4.50$6.38

Qwen: Qwen3 235B A22B Instruct 2507

$0.071 in / $0.1 out per 1M

$0.000065$3.55$3.00$6.55

Arcee AI: Trinity Mini

$0.045 in / $0.15 out per 1M

$0.000068$2.25$4.50$6.75

NVIDIA: Nemotron Nano 9B V2

$0.04 in / $0.16 out per 1M

$0.000068$2.00$4.80$6.80

Qwen: Qwen3.5-9B

$0.05 in / $0.15 out per 1M

$0.000070$2.50$4.50$7.00

Meta: Llama 3.2 1B Instruct

$0.027 in / $0.2 out per 1M

$0.000073$1.35$6.00$7.35

Microsoft: Phi 4

$0.065 in / $0.14 out per 1M

$0.000075$3.25$4.20$7.45

OpenAI: gpt-oss-120b

$0.039 in / $0.19 out per 1M

$0.000077$1.95$5.70$7.65

Reka Edge

$0.1 in / $0.1 out per 1M

$0.000080$5.00$3.00$8.00

Mistral: Ministral 3 3B 2512

$0.1 in / $0.1 out per 1M

$0.000080$5.00$3.00$8.00

Z.ai: GLM 4 32B

$0.1 in / $0.1 out per 1M

$0.000080$5.00$3.00$8.00

NVIDIA: Nemotron 3 Nano 30B A3B

$0.05 in / $0.2 out per 1M

$0.000085$2.50$6.00$8.50

AllenAI: Olmo 2 32B Instruct

$0.05 in / $0.2 out per 1M

$0.000085$2.50$6.00$8.50

Google: Gemma 3 27B

$0.08 in / $0.16 out per 1M

$0.000088$4.00$4.80$8.80

Mistral: Mistral Small 3.2 24B

$0.075 in / $0.2 out per 1M

$0.000097$3.75$6.00$9.75

Qwen: Qwen3 14B

$0.06 in / $0.24 out per 1M

$0.000102$3.00$7.20$10.20

Amazon: Nova Lite 1.0

$0.06 in / $0.24 out per 1M

$0.000102$3.00$7.20$10.20

ByteDance: UI-TARS 7B

$0.1 in / $0.2 out per 1M

$0.000110$5.00$6.00$11.00

Reka Flash 3

$0.1 in / $0.2 out per 1M

$0.000110$5.00$6.00$11.00

Qwen: Qwen3.5-Flash

$0.065 in / $0.26 out per 1M

$0.000111$3.25$7.80$11.05

Qwen: Qwen3 32B

$0.08 in / $0.24 out per 1M

$0.000112$4.00$7.20$11.20

Mistral: Mistral 7B Instruct v0.1

$0.11 in / $0.19 out per 1M

$0.000112$5.50$5.70$11.20

NousResearch: Hermes 2 Pro - Llama-3 8B

$0.14 in / $0.14 out per 1M

$0.000112$7.00$4.20$11.20

Qwen: Qwen3 Coder 30B A3B Instruct

$0.07 in / $0.27 out per 1M

$0.000116$3.50$8.10$11.60

Baidu: ERNIE 4.5 21B A3B Thinking

$0.07 in / $0.28 out per 1M

$0.000119$3.50$8.40$11.90

Baidu: ERNIE 4.5 21B A3B

$0.07 in / $0.28 out per 1M

$0.000119$3.50$8.40$11.90

EssentialAI: Rnj 1 Instruct

$0.15 in / $0.15 out per 1M

$0.000120$7.50$4.50$12.00

Mistral: Ministral 3 8B 2512

$0.15 in / $0.15 out per 1M

$0.000120$7.50$4.50$12.00

Qwen: Qwen3 30B A3B

$0.08 in / $0.28 out per 1M

$0.000124$4.00$8.40$12.40

Google: Gemini 2.0 Flash Lite

$0.075 in / $0.3 out per 1M

$0.000128$3.75$9.00$12.75

ByteDance Seed: Seed 1.6 Flash

$0.075 in / $0.3 out per 1M

$0.000128$3.75$9.00$12.75

OpenAI: gpt-oss-safeguard-20b

$0.075 in / $0.3 out per 1M

$0.000128$3.75$9.00$12.75

Meta: Llama 3.2 3B Instruct

$0.051 in / $0.34 out per 1M

$0.000128$2.55$10.20$12.75

Meta: Llama 4 Scout

$0.08 in / $0.3 out per 1M

$0.000130$4.00$9.00$13.00

Xiaomi: MiMo-V2-Flash

$0.09 in / $0.29 out per 1M

$0.000132$4.50$8.70$13.20

Qwen: Qwen3 30B A3B Instruct 2507

$0.09 in / $0.3 out per 1M

$0.000135$4.50$9.00$13.50

Mistral: Mistral Small Creative

$0.1 in / $0.3 out per 1M

$0.000140$5.00$9.00$14.00

StepFun: Step 3.5 Flash

$0.1 in / $0.3 out per 1M

$0.000140$5.00$9.00$14.00

Mistral: Voxtral Small 24B 2507

$0.1 in / $0.3 out per 1M

$0.000140$5.00$9.00$14.00

Mistral: Devstral Small 1.1

$0.1 in / $0.3 out per 1M

$0.000140$5.00$9.00$14.00

Arcee AI: Spotlight

$0.18 in / $0.18 out per 1M

$0.000144$9.00$5.40$14.40

Meta: Llama Guard 4 12B

$0.18 in / $0.18 out per 1M

$0.000144$9.00$5.40$14.40

Qwen: Qwen3 8B

$0.05 in / $0.4 out per 1M

$0.000145$2.50$12.00$14.50

OpenAI: GPT-5 Nano

$0.05 in / $0.4 out per 1M

$0.000145$2.50$12.00$14.50

Meta: Llama 3.3 70B Instruct

$0.1 in / $0.32 out per 1M

$0.000146$5.00$9.60$14.60

Z.ai: GLM 4.7 Flash

$0.06 in / $0.4 out per 1M

$0.000150$3.00$12.00$15.00

DeepSeek: DeepSeek V4 Flash

$0.14 in / $0.28 out per 1M

$0.000154$7.00$8.40$15.40

Qwen: Qwen3 30B A3B Thinking 2507

$0.08 in / $0.4 out per 1M

$0.000160$4.00$12.00$16.00

Mistral: Ministral 3 14B 2512

$0.2 in / $0.2 out per 1M

$0.000160$10.00$6.00$16.00

Google: Gemini 2.5 Flash Lite Preview 09-2025

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

Google: Gemini 2.5 Flash Lite

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

Google: Gemini 2.0 Flash

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

OpenAI: GPT-4.1 Nano

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

ByteDance Seed: Seed-2.0-Mini

$0.1 in / $0.4 out per 1M

$0.000170$5.00$12.00$17.00

Qwen: Qwen3 VL 32B Instruct

$0.104 in / $0.416 out per 1M

$0.000177$5.20$12.48$17.68

Qwen2.5 72B Instruct

$0.12 in / $0.39 out per 1M

$0.000177$6.00$11.70$17.70

Tongyi DeepResearch 30B A3B

$0.09 in / $0.45 out per 1M

$0.000180$4.50$13.50$18.00

Google: Gemma 4 26B A4B

$0.13 in / $0.4 out per 1M

$0.000185$6.50$12.00$18.50

Nous: Hermes 4 70B

$0.13 in / $0.4 out per 1M

$0.000185$6.50$12.00$18.50

Qwen: Qwen3 VL 8B Instruct

$0.08 in / $0.5 out per 1M

$0.000190$4.00$15.00$19.00

Google: Gemma 4 31B

$0.14 in / $0.4 out per 1M

$0.000190$7.00$12.00$19.00

Qwen: Qwen VL Plus

$0.1365 in / $0.4095 out per 1M

$0.000191$6.83$12.29$19.11

NVIDIA: Nemotron 3 Super

$0.1 in / $0.5 out per 1M

$0.000200$5.00$15.00$20.00

TheDrummer: Rocinante 12B

$0.17 in / $0.43 out per 1M

$0.000214$8.50$12.90$21.40

Nex AGI: DeepSeek V3.1 Nex N1

$0.135 in / $0.5 out per 1M

$0.000218$6.75$15.00$21.75

Qwen: Qwen3 VL 30B A3B Instruct

$0.13 in / $0.52 out per 1M

$0.000221$6.50$15.60$22.10

AllenAI: Olmo 3 32B Think

$0.15 in / $0.5 out per 1M

$0.000225$7.50$15.00$22.50

DeepSeek: R1 Distill Qwen 32B

$0.29 in / $0.29 out per 1M

$0.000232$14.50$8.70$23.20

Baidu: ERNIE 4.5 VL 28B A3B

$0.14 in / $0.56 out per 1M

$0.000238$7.00$16.80$23.80

Nous: Hermes 3 70B Instruct

$0.3 in / $0.3 out per 1M

$0.000240$15.00$9.00$24.00

Tencent: Hunyuan A13B Instruct

$0.14 in / $0.57 out per 1M

$0.000241$7.00$17.10$24.10

DeepSeek: DeepSeek V3.2

$0.26 in / $0.38 out per 1M

$0.000244$13.00$11.40$24.40

Qwen: QwQ 32B

$0.15 in / $0.58 out per 1M

$0.000249$7.50$17.40$24.90

xAI: Grok 4.1 Fast

$0.2 in / $0.5 out per 1M

$0.000250$10.00$15.00$25.00

xAI: Grok 4 Fast

$0.2 in / $0.5 out per 1M

$0.000250$10.00$15.00$25.00

Meta: Llama 4 Maverick

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

OpenAI: GPT-4o-mini Search Preview

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

OpenAI: GPT-4o-mini (2024-07-18)

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

OpenAI: GPT-4o-mini

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

Mistral: Mistral Small 4

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

Upstage: Solar Pro 3

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

Cohere: Command R (08-2024)

$0.15 in / $0.6 out per 1M

$0.000255$7.50$18.00$25.50

DeepSeek: DeepSeek V3.2 Exp

$0.27 in / $0.41 out per 1M

$0.000258$13.50$12.30$25.80

NVIDIA: Nemotron Nano 12B 2 VL

$0.2 in / $0.6 out per 1M

$0.000280$10.00$18.00$28.00

AllenAI: Olmo 3.1 32B Instruct

$0.2 in / $0.6 out per 1M

$0.000280$10.00$18.00$28.00

Qwen: Qwen2.5 VL 32B Instruct

$0.2 in / $0.6 out per 1M

$0.000280$10.00$18.00$28.00

Mistral: Saba

$0.2 in / $0.6 out per 1M

$0.000280$10.00$18.00$28.00

Qwen: Qwen3 Next 80B A3B Thinking

$0.0975 in / $0.78 out per 1M

$0.000283$4.88$23.40$28.28

Qwen: Qwen3 Coder Next

$0.12 in / $0.75 out per 1M

$0.000285$6.00$22.50$28.50

DeepSeek: DeepSeek V3.1

$0.15 in / $0.75 out per 1M

$0.000300$7.50$22.50$30.00

xAI: Grok 3 Mini

$0.3 in / $0.5 out per 1M

$0.000300$15.00$15.00$30.00

xAI: Grok 3 Mini Beta

$0.3 in / $0.5 out per 1M

$0.000300$15.00$15.00$30.00

TheDrummer: Cydonia 24B V4.1

$0.3 in / $0.5 out per 1M

$0.000300$15.00$15.00$30.00

Meta: Llama 3.1 70B Instruct

$0.4 in / $0.4 out per 1M

$0.000320$20.00$12.00$32.00

TheDrummer: UnslopNemo 12B

$0.4 in / $0.4 out per 1M

$0.000320$20.00$12.00$32.00

Z.ai: GLM 4.5 Air

$0.13 in / $0.85 out per 1M

$0.000320$6.50$25.50$32.00

DeepSeek: DeepSeek V3 0324

$0.2 in / $0.77 out per 1M

$0.000331$10.00$23.10$33.10

Meituan: LongCat Flash Chat

$0.2 in / $0.8 out per 1M

$0.000340$10.00$24.00$34.00

DeepSeek: DeepSeek V3.1 Terminus

$0.21 in / $0.79 out per 1M

$0.000342$10.50$23.70$34.20

Inception: Mercury 2

$0.25 in / $0.75 out per 1M

$0.000350$12.50$22.50$35.00

Inception: Mercury

$0.25 in / $0.75 out per 1M

$0.000350$12.50$22.50$35.00

Inception: Mercury Coder

$0.25 in / $0.75 out per 1M

$0.000350$12.50$22.50$35.00

MiniMax: MiniMax M2.5

$0.118 in / $0.99 out per 1M

$0.000356$5.90$29.70$35.60

Qwen: Qwen3 VL 235B A22B Instruct

$0.2 in / $0.88 out per 1M

$0.000364$10.00$26.40$36.40

Qwen: Qwen Plus 0728 (thinking)

$0.26 in / $0.78 out per 1M

$0.000364$13.00$23.40$36.40

Qwen: Qwen Plus 0728

$0.26 in / $0.78 out per 1M

$0.000364$13.00$23.40$36.40

Qwen: Qwen-Plus

$0.26 in / $0.78 out per 1M

$0.000364$13.00$23.40$36.40

Arcee AI: Trinity Large Thinking

$0.22 in / $0.85 out per 1M

$0.000365$11.00$25.50$36.50

Qwen: Qwen3 Next 80B A3B Instruct

$0.09 in / $1.1 out per 1M

$0.000375$4.50$33.00$37.50

Qwen: Qwen3 Coder Flash

$0.195 in / $0.975 out per 1M

$0.000390$9.75$29.25$39.00

Qwen: Qwen3 Coder 480B A35B

$0.22 in / $1 out per 1M

$0.000410$11.00$30.00$41.00

Mistral: Codestral 2508

$0.3 in / $0.9 out per 1M

$0.000420$15.00$27.00$42.00

ReMM SLERP 13B

$0.45 in / $0.65 out per 1M

$0.000420$22.50$19.50$42.00

MiniMax: MiniMax M2.1

$0.27 in / $0.95 out per 1M

$0.000420$13.50$28.50$42.00

Z.ai: GLM 4.6V

$0.3 in / $0.9 out per 1M

$0.000420$15.00$27.00$42.00

DeepSeek: DeepSeek V3

$0.32 in / $0.89 out per 1M

$0.000427$16.00$26.70$42.70

MiniMax: MiniMax M2

$0.255 in / $1 out per 1M

$0.000427$12.75$30.00$42.75

Prime Intellect: INTELLECT-3

$0.2 in / $1.1 out per 1M

$0.000430$10.00$33.00$43.00

MiniMax: MiniMax-01

$0.2 in / $1.1 out per 1M

$0.000430$10.00$33.00$43.00

Mistral: Mixtral 8x7B Instruct

$0.54 in / $0.54 out per 1M

$0.000432$27.00$16.20$43.20

Qwen: Qwen3 VL 8B Thinking

$0.117 in / $1.365 out per 1M

$0.000468$5.85$40.95$46.80

Baidu: ERNIE 4.5 300B A47B

$0.28 in / $1.1 out per 1M

$0.000470$14.00$33.00$47.00

Qwen: Qwen3.5-35B-A3B

$0.1625 in / $1.3 out per 1M

$0.000471$8.13$39.00$47.13

OpenAI: GPT-5.4 Nano

$0.2 in / $1.25 out per 1M

$0.000475$10.00$37.50$47.50

Meta: Llama 3 70B Instruct

$0.51 in / $0.74 out per 1M

$0.000477$25.50$22.20$47.70

TNG: DeepSeek R1T2 Chimera

$0.3 in / $1.1 out per 1M

$0.000480$15.00$33.00$48.00

Arcee AI: Coder Large

$0.5 in / $0.8 out per 1M

$0.000490$25.00$24.00$49.00

WizardLM-2 8x22B

$0.62 in / $0.62 out per 1M

$0.000496$31.00$18.60$49.60

Anthropic: Claude 3 Haiku

$0.25 in / $1.25 out per 1M

$0.000500$12.50$37.50$50.00

Kwaipilot: KAT-Coder-Pro V2

$0.3 in / $1.2 out per 1M

$0.000510$15.00$36.00$51.00

MiniMax: MiniMax M2.7

$0.3 in / $1.2 out per 1M

$0.000510$15.00$36.00$51.00

MiniMax: MiniMax M2-her

$0.3 in / $1.2 out per 1M

$0.000510$15.00$36.00$51.00

TheDrummer: Skyfall 36B V2

$0.55 in / $0.8 out per 1M

$0.000515$27.50$24.00$51.50

Google: Gemma 2 27B

$0.65 in / $0.65 out per 1M

$0.000520$32.50$19.50$52.00

Qwen: Qwen3 235B A22B Thinking 2507

$0.1495 in / $1.495 out per 1M

$0.000523$7.47$44.85$52.33

Qwen: Qwen3 VL 30B A3B Thinking

$0.13 in / $1.56 out per 1M

$0.000533$6.50$46.80$53.30

Sao10K: Llama 3.3 Euryale 70B

$0.65 in / $0.75 out per 1M

$0.000550$32.50$22.50$55.00

xAI: Grok Code Fast 1

$0.2 in / $1.5 out per 1M

$0.000550$10.00$45.00$55.00

DeepSeek: DeepSeek V3.2 Speciale

$0.4 in / $1.2 out per 1M

$0.000560$20.00$36.00$56.00

Qwen: Qwen3.5-27B

$0.195 in / $1.56 out per 1M

$0.000566$9.75$46.80$56.55

Google: Gemini 3.1 Flash Lite Preview

$0.25 in / $1.5 out per 1M

$0.000575$12.50$45.00$57.50

Baidu: ERNIE 4.5 VL 424B A47B

$0.42 in / $1.25 out per 1M

$0.000585$21.00$37.50$58.50

DeepSeek: R1 Distill Llama 70B

$0.7 in / $0.8 out per 1M

$0.000590$35.00$24.00$59.00

Qwen: Qwen3.5 Plus 2026-02-15

$0.26 in / $1.56 out per 1M

$0.000598$13.00$46.80$59.80

Qwen2.5 Coder 32B Instruct

$0.66 in / $1 out per 1M

$0.000630$33.00$30.00$63.00

Qwen: Qwen2.5 VL 72B Instruct

$0.8 in / $0.8 out per 1M

$0.000640$40.00$24.00$64.00

Mancer: Weaver (alpha)

$0.75 in / $1 out per 1M

$0.000675$37.50$30.00$67.50

OpenAI: GPT-4.1 Mini

$0.4 in / $1.6 out per 1M

$0.000680$20.00$48.00$68.00

Sao10K: Llama 3.1 Euryale 70B v2.2

$0.85 in / $0.85 out per 1M

$0.000680$42.50$25.50$68.00

Mistral: Mistral Large 3 2512

$0.5 in / $1.5 out per 1M

$0.000700$25.00$45.00$70.00

OpenAI: GPT-3.5 Turbo

$0.5 in / $1.5 out per 1M

$0.000700$25.00$45.00$70.00

MoonshotAI: Kimi K2.5

$0.3827 in / $1.72 out per 1M

$0.000707$19.13$51.60$70.73

Z.ai: GLM 4.7

$0.39 in / $1.75 out per 1M

$0.000720$19.50$52.50$72.00

OpenAI: GPT-5 Mini

$0.25 in / $2 out per 1M

$0.000725$12.50$60.00$72.50

ByteDance Seed: Seed-2.0-Lite

$0.25 in / $2 out per 1M

$0.000725$12.50$60.00$72.50

ByteDance Seed: Seed 1.6

$0.25 in / $2 out per 1M

$0.000725$12.50$60.00$72.50

OpenAI: GPT-5.1-Codex-Mini

$0.25 in / $2 out per 1M

$0.000725$12.50$60.00$72.50

Arcee AI: Virtuoso Large

$0.75 in / $1.2 out per 1M

$0.000735$37.50$36.00$73.50

Qwen: Qwen3.5-122B-A10B

$0.26 in / $2.08 out per 1M

$0.000754$13.00$62.40$75.40

Morph: Morph V3 Fast

$0.8 in / $1.2 out per 1M

$0.000760$40.00$36.00$76.00

EleutherAI: Llemma 7b

$0.8 in / $1.2 out per 1M

$0.000760$40.00$36.00$76.00

AlfredPros: CodeLLaMa 7B Instruct Solidity

$0.8 in / $1.2 out per 1M

$0.000760$40.00$36.00$76.00

Z.ai: GLM 4.6

$0.39 in / $1.9 out per 1M

$0.000765$19.50$57.00$76.50

AionLabs: Aion-1.0-Mini

$0.7 in / $1.4 out per 1M

$0.000770$35.00$42.00$77.00

Qwen: Qwen3 235B A22B

$0.455 in / $1.82 out per 1M

$0.000773$22.75$54.60$77.35

Xiaomi: MiMo-V2-Omni

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Mistral: Devstral 2 2512

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Relace: Relace Apply 3

$0.85 in / $1.25 out per 1M

$0.000800$42.50$37.50$80.00

MoonshotAI: Kimi K2 0905

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Mistral: Mistral Medium 3.1

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Mistral: Devstral Medium

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Mistral: Mistral Medium 3

$0.4 in / $2 out per 1M

$0.000800$20.00$60.00$80.00

Perplexity: Sonar

$1 in / $1 out per 1M

$0.000800$50.00$30.00$80.00

Nous: Hermes 3 405B Instruct

$1 in / $1 out per 1M

$0.000800$50.00$30.00$80.00

MoonshotAI: Kimi K2 Thinking

$0.47 in / $2 out per 1M

$0.000835$23.50$60.00$83.50

Z.ai: GLM 4.5V

$0.6 in / $1.8 out per 1M

$0.000840$30.00$54.00$84.00

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

$0.6 in / $1.8 out per 1M

$0.000840$30.00$54.00$84.00

MiniMax: MiniMax M1

$0.4 in / $2.2 out per 1M

$0.000860$20.00$66.00$86.00

DeepSeek: R1 0528

$0.45 in / $2.15 out per 1M

$0.000870$22.50$64.50$87.00

AionLabs: Aion-2.0

$0.8 in / $1.6 out per 1M

$0.000880$40.00$48.00$88.00

AionLabs: Aion-RP 1.0 (8B)

$0.8 in / $1.6 out per 1M

$0.000880$40.00$48.00$88.00

Qwen: Qwen VL Max

$0.52 in / $2.08 out per 1M

$0.000884$26.00$62.40$88.40

Qwen: Qwen3.5 397B A17B

$0.39 in / $2.34 out per 1M

$0.000897$19.50$70.20$89.70

Google: Nano Banana (Gemini 2.5 Flash Image)

$0.3 in / $2.5 out per 1M

$0.000900$15.00$75.00$90.00

Google: Gemini 2.5 Flash

$0.3 in / $2.5 out per 1M

$0.000900$15.00$75.00$90.00

Amazon: Nova 2 Lite

$0.3 in / $2.5 out per 1M

$0.000900$15.00$75.00$90.00

Qwen: Qwen3 VL 235B A22B Thinking

$0.26 in / $2.6 out per 1M

$0.000910$13.00$78.00$91.00

NVIDIA: Llama 3.1 Nemotron 70B Instruct

$1.2 in / $1.2 out per 1M

$0.000960$60.00$36.00$96.00

Z.ai: GLM 4.5

$0.6 in / $2.2 out per 1M

$0.000960$30.00$66.00$96.00

MoonshotAI: Kimi K2 0711

$0.57 in / $2.3 out per 1M

$0.000975$28.50$69.00$97.50

Deep Cogito: Cogito v2.1 671B

$1.25 in / $1.25 out per 1M

$0.001000$62.50$37.50$100.00

OpenAI: GPT Audio Mini

$0.6 in / $2.4 out per 1M

$0.001020$30.00$72.00$102.00

Morph: Morph V3 Large

$0.9 in / $1.9 out per 1M

$0.001020$45.00$57.00$102.00

Z.ai: GLM 5

$0.72 in / $2.3 out per 1M

$0.001050$36.00$69.00$105.00

DeepSeek: R1

$0.7 in / $2.5 out per 1M

$0.001100$35.00$75.00$110.00

OpenAI: GPT-3.5 Turbo (older v0613)

$1 in / $2 out per 1M

$0.001100$50.00$60.00$110.00

Google: Gemini 3 Flash Preview

$0.5 in / $3 out per 1M

$0.001150$25.00$90.00$115.00

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

$0.5 in / $3 out per 1M

$0.001150$25.00$90.00$115.00

Sao10k: Llama 3 Euryale 70B v2.1

$1.48 in / $1.48 out per 1M

$0.001184$74.00$44.40$118.40

Qwen: Qwen3 Coder Plus

$0.65 in / $3.25 out per 1M

$0.001300$32.50$97.50$130.00

OpenAI: GPT-3.5 Turbo Instruct

$1.5 in / $2 out per 1M

$0.001350$75.00$60.00$135.00

Amazon: Nova Pro 1.0

$0.8 in / $3.2 out per 1M

$0.001360$40.00$96.00$136.00

xAI: Grok 4.3

$1.25 in / $2.5 out per 1M

$0.001375$62.50$75.00$137.50

Xiaomi: MiMo-V2-Pro

$1 in / $3 out per 1M

$0.001400$50.00$90.00$140.00

Relace: Relace Search

$1 in / $3 out per 1M

$0.001400$50.00$90.00$140.00

Nous: Hermes 4 405B

$1 in / $3 out per 1M

$0.001400$50.00$90.00$140.00

Arcee AI: Maestro Reasoning

$0.9 in / $3.3 out per 1M

$0.001440$45.00$99.00$144.00

Switchpoint Router

$0.85 in / $3.4 out per 1M

$0.001445$42.50$102.00$144.50

Qwen: Qwen3 Max Thinking

$0.78 in / $3.9 out per 1M

$0.001560$39.00$117.00$156.00

Qwen: Qwen3 Max

$0.78 in / $3.9 out per 1M

$0.001560$39.00$117.00$156.00

Anthropic: Claude 3.5 Haiku

$0.8 in / $4 out per 1M

$0.001600$40.00$120.00$160.00

MoonshotAI: Kimi K2.6

$0.95 in / $4 out per 1M

$0.001675$47.50$120.00$167.50

OpenAI: GPT-5.4 Mini

$0.75 in / $4.5 out per 1M

$0.001725$37.50$135.00$172.50

Qwen: Qwen-Max

$1.04 in / $4.16 out per 1M

$0.001768$52.00$124.80$176.80

Z.ai: GLM 5V Turbo

$1.2 in / $4 out per 1M

$0.001800$60.00$120.00$180.00

Z.ai: GLM 5 Turbo

$1.2 in / $4 out per 1M

$0.001800$60.00$120.00$180.00

OpenAI: GPT-5 Image Mini

$2.5 in / $2 out per 1M

$0.001850$125.00$60.00$185.00

OpenAI: o4 Mini High

$1.1 in / $4.4 out per 1M

$0.001870$55.00$132.00$187.00

OpenAI: o4 Mini

$1.1 in / $4.4 out per 1M

$0.001870$55.00$132.00$187.00

OpenAI: o3 Mini High

$1.1 in / $4.4 out per 1M

$0.001870$55.00$132.00$187.00

OpenAI: o3 Mini

$1.1 in / $4.4 out per 1M

$0.001870$55.00$132.00$187.00

DeepSeek: DeepSeek V4 Pro

$1.74 in / $3.48 out per 1M

$0.001914$87.00$104.40$191.40

Anthropic: Claude Haiku 4.5

$1 in / $5 out per 1M

$0.002000$50.00$150.00$200.00

Writer: Palmyra X5

$0.6 in / $6 out per 1M

$0.002100$30.00$180.00$210.00

Z.ai: GLM 5.1

$1.55 in / $4.65 out per 1M

$0.002170$77.50$139.50$217.00

Sao10K: Llama 3.1 70B Hanami x1

$3 in / $3 out per 1M

$0.002400$150.00$90.00$240.00

OpenAI: GPT-3.5 Turbo 16k

$3 in / $4 out per 1M

$0.002700$150.00$120.00$270.00

Mistral Large 2411

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

Mistral Large 2407

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

Mistral Large

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

xAI: Grok 4.20 Multi-Agent

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

xAI: Grok 4.20

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

Mistral: Pixtral Large 2411

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

Mistral: Mixtral 8x22B Instruct

$2 in / $6 out per 1M

$0.002800$100.00$180.00$280.00

Magnum v4 72B

$3 in / $5 out per 1M

$0.003000$150.00$150.00$300.00

OpenAI: o4 Mini Deep Research

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

OpenAI: o3

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

OpenAI: GPT-4.1

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

AI21: Jamba Large 1.7

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

Perplexity: Sonar Reasoning Pro

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

Perplexity: Sonar Deep Research

$2 in / $8 out per 1M

$0.003400$100.00$240.00$340.00

OpenAI: GPT-5.1-Codex-Max

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5.1

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5.1 Chat

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5.1-Codex

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

Google: Gemini 2.5 Pro

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

Google: Gemini 2.5 Pro Preview 06-05

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

Google: Gemini 2.5 Pro Preview 05-06

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5 Codex

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

OpenAI: GPT-5 Chat

$1.25 in / $10 out per 1M

$0.003625$62.50$300.00$362.50

Goliath 120B

$3.75 in / $7.5 out per 1M

$0.004125$187.50$225.00$412.50

OpenAI: GPT-4o Audio

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

OpenAI: GPT-4o Search Preview

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

OpenAI: GPT-4o (2024-11-20)

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

OpenAI: GPT-4o (2024-08-06)

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

OpenAI: GPT-4o

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

Cohere: Command R+ (08-2024)

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

OpenAI: GPT Audio

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

Cohere: Command A

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

Inflection: Inflection 3 Pi

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

Inflection: Inflection 3 Productivity

$2.5 in / $10 out per 1M

$0.004250$125.00$300.00$425.00

AionLabs: Aion-1.0

$4 in / $8 out per 1M

$0.004400$200.00$240.00$440.00

Google: Gemini 3.1 Pro Preview Custom Tools

$2 in / $12 out per 1M

$0.004600$100.00$360.00$460.00

Google: Gemini 3.1 Pro Preview

$2 in / $12 out per 1M

$0.004600$100.00$360.00$460.00

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

$2 in / $12 out per 1M

$0.004600$100.00$360.00$460.00

Amazon: Nova Premier 1.0

$2.5 in / $12.5 out per 1M

$0.005000$125.00$375.00$500.00

OpenAI: GPT-5.3 Chat

$1.75 in / $14 out per 1M

$0.005075$87.50$420.00$507.50

OpenAI: GPT-5.3-Codex

$1.75 in / $14 out per 1M

$0.005075$87.50$420.00$507.50

OpenAI: GPT-5.2-Codex

$1.75 in / $14 out per 1M

$0.005075$87.50$420.00$507.50

OpenAI: GPT-5.2 Chat

$1.75 in / $14 out per 1M

$0.005075$87.50$420.00$507.50

OpenAI: GPT-5.2

$1.75 in / $14 out per 1M

$0.005075$87.50$420.00$507.50

OpenAI: GPT-5.4

$2.5 in / $15 out per 1M

$0.005750$125.00$450.00$575.00

xAI: Grok 3

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

xAI: Grok 3 Beta

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Anthropic: Claude Sonnet 4.6

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Anthropic: Claude Sonnet 4.5

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Anthropic: Claude Sonnet 4

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Anthropic: Claude 3.7 Sonnet

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Anthropic: Claude 3.7 Sonnet (thinking)

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Perplexity: Sonar Pro Search

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

xAI: Grok 4

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

Perplexity: Sonar Pro

$3 in / $15 out per 1M

$0.006000$150.00$450.00$600.00

OpenAI: GPT-4o (2024-05-13)

$5 in / $15 out per 1M

$0.007000$250.00$450.00$700.00

OpenAI: GPT-5 Image

$10 in / $10 out per 1M

$0.008000$500.00$300.00$800.00

OpenAI: GPT-4o (extended)

$6 in / $18 out per 1M

$0.008400$300.00$540.00$840.00

Anthropic: Claude Opus 4.7

$5 in / $25 out per 1M

$0.0100$250.00$750.00$1000.00

Anthropic: Claude Opus 4.6

$5 in / $25 out per 1M

$0.0100$250.00$750.00$1000.00

Anthropic: Claude Opus 4.5

$5 in / $25 out per 1M

$0.0100$250.00$750.00$1000.00

OpenAI: GPT-5.5

$5 in / $30 out per 1M

$0.0115$250.00$900.00$1150.00

OpenAI: GPT-4 Turbo

$10 in / $30 out per 1M

$0.0140$500.00$900.00$1400.00

OpenAI: GPT-4 Turbo Preview

$10 in / $30 out per 1M

$0.0140$500.00$900.00$1400.00

OpenAI: GPT-4 Turbo (older v1106)

$10 in / $30 out per 1M

$0.0140$500.00$900.00$1400.00

OpenAI: o3 Deep Research

$10 in / $40 out per 1M

$0.0170$500.00$1200.00$1700.00

OpenAI: o1

$15 in / $60 out per 1M

$0.0255$750.00$1800.00$2550.00

Anthropic: Claude Opus 4.1

$15 in / $75 out per 1M

$0.0300$750.00$2250.00$3000.00

Anthropic: Claude Opus 4

$15 in / $75 out per 1M

$0.0300$750.00$2250.00$3000.00

OpenAI: GPT-4 (older v0314)

$30 in / $60 out per 1M

$0.0330$1500.00$1800.00$3300.00

OpenAI: GPT-4

$30 in / $60 out per 1M

$0.0330$1500.00$1800.00$3300.00

OpenAI: o3 Pro

$20 in / $80 out per 1M

$0.0340$1000.00$2400.00$3400.00

OpenAI: GPT-5 Pro

$15 in / $120 out per 1M

$0.0435$750.00$3600.00$4350.00

OpenAI: GPT-5.2 Pro

$21 in / $168 out per 1M

$0.0609$1050.00$5040.00$6090.00

OpenAI: GPT-5.5 Pro

$30 in / $180 out per 1M

$0.0690$1500.00$5400.00$6900.00

OpenAI: GPT-5.4 Pro

$30 in / $180 out per 1M

$0.0690$1500.00$5400.00$6900.00

OpenAI: o1-pro

$150 in / $600 out per 1M

$0.2550$7500.00$18000.00$25500.00

List-price estimate. Real bills typically run 1.3-1.7x higher after retries, system-prompt re-sends, and tool-call round-trips. See per-million-tokens true cost for the adders.
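
The rows above are consistent with one fixed workload: 100,000 requests per month at 500 input and 300 output tokens each, which is 50M input and 30M output tokens. That workload is inferred from the figures rather than stated on the page, so treat it as an assumption. A minimal Python sketch of the arithmetic, with the 1.3-1.7x overhead range from the note applied on top (function names are illustrative):

```python
# Assumed workload, inferred from the table rows (not stated by the page).
REQUESTS_PER_MONTH = 100_000
INPUT_TOKENS_PER_REQUEST = 500
OUTPUT_TOKENS_PER_REQUEST = 300


def list_price_estimate(input_per_1m, output_per_1m,
                        requests=REQUESTS_PER_MONTH,
                        in_tokens=INPUT_TOKENS_PER_REQUEST,
                        out_tokens=OUTPUT_TOKENS_PER_REQUEST):
    """Return (per_request, monthly_input, monthly_output, monthly_total) in USD."""
    input_cost = input_per_1m * requests * in_tokens / 1_000_000
    output_cost = output_per_1m * requests * out_tokens / 1_000_000
    total = input_cost + output_cost
    return total / requests, input_cost, output_cost, total


def real_bill_range(monthly_total, low=1.3, high=1.7):
    """Apply the 1.3-1.7x overhead range quoted in the note above."""
    return monthly_total * low, monthly_total * high


# Baidu: ERNIE 4.5 300B A47B, $0.28 in / $1.1 out per 1M
per_request, input_cost, output_cost, total = list_price_estimate(0.28, 1.1)
print(f"${per_request:.6f} ${input_cost:.2f} ${output_cost:.2f} ${total:.2f}")

low, high = real_bill_range(total)
print(f"${low:.2f} to ${high:.2f}")
```

For the ERNIE row this reproduces the listed $0.000470, $14.00, $33.00 and $47.00, and the overhead range works out to roughly $61 to $80 per month.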

Understanding AI API Pricing in 2026

AI model pricing has changed dramatically. GPT-4 launched in March 2023 at $30 per million input tokens; flagship-class models now list for over 90% less (GPT-4.1 is $2 per million input tokens in the table above), driven by competition from Anthropic, Google, and open-source challengers like DeepSeek and Meta's Llama.

Even between mainstream models the spread is wide: Google's Gemini 2.0 Flash at $0.10/1M input tokens versus Claude Opus 4 at $15/1M input tokens is a 150x gap, and across all 326 tracked models the range reaches 8824x. The key insight is that price doesn't always correlate with quality: DeepSeek V3 earns a quality score of 86 at just $0.27/1M input tokens, while some premium models charge 50x more for marginal quality gains.
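
The Value column in the pricing table appears to be the quality score divided by the blended price, where the blended price is the mean of the input and output prices per 1M tokens. That formula is inferred from the published rows rather than documented, so the sketch below is an assumption:

```python
def blended_price(input_per_1m, output_per_1m):
    """Assumed blend: simple mean of input and output price per 1M tokens."""
    return (input_per_1m + output_per_1m) / 2


def value_score(quality, input_per_1m, output_per_1m):
    """Assumed value metric: quality points per blended dollar."""
    return quality / blended_price(input_per_1m, output_per_1m)


# (provider, input $/1M, output $/1M, quality) from the first rows of the table
rows = [
    ("Mistral AI", 0.02, 0.04, 72),
    ("Google", 0.02, 0.04, 50),
    ("Meta", 0.02, 0.05, 65),
]
for provider, inp, out, quality in rows:
    print(provider, round(value_score(quality, inp, out), 1))
```

These three rows reproduce the table's 2400.0, 1666.7 and 1857.1. Under this metric, two models at the same price can differ in value by more than 40% purely on quality.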

How to Optimize AI API Costs

The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
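
As a rough illustration of model routing, the sketch below sends short, simple prompts to a cheap model and everything else to a premium one, then estimates the monthly saving. The two list prices come from the table above; the routing rule, the keyword list, the 70% simple-traffic share and the workload (100,000 requests at 500 input and 300 output tokens) are all hypothetical. A production gateway would use a far better classifier.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens

    def cost(self, in_tokens, out_tokens):
        """List-price cost of one request in USD."""
        return (in_tokens * self.input_per_1m
                + out_tokens * self.output_per_1m) / 1_000_000


# List prices taken from the table above.
CHEAP = Model("OpenAI: GPT-4.1 Mini", 0.4, 1.6)
PREMIUM = Model("OpenAI: GPT-4.1", 2.0, 8.0)

# Hypothetical rule: long prompts or "hard" keywords go to the premium model.
HARD_HINTS = ("prove", "refactor", "analyze", "step by step")


def route(prompt, max_cheap_chars=400):
    text = prompt.lower()
    if len(prompt) > max_cheap_chars or any(hint in text for hint in HARD_HINTS):
        return PREMIUM
    return CHEAP


def monthly_cost(cheap_share, requests=100_000, in_tokens=500, out_tokens=300):
    """Monthly spend when `cheap_share` of requests go to the cheap model."""
    cheap_requests = requests * cheap_share
    premium_requests = requests - cheap_requests
    return (cheap_requests * CHEAP.cost(in_tokens, out_tokens)
            + premium_requests * PREMIUM.cost(in_tokens, out_tokens))


baseline = monthly_cost(0.0)  # everything on the premium model
routed = monthly_cost(0.7)    # assumption: 70% of traffic is simple
savings = 1 - routed / baseline

print(route("What is the capital of France?").name)
print(f"${baseline:.2f} -> ${routed:.2f} ({savings:.0%} saved)")
```

With these assumed numbers the bill drops from $340.00 to $149.60, a 56% saving. The result depends almost entirely on how much traffic can safely go to the cheap model.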

Other strategies include leveraging cached input pricing (offered by providers including Google and DeepSeek), batching requests to reduce per-call overhead, and self-hosting open-source models for predictable workloads.
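
To see how much cached input pricing can matter, the effective input price can be estimated as a weighted average of the list price and the cached price. The sketch below assumes billing is a simple blend by cache hit rate; the example prices and the 60% hit rate are hypothetical, and actual discount rules vary by provider.

```python
def effective_input_price(list_price, cached_price, cache_hit_rate):
    """Blend list and cached input prices (USD per 1M tokens) by cache hit rate."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cache_hit_rate * cached_price + (1 - cache_hit_rate) * list_price


# Hypothetical numbers: $1.00/1M list input price, a cached rate at 25% of
# list, and 60% of input tokens served from cache.
price = effective_input_price(list_price=1.00, cached_price=0.25, cache_hit_rate=0.6)
print(f"${price:.2f} per 1M input tokens")
```

Under these assumptions the effective input price falls from $1.00 to $0.55 per 1M tokens. Workloads that resend a long system prompt on every call benefit the most.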

Pricing Trends to Watch

  • Price compression continues: Expect another 50%+ reduction across flagship models by the end of 2026
  • Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
  • Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
  • Cached pricing: More providers are offering discounted rates for repeated context