Updated Apr 6, 2026

AI Model Pricing Index

Compare API pricing across every major AI provider. Sortable table, historical trends, and an interactive cost calculator to estimate your monthly spend.

Models tracked: 23
Providers: 11
Cheapest input: $0.10 / 1M tokens
Price range: 150x

Full Pricing Table

23 models
| Model | Provider | Input / 1M | Output / 1M | Blended | Quality | Value | Context |
|---|---|---|---|---|---|---|---|
| Gemini 2.0 Flash (fastest + cheapest) | Google | $0.10 | $0.40 | $0.25 | 74 | 296.0 | 1M |
| Llama 4 Scout (longest context) | Meta | $0.15 | $0.40 | $0.28 | 71 | 258.2 | 10M |
| Qwen 2.5 Coder 32B (open-source coding) | Alibaba Cloud | $0.15 | $0.45 | $0.30 | 74 | 246.7 | 131K |
| GPT-4o Mini (high throughput) | OpenAI | $0.15 | $0.60 | $0.38 | 72 | 192.0 | 128K |
| Llama 4 Maverick (open-source value) | Meta | $0.20 | $0.60 | $0.40 | 80 | 200.0 | 1M |
| Grok 3 Mini (budget reasoning) | xAI | $0.30 | $0.50 | $0.40 | 78 | 195.0 | 131K |
| Codestral (code generation) | Mistral AI | $0.30 | $0.90 | $0.60 | 76 | 126.7 | 256K |
| Qwen 2.5 72B (open-source flagship) | Alibaba Cloud | $0.30 | $0.90 | $0.60 | 80 | 133.3 | 131K |
| DeepSeek V3 (best open-source value) | DeepSeek | $0.27 | $1.10 | $0.69 | 86 | 125.5 | 128K |
| DeepSeek R1 (cheap reasoning) | DeepSeek | $0.55 | $2.19 | $1.37 | 91 | 66.4 | 128K |
| Amazon Nova Pro (AWS ecosystem) | Amazon | $0.80 | $3.20 | $2.00 | 70 | 35.0 | 300K |
| Claude 3.5 Haiku (speed & cost) | Anthropic | $0.80 | $4.00 | $2.40 | 75 | 31.3 | 200K |
| o3 Mini (reasoning & math) | OpenAI | $1.10 | $4.40 | $2.75 | 88 | 32.0 | 200K |
| Mistral Large 2 (multilingual) | Mistral AI | $2.00 | $6.00 | $4.00 | 79 | 19.8 | 128K |
| GPT-4.1 (long context) | OpenAI | $2.00 | $8.00 | $5.00 | 89 | 17.8 | 1M |
| Gemini 2.5 Pro (multimodal + value) | Google | $1.25 | $10.00 | $5.63 | 92 | 16.4 | 1M |
| GPT-4o (general purpose) | OpenAI | $2.50 | $10.00 | $6.25 | 85 | 13.6 | 128K |
| Command R+ (enterprise RAG) | Cohere | $2.50 | $10.00 | $6.25 | 68 | 10.9 | 128K |
| Claude Sonnet 4 (coding & balance) | Anthropic | $3.00 | $15.00 | $9.00 | 88 | 9.8 | 200K |
| Grok 3 (real-time info) | xAI | $3.00 | $15.00 | $9.00 | 87 | 9.7 | 131K |
| Sonar Pro (search + citations) | Perplexity | $3.00 | $15.00 | $9.00 | 78 | 8.7 | 200K |
| o3 (hard reasoning) | OpenAI | $10.00 | $40.00 | $25.00 | 96 | 3.8 | 200K |
| Claude Opus 4 (complex analysis) | Anthropic | $15.00 | $75.00 | $45.00 | 95 | 2.1 | 200K |

Blended = average of input and output price per 1M tokens. Quality = composite benchmark score (0-100). Value = quality per blended dollar (higher is better).
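The derived columns follow directly from the listed prices. A minimal sketch of how Blended and Value are computed, using three rows from the table above:

```python
# Recompute the table's derived columns from the raw prices.
# Blended = average of input and output $/1M; Value = quality / blended.
def derived(input_price, output_price, quality):
    """Return (blended price, value score) as defined in the footnotes."""
    blended = (input_price + output_price) / 2
    value = quality / blended
    return blended, value

# Three rows from the table above.
for name, inp, out, q in [
    ("Gemini 2.0 Flash", 0.10, 0.40, 74),
    ("DeepSeek V3", 0.27, 1.10, 86),
    ("Claude Opus 4", 15.00, 75.00, 95),
]:
    blended, value = derived(inp, out, q)
    print(f"{name}: blended ${blended:.3f}/1M, value {value:.1f}")
```

The output reproduces the table's Blended and Value columns to the displayed precision.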

Estimate Your Monthly Cost

Cost Calculator

  • Cheapest: Gemini 2.0 Flash, $17.00/mo
  • Best Value: Llama 4 Maverick (quality >= 80), $28.00/mo
  • Most Expensive: Claude Opus 4, $3.0K/mo

Save 30-60% with smart model routing

Swfte Connect automatically routes each request to the optimal model based on complexity, reducing costs without sacrificing quality.


All Models — Estimated Monthly Cost

  • Gemini 2.0 Flash: $17.00/mo
  • Llama 4 Scout: $19.50/mo
  • Qwen 2.5 Coder 32B: $21.00/mo
  • GPT-4o Mini: $25.50/mo
  • Llama 4 Maverick: $28.00/mo
  • Grok 3 Mini: $30.00/mo
  • Codestral: $42.00/mo
  • Qwen 2.5 72B: $42.00/mo
  • DeepSeek V3: $46.50/mo
  • DeepSeek R1: $93.20/mo
  • Amazon Nova Pro: $136.00/mo
  • Claude 3.5 Haiku: $160.00/mo
  • o3 Mini: $187.00/mo
  • Mistral Large 2: $280.00/mo
  • GPT-4.1: $340.00/mo
  • Gemini 2.5 Pro: $362.50/mo
  • GPT-4o: $425.00/mo
  • Command R+: $425.00/mo
  • Claude Sonnet 4: $600.00/mo
  • Grok 3: $600.00/mo
  • Sonar Pro: $600.00/mo
  • o3: $1.7K/mo
  • Claude Opus 4: $3.0K/mo
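The monthly figures above are consistent with one fixed default workload: working backward from any two rows gives 50M input and 30M output tokens per month. That split is an inference from the numbers, not a documented calculator setting. A sketch of the estimate:

```python
# Estimate monthly spend from per-token prices and a token budget.
# The 50M/30M split is inferred from the figures above, not a
# documented default of the calculator.
INPUT_TOKENS_M = 50   # millions of input tokens per month (assumed)
OUTPUT_TOKENS_M = 30  # millions of output tokens per month (assumed)

def monthly_cost(input_price, output_price,
                 in_m=INPUT_TOKENS_M, out_m=OUTPUT_TOKENS_M):
    """Dollars per month for prices quoted in $ per 1M tokens."""
    return input_price * in_m + output_price * out_m

# Prices from the pricing table above.
print(round(monthly_cost(0.10, 0.40), 2))    # Gemini 2.0 Flash
print(round(monthly_cost(0.55, 2.19), 2))    # DeepSeek R1
print(round(monthly_cost(15.00, 75.00), 2))  # Claude Opus 4
```

These evaluate to $17.00, $93.20, and $3,000 per month, matching the list above.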

Recent Price Changes

| Model | Date | New Price (input / output per 1M) | Change |
|---|---|---|---|
| Claude 3.5 Haiku | Feb 1, 2025 | $0.80 / $4.00 | -20% |
| Mistral Large 2 | Nov 18, 2024 | $2.00 / $6.00 | -33% |
| GPT-4o | Oct 1, 2024 | $2.50 / $10.00 | -37% |
| Command R+ | Aug 15, 2024 | $2.50 / $10.00 | -31% |
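The change figures appear to be the drop in blended price. For example, GPT-4o's widely reported prior pricing was $5 / $15 per 1M tokens (an assumption here, not stated on this page); the October 2024 cut to $2.50 / $10 works out as follows:

```python
# Percent change in blended price between two price points.
def blended(inp, out):
    return (inp + out) / 2

def pct_change(old_in, old_out, new_in, new_out):
    """Percent change in blended price (negative = price cut)."""
    old, new = blended(old_in, old_out), blended(new_in, new_out)
    return (new - old) / old * 100

# GPT-4o: assumed prior pricing $5/$15, cut to $2.50/$10 (table above).
print(pct_change(5.00, 15.00, 2.50, 10.00))  # -37.5
```

That is the -37% shown above, truncated to a whole percent.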

Understanding AI API Pricing in 2026

AI model pricing has undergone a dramatic transformation. Since GPT-4 launched in March 2023 at $30 per million input tokens, prices have fallen by over 90% — driven by competition from Anthropic, Google, and open-source challengers like DeepSeek and Meta's Llama.

Today's pricing landscape spans a 150x range on input price: from Google's Gemini 2.0 Flash at $0.10/1M input tokens to Claude Opus 4 at $15.00/1M. The key insight is that price doesn't always correlate with quality: DeepSeek V3 scores 86 on the composite quality benchmark at just $0.27/1M input tokens, while some premium models charge over 50x more for marginal quality gains.

How to Optimize AI API Costs

The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
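A toy illustration of the routing idea, not Swfte Connect's actual logic: classify each request with a crude complexity heuristic and pick a price tier accordingly. The model pair, keyword list, and length threshold below are invented for this sketch; production routers typically use trained classifiers.

```python
# Toy model router: cheap model for short/simple prompts, premium for
# complex ones. Heuristic and thresholds are illustrative only.
CHEAP = ("gemini-2.0-flash", 0.25)   # blended $/1M, from the table above
PREMIUM = ("claude-sonnet-4", 9.00)

REASONING_HINTS = ("prove", "derive", "step by step", "analyze")

def route(prompt: str) -> tuple[str, float]:
    """Return (model, blended price) for a request. Heuristic only."""
    looks_complex = (
        len(prompt) > 500
        or any(hint in prompt.lower() for hint in REASONING_HINTS)
    )
    return PREMIUM if looks_complex else CHEAP

model, _ = route("Translate 'hello' to French.")
print(model)  # gemini-2.0-flash
model, _ = route("Prove that the sum of two even numbers is even.")
print(model)  # claude-sonnet-4
```

With the blended prices above, every request the heuristic diverts to the cheap tier costs roughly 36x less, which is where the claimed 30-60% savings would come from on a mixed workload.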

Other strategies include: leveraging cached input pricing (offered by Google and DeepSeek), batching requests to reduce per-call overhead, and using open-source models for predictable workloads where you can self-host.
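A back-of-envelope for the cached-input strategy, using a hypothetical 75% cache discount and 60% hit rate; real discounts and eligibility rules vary by provider and are not listed on this page.

```python
# Effect of cached-input pricing on monthly input spend.
# Discount and hit rate are hypothetical, for illustration only.
def input_cost(tokens_m, price, cache_discount=0.0, hit_rate=0.0):
    """Input cost when hit_rate of tokens get the cached discount."""
    cached = tokens_m * hit_rate * price * (1 - cache_discount)
    fresh = tokens_m * (1 - hit_rate) * price
    return cached + fresh

# 50M input tokens at DeepSeek V3's $0.27/1M (price from table above):
base = input_cost(50, 0.27)
with_cache = input_cost(50, 0.27, cache_discount=0.75, hit_rate=0.6)
print(round(base, 2), round(with_cache, 2))
```

Under these assumed numbers, input spend falls from $13.50 to about $7.43 per month, a 45% reduction on the input side alone.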

Pricing Trends to Watch

  • Price compression continues: Expect another 50%+ reduction across flagship models by end of 2026
  • Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
  • Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
  • Cached pricing: More providers offering discounted rates for repeated context