AI Model Pricing Index
Compare API pricing across every major AI provider. Sortable table, historical trends, and an interactive cost calculator to estimate your monthly spend.
23
Models Tracked
11
Providers
$0.10
Cheapest Input
150x
Price Range
Full Pricing Table
| Model | Provider | Input / 1M | Output / 1M | Blended | Quality | Value | Context |
|---|---|---|---|---|---|---|---|
Fastest + cheapest | $0.10 | $0.40 | $0.25 | 74 | 296.0 | 1M | |
Longest context | Meta | $0.15 | $0.40 | $0.28 | 71 | 258.2 | 10M |
Open-source coding | Alibaba Cloud | $0.15 | $0.45 | $0.30 | 74 | 246.7 | 131K |
High throughput | OpenAI | $0.15 | $0.60 | $0.38 | 72 | 192.0 | 128K |
Open-source value | Meta | $0.20 | $0.60 | $0.40 | 80 | 200.0 | 1M |
Budget reasoning | xAI | $0.30 | $0.50 | $0.40 | 78 | 195.0 | 131K |
Code generation | Mistral AI | $0.30 | $0.90 | $0.60 | 76 | 126.7 | 256K |
Qwen 2.5 72BOSS Open-source flagship | Alibaba Cloud | $0.30 | $0.90 | $0.60 | 80 | 133.3 | 131K |
DeepSeek V3OSS Best open-source value | DeepSeek | $0.27 | $1.10 | $0.69 | 86 | 125.5 | 128K |
DeepSeek R1OSS Cheap reasoning | DeepSeek | $0.55 | $2.19 | $1.37 | 91 | 66.4 | 128K |
AWS ecosystem | Amazon | $0.80 | $3.20 | $2.00 | 70 | 35.0 | 300K |
Claude 3.5 Haiku-20% Speed & cost | Anthropic | $0.80 | $4.00 | $2.40 | 75 | 31.3 | 200K |
Reasoning & math | OpenAI | $1.10 | $4.40 | $2.75 | 88 | 32.0 | 200K |
Mistral Large 2-33% Multilingual | Mistral AI | $2.00 | $6.00 | $4.00 | 79 | 19.8 | 128K |
Long context | OpenAI | $2.00 | $8.00 | $5.00 | 89 | 17.8 | 1M |
Multimodal + value | $1.25 | $10.00 | $5.63 | 92 | 16.4 | 1M | |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 85 | 13.6 | 128K |
Enterprise RAG | Cohere | $2.50 | $10.00 | $6.25 | 68 | 10.9 | 128K |
Coding & balance | Anthropic | $3.00 | $15.00 | $9.00 | 88 | 9.8 | 200K |
Real-time info | xAI | $3.00 | $15.00 | $9.00 | 87 | 9.7 | 131K |
Search + citations | Perplexity | $3.00 | $15.00 | $9.00 | 78 | 8.7 | 200K |
Hard reasoning | OpenAI | $10.00 | $40.00 | $25.00 | 96 | 3.8 | 200K |
Complex analysis | Anthropic | $15.00 | $75.00 | $45.00 | 95 | 2.1 | 200K |
Estimate Your Monthly Cost
Cost Calculator
Cheapest
$17.00/mo
Gemini 2.0 Flash
Best Value
$28.00/mo
Llama 4 Maverick (quality >= 80)
Most Expensive
$3.0K/mo
Claude Opus 4
Save 30-60% with smart model routing
Swfte Connect automatically routes each request to the optimal model based on complexity, reducing costs without sacrificing quality.
All Models — Estimated Monthly Cost
Recent Price Changes
Claude 3.5 Haiku
Feb 1, 2025
$0.8 / $4
Mistral Large 2
Nov 18, 2024
$2 / $6
GPT-4o
Oct 1, 2024
$2.5 / $10
Command R+
Aug 15, 2024
$2.5 / $10
Understanding AI API Pricing in 2026
AI model pricing has undergone a dramatic transformation. Since GPT-4 launched in March 2023 at $30 per million input tokens, prices have fallen by over 90% — driven by competition from Anthropic, Google, and open-source challengers like DeepSeek and Meta's Llama.
Today's pricing landscape spans a 150x range: from Google's Gemini 2.0 Flash at $0.10/1M input tokens to Claude Opus 4 at $15/1M tokens. The key insight is that price doesn't always correlate with quality — DeepSeek V3 delivers 86% quality at just $0.27/1M tokens, while some premium models charge 50x more for marginal quality gains.
How to Optimize AI API Costs
The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
Other strategies include: leveraging cached input pricing (offered by Google and DeepSeek), batching requests to reduce per-call overhead, and using open-source models for predictable workloads where you can self-host.
Pricing Trends to Watch
- Price compression continues: Expect another 50%+ reduction across flagship models by end of 2026
- Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
- Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
- Cached pricing: More providers offering discounted rates for repeated context